Jing Liu

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China


According to our database1, Jing Liu authored at least 188 papers between 2006 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Reparameterizing and dynamically quantizing image features for image generation.
Pattern Recognit., February, 2024

Temporal Action Proposal Generation With Action Frequency Adaptive Network.
IEEE Trans. Multim., 2024

Sounding Video Generator: A Unified Framework for Text-Guided Sounding Video Generation.
IEEE Trans. Multim., 2024

2023
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering.
IEEE Trans. Neural Networks Learn. Syst., March, 2023

Attention-based multi-modal fusion sarcasm detection.
J. Intell. Fuzzy Syst., 2023

Anchor-free temporal action localization via Progressive Boundary-aware Boosting.
Inf. Process. Manag., 2023

GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER.
CoRR, 2023

VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset.
CoRR, 2023

ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst.
CoRR, 2023

VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset.
CoRR, 2023

MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

From Pixels to Explanations: Uncovering the Reasoning Process in Visual Question Answering.
Proceedings of the ACM Multimedia Asia 2023, 2023

Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation.
Proceedings of the International Joint Conference on Neural Networks, 2023

CSDNet: Contrastive Similarity Distillation Network for Multi-lingual Image-Text Retrieval.
Proceedings of the Image and Graphics - 12th International Conference, 2023

WL-MSR: Watch and Listen for Multimodal Subtitle Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

MOSO: Decomposing MOtion, Scene and Object for Video Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Global-Guided Selective Context Network for Scene Parsing.
IEEE Trans. Neural Networks Learn. Syst., 2022

Semi-Supervised Temporal Action Proposal Generation via Exploiting 2-D Proposal Map.
IEEE Trans. Multim., 2022

An Efficient Sampling-Based Attention Network for Semantic Segmentation.
IEEE Trans. Image Process., 2022

Super-resolution semantic segmentation with relation calibrating network.
Pattern Recognit., 2022

MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning.
CoRR, 2022

2021
Scene Segmentation With Dual Relation-Aware Attention Network.
IEEE Trans. Neural Networks Learn. Syst., 2021

Visual Question Answering With Dense Inter- and Intra-Modality Interactions.
IEEE Trans. Multim., 2021

Exploiting Spatial-Temporal Semantic Consistency for Video Scene Parsing.
CoRR, 2021

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation.
CoRR, 2021

AAformer: Auto-Aligned Transformer for Person Re-Identification.
CoRR, 2021

CPTR: Full Transformer Network for Image Captioning.
CoRR, 2021

Global-Local Propagation Network for RGB-D Semantic Segmentation.
CoRR, 2021

Fast Sequence Generation with Multi-Agent Reinforcement Learning.
CoRR, 2021

Dynamic Warping Network for Semantic Video Segmentation.
Complex., 2021

MM21 Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Temporal Memory Attention for Video Semantic Segmentation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Keypoint Context Aggregation for Human Pose Estimation.
Proceedings of the Image and Graphics - 11th International Conference, 2021

HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Consistent-Separable Feature Representation for Semantic Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Show, Tell, and Polish: Ruminant Decoding for Image Captioning.
IEEE Trans. Multim., 2020

Contextual deconvolution network for semantic segmentation.
Pattern Recognit., 2020

AutoCaption: Image Captioning with Neural Architecture Search.
CoRR, 2020

Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Modeling Local and Global Contexts for Image Captioning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Rankvqa: Answer Re-Ranking For Visual Question Answering.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Point Set Attention Network For Semantic Segmentation.
Proceedings of the IEEE International Conference on Image Processing, 2020

Normalized and Geometry-Aware Self-Attention Network for Image Captioning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
BTDP: Toward Sparse Fusion with Block Term Decomposition Pooling for Visual Question Answering.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Improving visual question answering using dropout and enhanced question encoder.
Pattern Recognit., 2019

Multi-View Features and Hybrid Reward Strategies for Vatex Video Captioning Challenge 2019.
CoRR, 2019

Attention-Guided Network for Semantic Video Segmentation.
IEEE Access, 2019

Erasing-based Attention Learning for Visual Question Answering.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Aligning Linguistic Words and Visual Semantic Units for Image Captioning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Densely Connected Attention Flow for Visual Question Answering.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Language and Visual Relations Encoding for Visual Question Answering.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Adaptive Context Network for Scene Parsing.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

MSCap: Multi-Style Image Captioning With Unpaired Stylized Text.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dual Attention Network for Scene Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Collaborative Deconvolutional Neural Networks for Joint Depth Estimation and Semantic Segmentation.
IEEE Trans. Neural Networks Learn. Syst., 2018

Image captioning with triple-attention and stack parallel LSTM.
Neurocomputing, 2018

Dual Attention Network for Scene Segmentation.
CoRR, 2018

Enhancing Visual Question Answering Using Dropout.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Improving Residual Block for Semantic Image Segmentation.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Answer Distillation for Visual Question Answering.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Fine-Grained Image Classification via Low-Rank Sparse Coding With General and Class-Specific Codebooks.
IEEE Trans. Neural Networks Learn. Syst., 2017

Hierarchically Supervised Deconvolutional Network for Semantic Video Segmentation.
Pattern Recognit., 2017

Stacked Deconvolutional Network for Semantic Segmentation.
CoRR, 2017

Sketch-based Image Retrieval using Generative Adversarial Networks.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Densely connected deconvolutional network for semantic segmentation.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Same-Style Products Mining for Clothes Retrieval.
Proceedings of the Internet Multimedia Computing and Service, 2017

2016
Domain-Sensitive Recommendation with User-Item Subgroup Analysis.
IEEE Trans. Knowl. Data Eng., 2016

Multimedia News Summarization in Search.
ACM Trans. Intell. Syst. Technol., 2016

Chat with illustration.
Multim. Syst., 2016

Object co-segmentation via salient and common regions discovery.
Neurocomputing, 2016

Objectness-aware Semantic Segmentation.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Object-aware Deep Network for Commodity Image Retrieval.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

2015
Partially Shared Latent Factor Learning With Multiview Data.
IEEE Trans. Neural Networks Learn. Syst., 2015

Ordinal Distance Metric Learning for Image Ranking.
IEEE Trans. Neural Networks Learn. Syst., 2015

Beyond Explicit Codebook Generation: Visual Representation Using Implicitly Transferred Codebooks.
IEEE Trans. Image Process., 2015

Human Age Estimation Based on Locality and Ordinal Information.
IEEE Trans. Cybern., 2015

Detection guided deconvolutional network for hierarchical feature learning.
Pattern Recognit., 2015

Robust Structured Subspace Learning for Data Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Boosted MIML method for weakly-supervised image semantic segmentation.
Multim. Tools Appl., 2015

Image classification using boosted local features with random orientation and location selection.
Inf. Sci., 2015

Joint image representation and classification in random semantic spaces.
Neurocomputing, 2015

Automatic face annotation in TV series by video/script alignment.
Neurocomputing, 2015

Learning representative and discriminative image representation by deep appearance and spatial coding.
Comput. Vis. Image Underst., 2015

Exclusive Constrained Discriminative Learning for Weakly-Supervised Semantic Segmentation.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Semi- and Weakly- Supervised Semantic Segmentation with Deep Convolutional Neural Networks.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Mobile Media Thumbnailing.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Weakly Supervised RBM for Semantic Segmentation.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Dictionary learning based superpixels clustering for weakly-supervised semantic segmentation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Color names learning using convolutional neural networks.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Concurrent group activity classification with context modeling.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Hybrid Learning Framework for Large-Scale Web Image Annotation and Localization.
Proceedings of the Working Notes of CLEF 2015, 2015

2014
Personalized Geo-Specific Tag Recommendation for Photos on Social Websites.
IEEE Trans. Multim., 2014

Clustering-Guided Sparse Structural Learning for Unsupervised Feature Selection.
IEEE Trans. Knowl. Data Eng., 2014

Learning Robust Face Representation With Classwise Block-Diagonal Structure.
IEEE Trans. Inf. Forensics Secur., 2014

Undoing the codebook bias by linear transformation with sparsity and F-norm constraints for image classification.
Pattern Recognit. Lett., 2014

Sparse representation for robust abnormality detection in crowded scenes.
Pattern Recognit., 2014

Key observation selection-based effective video synopsis for camera network.
Mach. Vis. Appl., 2014

Semi-supervised Unified Latent Factor learning with multi-view data.
Mach. Vis. Appl., 2014

Sparse semantic metric learning for image retrieval.
Multim. Syst., 2014

Object categorization in sub-semantic space.
Neurocomputing, 2014

Adaptive spatial partition learning for image classification.
Neurocomputing, 2014

Image classification by non-negative sparse coding, correlation constrained low-rank and sparse decomposition.
Comput. Vis. Image Underst., 2014

Projective Matrix Factorization with unified embedding for social image tagging.
Comput. Vis. Image Underst., 2014

Regularized Hierarchical Feature Learning with Non-negative Sparsity and Selectivity for Image Classification.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Learning a Representative and Discriminative Part Model with Deep Convolutional Features for Scene Recognition.
Proceedings of the Computer Vision - ACCV 2014, 2014

Image Representation Learning by Deep Appearance and Spatial Coding.
Proceedings of the Computer Vision - ACCV 2014, 2014

Labeling Complicated Objects: Multi-View Multi-Instance Multi-Label Learning.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Learning Low-Rank Representations with Classwise Block-Diagonal Structure for Robust Face Recognition.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Enhancing news organization for convenient retrieval and browsing.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Image classification using Harr-like transformation of local features with coding residuals.
Signal Process., 2013

Ordinal regularized manifold feature extraction for image ranking.
Signal Process., 2013

Image classification using spatial pyramid robust sparse coding.
Pattern Recognit. Lett., 2013

MLRank: Multi-correlation Learning to Rank for image annotation.
Pattern Recognit., 2013

Laplacian affine sparse coding with tilt and orientation consistency for image classification.
J. Vis. Commun. Image Represent., 2013

Beyond visual features: A weak semantic image representation using exemplar classifiers for classification.
Neurocomputing, 2013

Correlation consistency constrained probabilistic matrix factorization for social tag refinement.
Neurocomputing, 2013

Nonlinear matrix factorization with unified embedding for social tag relevance learning.
Neurocomputing, 2013

Structure preserving non-negative matrix factorization for dimensionality reduction.
Comput. Vis. Image Underst., 2013

TCRec: product recommendation via exploiting social-trust network and product category information.
Proceedings of the 22nd International World Wide Web Conference, 2013

Fine-Grained Image Classification Using Color Exemplar Classifiers.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Object Categorization Using Local Feature Context.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Beyond bag of words: image representation in sub-semantic space.
Proceedings of the ACM Multimedia Conference, 2013

Object co-segmentation via discriminative low rank matrix recovery.
Proceedings of the ACM Multimedia Conference, 2013

Label localization with weakly spatial constrained graph propagation.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Label localization by appearance guided graph inferring.
Proceedings of the IEEE International Conference on Image Processing, 2013

Discriminative Spatial Codebook Generation for Image Classification.
Proceedings of the Seventh International Conference on Image and Graphics, 2013

Robust Feature Encoding with Neighborhood Information for Image Classification.
Proceedings of the Seventh International Conference on Image and Graphics, 2013

Weakly-Supervised Dual Clustering for Image Semantic Segmentation.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
User-Aware Image Tag Refinement via Ternary Semantic Analysis.
IEEE Trans. Multim., 2012

Weakly Supervised Graph Propagation Towards Collective Image Parsing.
IEEE Trans. Multim., 2012

A Boosting, Sparsity- Constrained Bilinear Model for Object Recognition.
IEEE Multim., 2012

Social tag alignment with image regions by sparse reconstructions.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Low rank metric learning for social image retrieval.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Ordinal preserving projection: a novel dimensionality reduction method for image ranking.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Key observation selection for effective video synopsis.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Learning distance metric regression for facial age estimation.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Collaborative PLSA for multi-view clustering.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Noisy Tag Alignment with Image Regions.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Anomaly detection in crowded scene via appearance and dynamics joint modeling.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Beyond local image features: Scene calssification using supervised semantic representation.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Chat with illustration: a chat system with visual aids.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Learning ordinal discriminative features for age estimation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Weighted Interaction Force Estimation for Abnormality Detection in Crowd Scenes.
Proceedings of the Computer Vision - ACCV 2012, 2012

Co-regularized PLSA for Multi-view Clustering.
Proceedings of the Computer Vision, 2012

Unsupervised Feature Selection Using Nonnegative Spectral Analysis.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Correlation Mining for Web News Information Retrieval.
Proceedings of the Computational Social Networks, 2012

2011
Boosted Exemplar Learning for Action Recognition and Annotation.
IEEE Trans. Circuits Syst. Video Technol., 2011

Latent Topic Visual Language Model for Object Categorization.
Proceedings of the SIGMAP 2011, 2011

Exploiting user information for image tag refinement.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

News contextualization with geographic and visual information.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Descriptive local feature groups for image classification.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

One step beyond bags of features: Visual categorization using components.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Global Trajectory Construction across Multi-cameras via Graph Matching.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Image classification by non-negative sparse coding, low-rank and sparse decomposition.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Cross-media retrieval: state-of-the-art and open issues.
Int. J. Multim. Intell. Secur., 2010

Discovering Phrase-Level Lexicon for Image Annotation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Visual Attention Model Based Object Tracking.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Human Action Recognition in Videos Using Hybrid Motion Features.
Proceedings of the Advances in Multimedia Modeling, 2010

Extended CBIR via Learning Semantics of Query Image.
Proceedings of the Advances in Multimedia Modeling, 2010

Image annotation using multi-correlation probabilistic matrix factorization.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Sparse constraint nearest neighbour selection in cross-media retrieval.
Proceedings of the International Conference on Image Processing, 2010

A improved silhouette tracking approach integrating particle filter with graph cuts.
Proceedings of the IEEE International Conference on Acoustics, 2010

Multi-modal multi-correlation person-centric news retrieval.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Image Classification Using Spatial Pyramid Coding and Visual Word Reweighting.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
Image annotation via graph learning.
Pattern Recognit., 2009

Concept-Specific Visual Vocabulary Construction for Object Categorization.
Proceedings of the Advances in Multimedia Information Processing, 2009

Web image mining using concept sensitive Markov stationary features.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Linking video ADS with product or service information by web search.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Web image retrieval via learning semantics of query image.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Category sensitive codebook construction for object category recognition.
Proceedings of the International Conference on Image Processing, 2009

Expanded bag of words representation for object classification.
Proceedings of the International Conference on Image Processing, 2009

Spatial pyramid based histogram representation for visual tracking with partial occlusion.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Human action recognition in videos using motion impression image.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Boosted Exemplar Learning for human action recognition.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

2008
A graph-based image annotation framework.
Pattern Recognit. Lett., 2008

Hierarchical clustering-based navigation of image search results.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Hand posture recognition with co-training.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Query oriented subspace shifting for near-duplicate image detection.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
Dual cross-media relevance model for image annotation.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Human behaviour consistent relevance feedback model for image retrieval.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Image Annotation Refinement using NSC-Based Word Correlation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
An adaptive graph model for automatic image annotation.
Proceedings of the 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2006

A Robust Method for TV Logo Tracking in Video Streams.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Web Image Mining Based on Modeling Concept-Sensitive Salient Regions.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Medical Image Annotation and Retrieval Using Visual Features.
Proceedings of the Evaluation of Multilingual and Multi-modal Information Retrieval, 2006

Medical Image Annotation and Retrieval Using Visual Features.
Proceedings of the Working Notes for CLEF 2006 Workshop co-located with the 10th European Conference on Digital Libraries (ECDL 2006), 2006


  Loading...