Liqiang Nie

Orcid: 0000-0003-1476-0273

According to our database1, Liqiang Nie authored at least 461 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
UNK-VQA: A Dataset and a Probe Into the Abstention Ability of Multi-Modal Large Models.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Contrastive Multi-Bit Collaborative Learning for Deep Cross-Modal Hashing.
IEEE Trans. Knowl. Data Eng., November, 2024

Contrastive Incomplete Cross-Modal Hashing.
IEEE Trans. Knowl. Data Eng., November, 2024

Similarity-Induced Weighted Consensus Laplacian Matrix Learning for Multiview Clustering.
IEEE Trans. Syst. Man Cybern. Syst., October, 2024

Talking Face Generation With Audio-Deduced Emotional Landmarks.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

Deep Multi-Modal Hashing With Semantic Enhancement for Multi-Label Micro-Video Retrieval.
IEEE Trans. Knowl. Data Eng., October, 2024

Pre-Trained Transformer-Based Parallel Multi-Channel Adaptive Image Sequence Interpolation Network.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Detecting and Grounding Multi-Modal Media Manipulation and Beyond.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

Universal Relocalizer for Weakly Supervised Referring Expression Grounding.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

Efficient Brain Tumor Segmentation with Lightweight Separable Spatial Convolutional Network.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

Instance-level Adversarial Source-free Domain Adaptive Person Re-identification.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

Dual Dynamic Threshold Adjustment Strategy.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

Efficient Image-Text Retrieval via Keyword-Guided Pre-Screening.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Query-Oriented Micro-Video Summarization.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Rule-Guided Counterfactual Explainable Recommendation.
IEEE Trans. Knowl. Data Eng., May, 2024

Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

A Survey of Knowledge Enhanced Pre-Trained Language Models.
IEEE Trans. Knowl. Data Eng., April, 2024

Stochastic Latent Talking Face Generation Toward Emotional Expressions and Head Poses.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Semantic-Aware Contrastive Learning With Proposal Suppression for Video Semantic Role Grounding.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Voice-Face Homogeneity Tells Deepfake.
ACM Trans. Multim. Comput. Commun. Appl., March, 2024

Dynamic Multimodal Fusion via Meta-Learning Towards Micro-Video Recommendation.
ACM Trans. Inf. Syst., March, 2024

Semantic Collaborative Learning for Cross-Modal Moment Localization.
ACM Trans. Inf. Syst., March, 2024

Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model.
ACM Trans. Inf. Syst., March, 2024

Dual-track spatio-temporal learning for urban flow prediction with adaptive normalization.
Artif. Intell., March, 2024

Stylized Data-to-text Generation: A Case Study in the E-Commerce Domain.
ACM Trans. Inf. Syst., January, 2024

TryonCM2: Try-on-Enhanced Fashion Compatibility Modeling Framework.
IEEE Trans. Neural Networks Learn. Syst., January, 2024

Source-free Style-diversity Adversarial Domain Adaptation with Privacy-preservation for person re-identification.
Knowl. Based Syst., January, 2024

An Efficient Attribute-Preserving Framework for Face Swapping.
IEEE Trans. Multim., 2024

Muti-Modal Emotion Recognition via Hierarchical Knowledge Distillation.
IEEE Trans. Multim., 2024

AAMT: Adversarial Attack-Driven Mutual Teaching for Source-Free Domain-Adaptive Person Reidentification.
IEEE Trans. Multim., 2024

Learning to Agree on Vision Attention for Visual Commonsense Reasoning.
IEEE Trans. Multim., 2024

Anti-Collapse Loss for Deep Metric Learning.
IEEE Trans. Multim., 2024

Modeling Multiple Aesthetic Views for Series Photo Selection.
IEEE Trans. Multim., 2024

Dual-Domain Aligned Deep Hierarchical Matrix Factorization Method for Micro-Video Multi-Label Classification.
IEEE Trans. Multim., 2024

SADCMF: Self-Attentive Deep Consistent Matrix Factorization for Micro-Video Multi-Label Classification.
IEEE Trans. Multim., 2024

Audio-Driven Talking Video Frame Restoration.
IEEE Trans. Multim., 2024

BadCM: Invisible Backdoor Attack Against Cross-Modal Learning.
IEEE Trans. Image Process., 2024

Spatial Structure Constraints for Weakly Supervised Semantic Segmentation.
IEEE Trans. Image Process., 2024

Heterogeneous Feature Collaboration Network for Salient Object Detection in Optical Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2024

Stereo Image Restoration via Attention-Guided Correspondence Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

Multimodal matching-aware co-attention networks with mutual knowledge distillation for fake news detection.
Inf. Sci., 2024

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation.
CoRR, 2024

RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training.
CoRR, 2024

Preview-based Category Contrastive Learning for Knowledge Distillation.
CoRR, 2024

Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing.
CoRR, 2024

Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding.
CoRR, 2024

Unveil Benign Overfitting for Transformer in Vision: Training Dynamics, Convergence, and Generalization.
CoRR, 2024

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture.
CoRR, 2024

Laser: Parameter-Efficient LLM Bi-Tuning for Sequential Recommendation with Collaborative Information.
CoRR, 2024

GPT-Augmented Reinforcement Learning with Intelligent Control for Vehicle Dispatching.
CoRR, 2024

Social Debiasing for Fair Multi-modal LLMs.
CoRR, 2024

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks.
CoRR, 2024

EPD: Long-term Memory Extraction, Context-awared Planning and Multi-iteration Decision @ EgoPlan Challenge ICML 2024.
CoRR, 2024

Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation.
CoRR, 2024

Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding.
CoRR, 2024

MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models.
CoRR, 2024

Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning.
CoRR, 2024

Towards Stable and Storage-efficient Dataset Distillation: Matching Convexified Trajectory.
CoRR, 2024

ObjectNLQ @ Ego4D Episodic Memory Challenge 2024.
CoRR, 2024

HCQA @ Ego4D EgoSchema Challenge 2024.
CoRR, 2024

A Survey on Human Preference Learning for Large Language Models.
CoRR, 2024

Unified Text-to-Image Generation and Retrieval.
CoRR, 2024

Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL.
CoRR, 2024

CorDA: Context-Oriented Decomposition Adaptation of Large Language Models.
CoRR, 2024

Dual Dynamic Threshold Adjustment Strategy for Deep Metric Learning.
CoRR, 2024

A Survey of Generative Search and Recommendation in the Era of Large Language Models.
CoRR, 2024

MMGRec: Multimodal Generative Recommendation with Transformer Model.
CoRR, 2024

Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation.
CoRR, 2024

FecTek: Enhancing Term Weight in Lexicon-Based Retrieval with Feature Context and Term-level Knowledge.
CoRR, 2024

Cluster-based Graph Collaborative Filtering.
CoRR, 2024

RoboMP<sup>2</sup>: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models.
CoRR, 2024

LLMvsSmall Model? Large Language Model Based Text Augmentation Enhanced Personality Detection Model.
CoRR, 2024

WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More.
CoRR, 2024

Interactive Garment Recommendation with User in the Loop.
CoRR, 2024

Sentiment-enhanced Graph-based Sarcasm Explanation in Dialogue.
CoRR, 2024

Enhancing the Emotional Generation Capability of Large Language Models via Emotional Chain-of-Thought.
CoRR, 2024

Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Let Me Show You Step by Step: An Interpretable Graph Routing Network for Knowledge-based Visual Question Answering.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Explicit Granularity and Implicit Scale Correspondence Learning for Point-Supervised Video Moment Localization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Attribute-driven Disentangled Representation Learning for Multimodal Recommendation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Revisiting Unsupervised Temporal Action Localization: The Primacy of High-Quality Actionness and Pseudolabels.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

NovaChart: A Large-scale Dataset towards Chart Understanding and Generation of Multimodal Large Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Diffusion Facial Forgery Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Breaking Barriers of System Heterogeneity: Straggler-Tolerant Multimodal Federated Learning via Knowledge Distillation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Multi-Factor Adaptive Vision Selection for Egocentric Video Question Answering.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Mind the Boundary: Coreset Selection via Reconstructing the Decision Boundary.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Revisiting Context Aggregation for Image Matting.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Two-Stage Information Bottleneck For Temporal Language Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

A Multi-View Clustering Algorithm for Short Text.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

VK-G2T: Vision and Context Knowledge Enhanced Gloss2text.
Proceedings of the IEEE International Conference on Acoustics, 2024

Thoughts to Target: Enhance Planning for Target-driven Conversation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

GPS-Gaussian: Generalizable Pixel-Wise 3D Gaussian Splatting for Real-Time Human Novel View Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-Based Human Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Discriminative Probing and Tuning for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Fourier Priors-Guided Diffusion for Zero-Shot Joint Low-Light Enhancement and Deblurring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Self-chats from Large Language Models Make Small Emotional Support Chatbot Better.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LRQuant: Learnable and Robust Post-Training Quantization for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Distillation Enhanced Generative Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LLM vs Small Model? Large Language Model Based Text Augmentation Enhanced Personality Detection Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Exploiting the Social-Like Prior in Transformer for Visual Reasoning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Review Polarity-Wise Recommender.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Dual Consistency-Enhanced Semi-Supervised Sentiment Analysis Towards COVID-19 Tweets.
IEEE Trans. Knowl. Data Eng., December, 2023

A Spatial and Adversarial Representation Learning Approach for Land Use Classification with POIs.
ACM Trans. Intell. Syst. Technol., December, 2023

Multi-Granularity Interaction and Integration Network for Video Question Answering.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Attribute-Guided Collaborative Learning for Partial Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Causal Inference for Knowledge Graph Based Recommendation.
IEEE Trans. Knowl. Data Eng., November, 2023

Causal Inference for Leveraging Image-Text Matching Bias in Multi-Modal Fake News Detection.
IEEE Trans. Knowl. Data Eng., November, 2023

TME: Tree-guided Multi-task Embedding Learning towards Semantic Venue Annotation.
ACM Trans. Inf. Syst., October, 2023

Optimizing Spaced Repetition Schedule by Capturing the Dynamics of Memory.
IEEE Trans. Knowl. Data Eng., October, 2023

MM-FRec: Multi-Modal Enhanced Fashion Item Recommendation.
IEEE Trans. Knowl. Data Eng., October, 2023

Pre-Trained Semantic Embeddings for POI Categories Based on Multiple Contexts.
IEEE Trans. Knowl. Data Eng., September, 2023

Guest Editorial Introduction to the Special Issue on Video Transformers.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Multi-level adversarial attention cross-modal hashing.
Signal Process. Image Commun., September, 2023

Learning Geometric Transformation for Point Cloud Completion.
Int. J. Comput. Vis., September, 2023

HS-GCN: Hamming Spatial Graph Convolutional Networks for Recommendation.
IEEE Trans. Knowl. Data Eng., June, 2023

Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation.
Int. J. Comput. Vis., June, 2023

Egocentric Early Action Prediction via Adversarial Knowledge Distillation.
ACM Trans. Multim. Comput. Commun. Appl., 2023

DDIFN: A Dual-discriminator Multi-modal Medical Image Fusion Network.
ACM Trans. Multim. Comput. Commun. Appl., 2023

On Modality Bias Recognition and Reduction.
ACM Trans. Multim. Comput. Commun. Appl., 2023

DualGNN: Dual Graph Neural Network for Multimedia Recommendation.
IEEE Trans. Multim., 2023

Siamese Alignment Network for Weakly Supervised Video Moment Retrieval.
IEEE Trans. Multim., 2023

Micro-Influencer Recommendation by Multi-Perspective Account Representation Learning.
IEEE Trans. Multim., 2023

Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for Visual Sentiment Distributions.
IEEE Trans. Multim., 2023

Modality-Oriented Graph Learning Toward Outfit Compatibility Modeling.
IEEE Trans. Multim., 2023

Learning Dual Low-Rank Representation for Multi-Label Micro-Video Classification.
IEEE Trans. Multim., 2023

Self-Supervised Correlation Learning for Cross-Modal Retrieval.
IEEE Trans. Multim., 2023

Disentangled Multimodal Representation Learning for Recommendation.
IEEE Trans. Multim., 2023

Category-Aware Multimodal Attention Network for Fashion Compatibility Modeling.
IEEE Trans. Multim., 2023

DBiased-P: Dual-Biased Predicate Predictor for Unbiased Scene Graph Generation.
IEEE Trans. Multim., 2023

Neighbor-Guided Consistent and Contrastive Learning for Semi-Supervised Action Recognition.
IEEE Trans. Image Process., 2023

Toward Fine-Grained Talking Face Generation.
IEEE Trans. Image Process., 2023

Joint Answering and Explanation for Visual Commonsense Reasoning.
IEEE Trans. Image Process., 2023

Semantic-Aware Modular Capsule Routing for Visual Question Answering.
IEEE Trans. Image Process., 2023

Adaptive Edge-Aware Semantic Interaction Network for Salient Object Detection in Optical Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2023

Enhanced Multi-Domain Dialogue State Tracker With Second-Order Slot Interactions.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Preface to the Special Issue on Multimodal Learning Integrated with Pre-training Techniques.
Int. J. Softw. Informatics, 2023

Correction to: Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation.
Int. J. Comput. Vis., 2023

A Survey on Video Moment Localization.
ACM Comput. Surv., 2023

Understanding Before Recommendation: Semantic Aspect-Aware Review Exploitation via Large Language Models.
CoRR, 2023

Unsupervised Temporal Action Localization via Self-paced Incremental Learning.
CoRR, 2023

Generating Human-Centric Visual Cues for Human-Object Interaction Detection via Large Vision-Language Models.
CoRR, 2023

UNK-VQA: A Dataset and A Probe into Multi-modal Large Models' Abstention Ability.
CoRR, 2023

Uncovering Hidden Connections: Iterative Tracking and Reasoning for Video-grounded Dialog.
CoRR, 2023

ELIP: Efficient Language-Image Pre-training with Fewer Vision Tokens.
CoRR, 2023

Building Emotional Support Chatbots in the Era of LLMs.
CoRR, 2023

Towards Generalizable Deepfake Detection by Primary Region Regularization.
CoRR, 2023

DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection.
CoRR, 2023

Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval.
CoRR, 2023

ChatLLM Network: More brains, More intelligence.
CoRR, 2023

Rethinking Context Aggregation in Natural Image Matting.
CoRR, 2023

Learning Reliable Representations for Incomplete Multi-View Partial Multi-Label Classification.
CoRR, 2023

Efficient Image-Text Retrieval via Keyword-Guided Pre-Screening.
CoRR, 2023

Multi-queue Momentum Contrast for Microvideo-Product Retrieval.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Strategy-aware Bundle Recommender System.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

LightGT: A Light Graph Transformer for Multimedia Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Learnable Pillar-based Re-ranking for Image-Text Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

OFAR: A Multimodal Evidence Retrieval Framework for Illegal Live-streaming Identification.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Adapting Generative Pretrained Language Model for Open-domain Multimodal Sentence Summarization.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Dual Semantic Knowledge Composed Multimodal Dialog Systems.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Target-Guided Composed Image Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unlocking the Power of Multimodal Learning for Emotion Recognition in Conversation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

RTQ: Rethinking Video-language Understanding Based on Image-text Model.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

General Debiasing for Multimodal Sentiment Analysis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Advancing Video Question Answering with a Multi-modal and Multi-layer Question Enhancement Network.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Towards Realistic Conversational Head Generation: A Comprehensive Framework for Lifelike Video Synthesis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Semantic-Guided Feature Distillation for Multimodal Recommendation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Mask Again: Masked Knowledge Distillation for Masked Video Modeling.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

StyleEDL: Style-Guided High-order Attention Network for Image Emotion Distribution Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Temporal Sentence Grounding in Streaming Videos.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Text-based Person Search without Parallel Image-Text Data.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Interactive Object Placement with Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Modeling Product's Visual and Functional Characteristics for Recommender Systems (Extended Abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Enhancing Factorization Machines with Generalized Metric Learning (Extended Abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

An Empirical Study of Frame Selection for Text-to-Video Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Aspect-to-Scope Oriented Multi-view Contrastive Learning for Aspect-based Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

CHMATCH: Contrastive Hierarchical Matching and Robust Adaptive Threshold Boosted Semi-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CNVid-3.5M: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Self-adaptive Context and Modal-interaction Modeling For Multimodal Emotion Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Causal Intervention and Counterfactual Reasoning for Multi-modal Fake News Detection.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Mutual-Enhanced Incongruity Learning Network for Multi-Modal Sarcasm Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Response Generation by Jointly Modeling Personalized Linguistic Styles and Emotions.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Answer Questions with Right Image Regions: A Visual Attention Regularization Approach.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Question Tagging via Graph-guided Ranking.
ACM Trans. Inf. Syst., 2022

Dynamic Graph Reasoning for Conversational Open-Domain Question Answering.
ACM Trans. Inf. Syst., 2022

Feature-Level Attentive ICF for Recommendation.
ACM Trans. Inf. Syst., 2022

Efficient Multi-modal Hashing with Online Query Adaption for Multimedia Retrieval.
ACM Trans. Inf. Syst., 2022

Hierarchical User Intent Graph Network for Multimedia Recommendation.
IEEE Trans. Multim., 2022

Discover Micro-Influencers for Brands via Better Understanding.
IEEE Trans. Multim., 2022

Tripartite Graph Regularized Latent Low-Rank Representation for Fashion Compatibility Prediction.
IEEE Trans. Multim., 2022

Modeling Product's Visual and Functional Characteristics for Recommender Systems.
IEEE Trans. Knowl. Data Eng., 2022

An Attribute-Aware Attentive GCN Model for Attribute Missing in Recommendation.
IEEE Trans. Knowl. Data Eng., 2022

Enhancing Factorization Machines With Generalized Metric Learning.
IEEE Trans. Knowl. Data Eng., 2022

Loss Re-Scaling VQA: Revisiting the Language Prior Problem From a Class-Imbalance View.
IEEE Trans. Image Process., 2022

Partially Supervised Compatibility Modeling.
IEEE Trans. Image Process., 2022

Divide-and-Conquer Predictor for Unbiased Scene Graph Generation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Hierarchical Feature Aggregation Based on Transformer for Image-Text Matching.
IEEE Trans. Circuits Syst. Video Technol., 2022

Adversarial Graph Convolutional Network for Cross-Modal Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2022

Multi-Relation Extraction via A Global-Local Graph Convolutional Network.
IEEE Trans. Big Data, 2022

MMNet: Multi-modal Fusion with Mutual Learning Network for Fake News Detection.
CoRR, 2022

Deep Convolutional Pooling Transformer for Deepfake Detection.
CoRR, 2022

Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem.
CoRR, 2022

User-controllable Recommendation Against Filter Bubbles.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

V2P: Vision-to-Prompt based Multi-Modal Product Summary Generation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Privacy-Preserving Synthetic Data Generation for Recommendation Systems.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Counterfactual Reasoning for Out-of-distribution Multimodal Sentiment Analysis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Search-oriented Micro-video Captioning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Baseline for ViCo Conversational Head Generation Challenge.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Image-text Retrieval: A Survey on Recent Research and Development.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Win The Lottery Ticket Via Fourier Analysis: Frequencies Guided Network Pruning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Network Binarization via Contrastive Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Lipschitz Continuity Retained Binary Neural Network.
Proceedings of the Computer Vision - ECCV 2022, 2022

Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MMCoQA: Conversational Question Answering over Text, Tables, and Images.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MERIt: Meta-Path Guided Contrastive Learning for Logical Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
GuessUNeed: Recommending Courses via Neural Attention Network and Course Prerequisite Relation Embeddings.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Attribute-wise Explainable Fashion Compatibility Modeling.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Market2Dish: Health-aware Food Recommendation.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Urban Perception: Sensing Cities via a Deep Interactive Multi-task Learning Framework.
ACM Trans. Multim. Comput. Commun. Appl., 2021

HGAT: Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification.
ACM Trans. Inf. Syst., 2021

Hybrid-Attention Enhanced Two-Stream Fusion Network for Video Venue Prediction.
IEEE Trans. Multim., 2021

Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval.
IEEE Trans. Multim., 2021

Learning Low-Rank Sparse Representations With Robust Relationship Inference for Image Memorability Prediction.
IEEE Trans. Multim., 2021

User Identity Linkage Across Social Media via Attentive Time-Aware User Modeling.
IEEE Trans. Multim., 2021

BATCH: A Scalable Asymmetric Discrete Cross-Modal Hashing.
IEEE Trans. Knowl. Data Eng., 2021

Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos.
IEEE Trans. Image Process., 2021

Conversational Image Search.
IEEE Trans. Image Process., 2021

Coarse-to-Fine Semantic Alignment for Cross-Modal Moment Localization.
IEEE Trans. Image Process., 2021

Video Moment Localization via Deep Cross-Modal Hashing.
IEEE Trans. Image Process., 2021

Cooperation Learning From Multiple Social Networks: Consistent and Complementary Perspectives.
IEEE Trans. Cybern., 2021

Reconstruction regularized low-rank subspace learning for cross-modal retrieval.
Pattern Recognit., 2021

PaintNet: A shape-constrained generative framework for generating clothing from fashion model.
Multim. Tools Appl., 2021

Learning robust affinity graph representation for multi-view clustering.
Inf. Sci., 2021

Human activity recognition by manifold regularization based dynamic graph convolutional networks.
Neurocomputing, 2021

Learning Robust Recommender from Noisy Implicit Feedback.
CoRR, 2021

GRCN: Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback.
CoRR, 2021

Hierarchical User Intent Graph Network forMultimedia Recommendation.
CoRR, 2021

A Graph-guided Multi-round Retrieval Method for Conversational Open-domain Question Answering.
CoRR, 2021

Factor-level Attentive ICF for Recommendation.
CoRR, 2021

Answer Questions with Right Image Regions: A Visual Attention Regularization Approach.
CoRR, 2021

Incremental Knowledge Based Question Answering.
CoRR, 2021

Interest-aware Message-Passing GCN for Recommendation.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Denoising Implicit Feedback for Recommendation.
Proceedings of the WSDM '21, 2021

Comprehensive Linguistic-Visual Composition Network for Image Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Dynamic Modality Interaction Modeling for Image-Text Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Multimodal Activation: Awakening Dialog Robots without Wake Words.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Adversarial-Enhanced Hybrid Graph Network for User Identity Linkage.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Hierarchical Deep Residual Reasoning for Temporal Moment Localization.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

PLM-IPE: A Pixel-Landmark Mutual Enhanced Framework for Implicit Preference Estimation.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Collocation and Try-on Network: Whether an Outfit is Compatible.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multimodal Dialog System: Relational Graph-based Context-aware Question Understanding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Contrastive Learning for Cold-Start Recommendation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Complementary Factorization towards Outfit Compatibility Modeling.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Graph Convolutional Multi-modal Hashing for Flexible Multimedia Retrieval.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Focal and Composed Vision-semantic Modeling for Visual Question Answering.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Graph Contrastive Clustering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Lipschitz Continuity Guided Knowledge Distillation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
An End-to-End Attention-Based Neural Model for Complementary Clothing Matching.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Large-Scale Question Tagging via Joint Question-Topic Embedding Learning.
ACM Trans. Inf. Syst., 2020

Fine-Grained Privacy Detection with Graph-Regularized Hierarchical Attentive Representation Learning.
ACM Trans. Inf. Syst., 2020

Low-Rank Regularized Multi-Representation Learning for Fashion Compatibility Prediction.
IEEE Trans. Multim., 2020

Learning the Traditional Art of Chinese Calligraphy via Three-Dimensional Reconstruction and Assessment.
IEEE Trans. Multim., 2020

Neural Multimodal Cooperative Learning Toward Micro-Video Understanding.
IEEE Trans. Image Process., 2020

Iterative Local-Global Collaboration Learning Towards One-Shot Video Person Re-Identification.
IEEE Trans. Image Process., 2020

Model Optimization Boosting Framework for Linear Model Hash Learning.
IEEE Trans. Image Process., 2020

Neural Compatibility Modeling With Probabilistic Knowledge Distillation.
IEEE Trans. Image Process., 2020

Scalable Deep Hashing for Large-Scale Social Image Retrieval.
IEEE Trans. Image Process., 2020

Graph Convolutional Network Hashing.
IEEE Trans. Cybern., 2020

SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2020

Guest editorial: Image/video understanding and analysis.
Pattern Recognit. Lett., 2020

Domain Adaptation with Few Labeled Source Samples by Graph Regularization.
Neural Process. Lett., 2020

Cross-modal dual subspace learning with adversarial network.
Neural Networks, 2020

Hashtag our stories: Hashtag recommendation for micro-videos via harnessing multiple modalities.
Knowl. Based Syst., 2020

Cross-modal recipe retrieval via parallel- and cross-attention networks learning.
Knowl. Based Syst., 2020

HesGCN: Hessian graph convolutional networks for semi-supervised classification.
Inf. Sci., 2020

Loss-rescaling VQA: Revisiting Language Prior Problem from a Class-imbalance View.
CoRR, 2020

A^2-GCN: An Attribute-aware Attentive GCN Model for Recommendation.
CoRR, 2020

LARA: Attribute-to-feature Adversarial Learning for New-item Recommendation.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Generative Attribute Manipulation Scheme for Flexible Fashion Search.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Fashion Compatibility Modeling through a Multi-modal Try-on-guided Scheme.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Personalized Item Recommendation for Second-hand Trading Platform.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Context-Aware Multi-View Summarization Network for Image-Text Matching.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

What Aspect Do You Like: Multi-scale Time-aware User Interest Modeling for Micro-video Recommendation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Auxiliary Template-Enhanced Generative Compatibility Modeling.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Unified Graph and Low-Rank Tensor Learning for Multi-View Clustering.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Compatibility Modeling: Data and Knowledge Applications for Clothing Matching
Synthesis Lectures on Information Concepts, Retrieval, and Services, Morgan & Claypool Publishers, ISBN: 978-3-031-02321-7, 2019

Multimodal Learning toward Micro-Video Understanding
Synthesis Lectures on Image, Video, and Multimedia Processing, Morgan & Claypool Publishers, ISBN: 978-3-031-02255-5, 2019

From Question to Text: Question-Oriented Feature Attention for Answer Selection.
ACM Trans. Inf. Syst., 2019

Attentive Long Short-Term Preference Modeling for Personalized Product Search.
ACM Trans. Inf. Syst., 2019

Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval.
IEEE Trans. Multim., 2019

Distribution-Oriented Aesthetics Assessment With Semantic-Aware Hybrid Network.
IEEE Trans. Multim., 2019

Discrete Hashing With Multiple Supervision.
IEEE Trans. Image Process., 2019

Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning.
IEEE Trans. Image Process., 2019

A Framework of Joint Low-Rank and Sparse Regression for Image Memorability Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2019

Low-rank regularized tensor discriminant representation for image set classification.
Signal Process., 2019

Multi-criteria active deep learning for image classification.
Knowl. Based Syst., 2019

Multi-view face hallucination using SVD and a mapping model.
Inf. Sci., 2019

HpLapGCN: Hypergraph <i>p</i>-Laplacian graph convolutional networks.
Neurocomputing, 2019

Principal Component Analysis on Graph-Hessian.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

Supervised Hierarchical Cross-Modal Hashing.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Online Multi-modal Hashing with Dynamic Query-adaption.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Prototype-guided Attribute-wise Interpretable Scheme for Clothing Matching.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Quantifying and Alleviating the Language Prior Problem in Visual Question Answering.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

User Attention-guided Multimodal Dialog Systems.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Learn to Gesture: Let Your Body Speak.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Virtually Trying on New Clothing with Arbitrary Poses.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Personalized Hashtag Recommendation for Micro-videos.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

GP-BPR: Personalized Compatibility Modeling for Clothing Matching.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multimodal Dialog System: Generating Responses via Adaptive Decoders.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

User Diverse Preference Modeling by Multimodal Attentive Metric Learning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Routing Micro-videos via A Temporal Graph-guided Recommendation System.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Market2Dish: A Health-aware Food Recommendation System.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Seeking Micro-influencers for Brand Promotion.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Personalized Capsule Wardrobe Creation with Garment and User Modeling.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Two-Step Cross-Modal Hashing by Exploiting Label Correlations and Preserving Similarity in Both Steps.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Video-Based Cross-Modal Recipe Retrieval.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Improving Distantly-Supervised Relation Extraction with Joint Label Embedding.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Long-tail Hashtag Recommendation for Micro-videos with Graph Convolutional Network.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Explicit Interaction Model towards Text Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Guest Editorial: Special Section on "Multimedia Understanding via Multimodal Analytics".
ACM Trans. Multim. Comput. Commun. Appl., 2018

Sentence Relations for Extractive Summarization with Deep Neural Networks.
ACM Trans. Inf. Syst., 2018

Rank-Constrained Spectral Clustering With Flexible Embedding.
IEEE Trans. Neural Networks Learn. Syst., 2018

Low-Rank Multi-View Embedding Learning for Micro-Video Popularity Prediction.
IEEE Trans. Knowl. Data Eng., 2018

Exploring Web Images to Enhance Skin Disease Analysis Under A Computer Vision Framework.
IEEE Trans. Cybern., 2018

An Adaptive Semisupervised Feature Analysis for Video Semantic Recognition.
IEEE Trans. Cybern., 2018

Guest Editorial: Spatio-temporal Feature Learning for Unconstrained Video Analysis.
Multim. Tools Appl., 2018

Guest Editorial: Semantic Concept Discovery in MM Data.
Multim. Tools Appl., 2018

Identifying advisor-advisee relationships from co-author networks via a novel deep model.
Inf. Sci., 2018

Guest Editorial: Query understanding.
Neurocomputing, 2018

TEM: Tree-enhanced Embedding Model for Explainable Recommendation.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Learning on Partial-Order Hypergraphs.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Chat More: Deepening and Widening the Chatting Topic via A Deep Model.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

A Personal Privacy Preserving Framework: I Let You Know Who Can See What.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Neural Compatibility Modeling with Attentive Knowledge Distillation.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Fast Scalable Supervised Hashing.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Attentive Moment Retrieval in Videos.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Venue Prediction for Social Images by Exploiting Rich Temporal Patterns in LBSNs.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Cross-modal Moment Localization in Videos.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Multi-modal Preference Modeling for Product Search.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Quality Matters: Assessing cQA Pair Quality via Transductive Multi-View Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

A Weakly Supervised Method for Topic Segmentation and Labeling in Goal-oriented Dialogues via Reinforcement Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

SDMCH: Supervised Discrete Manifold-Embedded Cross-Modal Hashing.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Discrete Factorization Machines for Fast Feature-based Recommendation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

An Attentive Interaction Network for Context-aware Recommendations.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Learning to Ask Questions in Open-domain Conversational Systems with Typed Decoders.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Supervised Deep Hashing for Hierarchical Labeled Data.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Dual Deep Neural Networks Cross-Modal Hashing.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Compact Indexing and Judicious Searching for Billion-Scale Microblog Retrieval.
ACM Trans. Inf. Syst., 2017

Targeted Advertising in Public Transportation Systems with Quantitative Evaluation.
ACM Trans. Inf. Syst., 2017

Unifying Virtual and Physical Worlds: Learning Toward Local and Global Consistency.
ACM Trans. Inf. Syst., 2017

Cross-Platform App Recommendation by Jointly Modeling Ratings and Texts.
ACM Trans. Inf. Syst., 2017

Modeling Disease Progression via Multisource Multitask Learners: A Case Study With Alzheimer's Disease.
IEEE Trans. Neural Networks Learn. Syst., 2017

Large-Scale Tracking for Images With Few Textures.
IEEE Trans. Multim., 2017

Predicting Image Memorability Through Adaptive Transfer Learning From External Sources.
IEEE Trans. Multim., 2017

I Know What You Want to Express: Sentence Element Inference by Incorporating External Knowledge Base.
IEEE Trans. Knowl. Data Eng., 2017

Data-Driven Answer Selection in Community QA Systems.
IEEE Trans. Knowl. Data Eng., 2017

Learning User Attributes via Mobile Social Multimedia Analytics.
ACM Trans. Intell. Syst. Technol., 2017

Augmented Collaborative Filtering for Sparseness Reduction in Personalized POI Recommendation.
ACM Trans. Intell. Syst. Technol., 2017

Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs.
IEEE Trans. Image Process., 2017

Perceptually Guided Photo Retargeting.
IEEE Trans. Cybern., 2017

Multiview Physician-Specific Attributes Fusion for Health Seeking.
IEEE Trans. Cybern., 2017

Special issue on cross-media big data analytics.
J. Vis. Commun. Image Represent., 2017

Version-sensitive mobile App recommendation.
Inf. Sci., 2017

Simple to complex cross-modal learning to rank.
Comput. Vis. Image Underst., 2017

Simple to Complex Cross-modal Learning to Rank.
CoRR, 2017

Neural Collaborative Filtering.
Proceedings of the 26th International Conference on World Wide Web, 2017

Item Silk Road: Recommending Items from Information Domains to Social Users.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Computational Social Indicators: A Case Study of Chinese University Ranking.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Exploring User-Specific Information in Music Retrieval.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Attentive Collaborative Filtering: Multimedia Recommendation with Item- and Component-Level Attention.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Embedding Factorization Models for Jointly Recommending Items and User Generated Lists.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

NeuroStylist: Neural Compatibility Modeling for Clothing Matching.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Enhancing Micro-video Understanding by Harnessing External Sounds.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Towards Micro-video Understanding by Joint Sequential-Sparse Modeling.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Laplacian-Steered Neural Style Transfer.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Representativeness-aware Aspect Analysis for Brand Monitoring in Social Media.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Exploiting Music Play Sequence for Music Recommendation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

S2JSD-LSH: A Locality-Sensitive Hashing Schema for Probability Distributions.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

What Happens Next? Future Subevent Prediction Using Contextual Hierarchical LSTM.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning from Multiple Social Networks
Synthesis Lectures on Information Concepts, Retrieval, and Services, Morgan & Claypool Publishers, ISBN: 978-3-031-02300-2, 2016

Semantic Photo Retargeting Under Noisy Image Labels.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Volunteerism Tendency Prediction via Harvesting Multiple Social Networks.
ACM Trans. Inf. Syst., 2016

Detecting Densely Distributed Graph Patterns for Fine-Grained Image Categorization.
IEEE Trans. Image Process., 2016

Weakly Supervised Human Fixations Prediction.
IEEE Trans. Cybern., 2016

Weakly Supervised Multilabel Clustering and its Applications in Computer Vision.
IEEE Trans. Cybern., 2016

Perceptual Attributes Optimization for Multivideo Summarization.
IEEE Trans. Cybern., 2016

A Biologically Inspired Automatic System for Media Quality Assessment.
IEEE Trans Autom. Sci. Eng., 2016

Weakly supervised image parsing via label propagation over discriminatively semantic graph.
J. Vis. Commun. Image Represent., 2016

Quality biased multimedia data retrieval in microblogs.
J. Vis. Commun. Image Represent., 2016

Genetic algorithm and mathematical morphology based binarization method for strip steel defect image with non-uniform illumination.
J. Vis. Commun. Image Represent., 2016

An aerial image recognition framework using discrimination and redundancy quality measure.
J. Vis. Commun. Image Represent., 2016

Exploring heterogeneous features for query-focused summarization of categorized community answers.
Inf. Sci., 2016

Bridge the semantic gap between pop music acoustic feature and emotion: Build an interpretable model.
Neurocomputing, 2016

A classification model for semantic entailment recognition with feature combination.
Neurocomputing, 2016

Event graph based contradiction recognition from big data collection.
Neurocomputing, 2016

From action to activity: Sensor-based activity recognition.
Neurocomputing, 2016

Surface defect classification in large-scale strip steel image collection via hybrid chromosome genetic algorithm.
Neurocomputing, 2016

Towards organizing health knowledge on community-based health services.
EURASIP J. Bioinform. Syst. Biol., 2016

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning.
CoRR, 2016

Utilizing Sensor-Social Cues to Localize Objects-of-Interest in Outdoor UGVs.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Smart Ambient Sound Analysis via Structured Statistical Modeling.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Shorter-is-Better: Venue Category Estimation from Micro-Video.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Micro Tells Macro: Predicting the Popularity of Micro-Videos via a Transductive Model.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Learning Compact Visual Representation with Canonical Views for Robust Mobile Landmark Search.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

What Does Social Media Say about Your Stress?.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Sparse Code Filtering for Action Pattern Mining.
Proceedings of the Computer Vision - ACCV 2016, 2016

Fortune Teller: Predicting Your Career Path.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Fusing Social Networks with Deep Learning for Volunteerism Tendency Prediction.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

From Tweets to Wellness: Wellness Event Detection from Twitter Streams.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Retargeting Semantically-Rich Photos.
IEEE Trans. Multim., 2015

Semantic-Based Location Recommendation With Multimodal Venue Semantics.
IEEE Trans. Multim., 2015

Bridging the Vocabulary Gap between Health Seekers and Healthcare Knowledge.
IEEE Trans. Knowl. Data Eng., 2015

Disease Inference from Health-Related Questions via Sparse Deep Learning.
IEEE Trans. Knowl. Data Eng., 2015

On robust image spam filtering via comprehensive visual modeling.
Pattern Recognit., 2015

aMM: Towards adaptive ranking of multi-modal documents.
Int. J. Multim. Inf. Retr., 2015

Multiple Social Network Learning and Its Application in Volunteerism Tendency Prediction.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Weibo-Oriented Chinese News Summarization via Multi-feature Combination.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015

Biologically Inspired Media Quality Modeling.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Online Multimodal Co-indexing and Retrieval of Weakly Labeled Web Image Collections.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Harvesting Multiple Sources for User Profile Learning: a Big Data Study.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Interest Inference via Structure-Constrained Multi-Source Multi-Task Learning.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Action2Activity: Recognizing Complex Activities from Sensor Data.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Recognizing entailment in Chinese texts with feature combination.
Proceedings of the 2015 International Conference on Asian Language Processing, 2015

2014
Learning to Recommend Descriptive Tags for Questions in Social Forums.
ACM Trans. Inf. Syst., 2014

Personalized Recommendations of Locally Interesting Venues to Tourists via Cross-Region Community Matching.
ACM Trans. Intell. Syst. Technol., 2014

WenZher: comprehensive vertical search for healthcare domain.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

A Joint Local-Global Approach for Medical Terminology Assignment.
Proceedings of the Medical Information Retrieval Workshop at SIGIR co-located with the 37th annual international ACM SIGIR conference (ACM SIGIR 2014), 2014

2013
Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information.
IEEE Trans. Multim., 2013

2012
Oracle in Image Search: A Content-Based Approach to Performance Prediction.
ACM Trans. Inf. Syst., 2012

Multimedia Question Answering.
IEEE Multim., 2012

Harvesting visual concepts for image search with complex queries.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

The Use of Dependency Relation Graph to Enhance the Term Weighting in Question Retrieval.
Proceedings of the COLING 2012, 2012

A Semi-Supervised Bayesian Network Model for Microblog Topic Classification.
Proceedings of the COLING 2012, 2012

2011
Multimedia answering: enriching text QA with media information.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

2010
TRECVID 2010 Known-item Search by NUS.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Exploring large scale data for multimedia QA: an initial study.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010


  Loading...