Heng Tao Shen

Orcid: 0000-0002-2999-2088

Affiliations:
  • University of Electronic Science and Technology of China, School of Computer Science and Engineering, Chengdu, China
  • University of Queensland, Brisbane, Australia (2004 - 2017)
  • National University of Singapore, Singapore (PhD 2004)


According to our database1, Heng Tao Shen authored at least 546 papers between 2000 and 2024.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2020, "For contributions to large-scale multimedia content understanding, indexing and retrieval".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multi-Grained Attention Network With Mutual Exclusion for Composed Query-Based Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Dual-Branch Hybrid Learning Network for Unbiased Scene Graph Generation.
IEEE Trans. Circuits Syst. Video Technol., March, 2024

Online Unsupervised Domain Adaptation via Reducing Inter- and Intra-Domain Discrepancies.
IEEE Trans. Neural Networks Learn. Syst., January, 2024

Multi-Modal Hashing for Efficient Multimedia Retrieval: A Survey.
IEEE Trans. Knowl. Data Eng., January, 2024

Modeling Hierarchical Uncertainty for Multimodal Emotion Recognition in Conversation.
IEEE Trans. Cybern., January, 2024

Imbalanced Open Set Domain Adaptation via Moving-Threshold Estimation and Gradual Alignment.
IEEE Trans. Multim., 2024

Memory-Based Augmentation Network for Video Captioning.
IEEE Trans. Multim., 2024

ReSParser: Fully Convolutional Multiple Human Parsing With Representative Sets.
IEEE Trans. Multim., 2024

DMH-CL: Dynamic Model Hardness Based Curriculum Learning for Complex Pose Estimation.
IEEE Trans. Multim., 2024

Semantics Disentangling for Cross-Modal Retrieval.
IEEE Trans. Image Process., 2024

Visually Source-Free Domain Adaptation via Adversarial Style Matching.
IEEE Trans. Image Process., 2024

FAFusion: Learning for Infrared and Visible Image Fusion via Frequency Awareness.
IEEE Trans. Instrum. Meas., 2024

Coreset Learning-Based Sparse Black-Box Adversarial Attack for Video Recognition.
IEEE Trans. Inf. Forensics Secur., 2024

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning.
CoRR, 2024

CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model.
CoRR, 2024

Learning with Imbalanced Noisy Data by Preventing Bias in Sample Selection.
CoRR, 2024

Weakly-Supervised Mirror Detection via Scribble Annotations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
UNTIE: Clustering analysis with disentanglement in multi-view information fusion.
Inf. Fusion, December, 2023

Composition-Aware Image Steganography Through Adversarial Self-Generated Supervision.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

KE-RCNN: Unifying Knowledge-Based Reasoning Into Part-Level Attribute Parsing.
IEEE Trans. Cybern., November, 2023

Adaptive Fine-Grained Predicates Learning for Scene Graph Generation.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Multi-level Attention-based Domain Disentanglement for BCDR.
ACM Trans. Inf. Syst., October, 2023

Visual Embedding Augmentation in Fourier Domain for Deep Metric Learning.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

Open Set Domain Adaptation via Joint Alignment and Category Separation.
IEEE Trans. Neural Networks Learn. Syst., September, 2023

Work Together: Correlation-Identity Reconstruction Hashing for Unsupervised Cross-Modal Retrieval.
IEEE Trans. Knowl. Data Eng., September, 2023

Less is Better: Exponential Loss for Cross-Modal Matching.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

On the Imaginary Wings: Text-Assisted Complex-Valued Fusion Network for Fine-Grained Visual Classification.
IEEE Trans. Neural Networks Learn. Syst., August, 2023

Classification Certainty Maximization for Unsupervised Domain Adaptation.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Dual-Aligned Feature Confusion Alleviation for Generalized Zero-Shot Learning.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Uneven Bi-Classifier Learning for Domain Adaptation.
IEEE Trans. Circuits Syst. Video Technol., July, 2023

Relation-mining self-attention network for skeleton-based human action recognition.
Pattern Recognit., July, 2023

Modality-Invariant Asymmetric Networks for Cross-Modal Hashing.
IEEE Trans. Knowl. Data Eng., May, 2023

Category Alignment Adversarial Learning for Cross-Modal Retrieval.
IEEE Trans. Knowl. Data Eng., May, 2023

Region Attention Enhanced Unsupervised Cross-Domain Facial Emotion Recognition.
IEEE Trans. Knowl. Data Eng., April, 2023

Language-Augmented Pixel Embedding for Generalized Zero-Shot Learning.
IEEE Trans. Circuits Syst. Video Technol., March, 2023

Label-Guided Generative Adversarial Network for Realistic Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Heterogeneous Knowledge Network for Visual Dialog.
IEEE Trans. Circuits Syst. Video Technol., February, 2023

TEVL: Trilinear Encoder for Video-language Representation Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation.
IEEE Trans. Multim., 2023

Learning MLatent Representations for Generalized Zero-Shot Learning.
IEEE Trans. Multim., 2023

Multi-Modal Transformer With Global-Local Alignment for Composed Query Image Retrieval.
IEEE Trans. Multim., 2023

Quaternion Relation Embedding for Scene Graph Generation.
IEEE Trans. Multim., 2023

AMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D Correspondences.
IEEE Trans. Multim., 2023

Attention Map Guided Transformer Pruning for Occluded Person Re-Identification on Edge Device.
IEEE Trans. Multim., 2023

Adversarial Mixup Ratio Confusion for Unsupervised Domain Adaptation.
IEEE Trans. Multim., 2023

Self-Supervised Fine-Grained Cycle-Separation Network (FSCN) for Visual-Audio Separation.
IEEE Trans. Multim., 2023

Revisiting Multi-Codebook Quantization.
IEEE Trans. Image Process., 2023

From Global to Local: Multi-Scale Out-of-Distribution Detection.
IEEE Trans. Image Process., 2023

Adaptive Feature Projection With Distribution Alignment for Deep Incomplete Multi-View Clustering.
IEEE Trans. Image Process., 2023

Spherical Centralized Quantization for Fast Image Retrieval.
IEEE Trans. Image Process., 2023

End-to-End Pre-Training With Hierarchical Matching and Momentum Contrast for Text-Video Retrieval.
IEEE Trans. Image Process., 2023

Hierarchical Co-Attention Propagation Network for Zero-Shot Video Object Segmentation.
IEEE Trans. Image Process., 2023

Hierarchical Graph Pattern Understanding for Zero-Shot Video Object Segmentation.
IEEE Trans. Image Process., 2023

Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment.
IEEE Trans. Image Process., 2023

Privacy-Preserving Adaptive Remaining Useful Life Prediction via Source-Free Domain Adaption.
IEEE Trans. Instrum. Meas., 2023

TFUN: Trilinear Fusion Network for Ternary Image-Text Retrieval.
Inf. Fusion, 2023

Preface to the Special Issue on Multimodal Learning Integrated with Pre-training Techniques.
Int. J. Softw. Informatics, 2023

ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement.
CoRR, 2023

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval.
CoRR, 2023

Hierarchical Graph Pattern Understanding for Zero-Shot VOS.
CoRR, 2023

Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control.
CoRR, 2023

Towards Redundancy-Free Sub-networks in Continual Learning.
CoRR, 2023

MotionZero: Exploiting Motion Priors for Zero-shot Text-to-Video Generation.
CoRR, 2023

BatchNorm-based Weakly Supervised Video Anomaly Detection.
CoRR, 2023

ACT: Adversarial Consistency Models.
CoRR, 2023

DePT: Decoupled Prompt Tuning.
CoRR, 2023

MSFlow: Multi-Scale Flow-based Framework for Unsupervised Anomaly Detection.
CoRR, 2023

Cross-Modal Retrieval: A Systematic Review of Methods and Future Directions.
CoRR, 2023

CIParsing: Unifying Causality Properties into Multiple Human Parsing.
CoRR, 2023

Informative Scene Graph Generation via Debiasing.
CoRR, 2023

Generalized Unbiased Scene Graph Generation.
CoRR, 2023

Part-Aware Transformer for Generalizable Person Re-identification.
CoRR, 2023

Feature Noise Boosts DNN Generalization under Label Noise.
CoRR, 2023

Holistic Prototype Attention Network for Few-Shot VOS.
CoRR, 2023

AnoOnly: Semi-Supervised Anomaly Detection without Loss on Normal Data.
CoRR, 2023

An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization.
CoRR, 2023

Using Caterpillar to Nibble Small-Scale Images.
CoRR, 2023

Faster Video Moment Retrieval with Point-Level Supervision.
CoRR, 2023

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure.
CoRR, 2023

Instance-Variant Loss with Gaussian RBF Kernel for 3D Cross-modal Retriveal.
CoRR, 2023

T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering.
CoRR, 2023

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement.
CoRR, 2023

Co-attention Propagation Network for Zero-Shot Video Object Segmentation.
CoRR, 2023

Attention Map Guided Transformer Pruning for Edge Device.
CoRR, 2023

Investigating and Mitigating the Side Effects of Noisy Views in Multi-view Clustering in Practical Scenarios.
CoRR, 2023

ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding.
CoRR, 2023

A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
CoRR, 2023

ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction.
CoRR, 2023

Imbalanced Open Set Domain Adaptation via Moving-threshold Estimation and Gradual Alignment.
CoRR, 2023

A Comprehensive Survey on Source-free Domain Adaptation.
CoRR, 2023

Multimodal Apology: Using WebXR to Repair Trust with Virtual Companion.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Self-Weighted Contrastive Learning among Multiple Views for Mitigating Representation Degeneration.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Precise Target-Oriented Attack against Deep Hashing-based Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Depth-Aware Sparse Transformer for Video-Language Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Self-Relational Graph Convolution Network for Skeleton-Based Action Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Open-Scenario Domain Adaptive Object Detection in Autonomous Driving.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Style-Controllable Generalized Person Re-identification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

DCEL: Deep Cross-modal Evidential Learning for Text-Based Person Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Faster Video Moment Retrieval with Point-Level Supervision.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Cross-modality Representation Interactive Learning for Multimodal Sentiment Analysis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CUCL: Codebook for Unsupervised Continual Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Joint Searching and Grounding: Multi-Granularity Video Content Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Co-assistant Networks for Label Correction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

A Universal Unbiased Method for Classification from Aggregate Observations.
Proceedings of the International Conference on Machine Learning, 2023

Disentangled Multiplex Graph Representation Learning.
Proceedings of the International Conference on Machine Learning, 2023

Relational Temporal Graph Convolutional Networks for Ranking-Based Stock Prediction.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DETA: Denoised Task Adaptation for Few-Shot Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Part-Aware Transformer for Generalizable Person Re-identification.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multivariate, Multi-Frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognition in Conversation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multilateral Semantic Relations Modeling for Image Text Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multiplex Graph Representation Learning via Common and Private Information Mining.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
MetaMixUp: Learning Adaptive Interpolation Policy of MixUp With Metalearning.
IEEE Trans. Neural Networks Learn. Syst., 2022

Adversarial Entropy Optimization for Unsupervised Domain Adaptation.
IEEE Trans. Neural Networks Learn. Syst., 2022

Mind the Remainder: Taylor's Theorem View on Recurrent Neural Networks.
IEEE Trans. Neural Networks Learn. Syst., 2022

One-Shot Image-to-Image Translation via Part-Global Learning With a Multi-Adversarial Framework.
IEEE Trans. Multim., 2022

Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning.
IEEE Trans. Multim., 2022

Cross-Modal Dynamic Networks for Video Moment Retrieval With Text Query.
IEEE Trans. Multim., 2022

AgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANs.
IEEE Trans. Multim., 2022

View-Invariant Human Action Recognition Via View Transformation Network (VTN).
IEEE Trans. Multim., 2022

Push & Pull: Transferable Adversarial Examples With Attentive Attack.
IEEE Trans. Multim., 2022

Answer Again: Improving VQA With Cascaded-Answering Model.
IEEE Trans. Knowl. Data Eng., 2022

Faster Domain Adaptation Networks.
IEEE Trans. Knowl. Data Eng., 2022

Video Question Answering With Prior Knowledge and Object-Sensitive Learning.
IEEE Trans. Image Process., 2022

Continual Referring Expression Comprehension via Dual Modular Memorization.
IEEE Trans. Image Process., 2022

Hierarchical Representation Network With Auxiliary Tasks for Video Captioning and Video Question Answering.
IEEE Trans. Image Process., 2022

Weighted Adversarial Domain Adaptation for Machine Remaining Useful Life Prediction.
IEEE Trans. Instrum. Meas., 2022

Domain Adaptive Remaining Useful Life Prediction With Transformer.
IEEE Trans. Instrum. Meas., 2022

Learning Cross-Modal Common Representations by Private-Shared Subspaces Separation.
IEEE Trans. Cybern., 2022

Investigating the Bilateral Connections in Generative Zero-Shot Learning.
IEEE Trans. Cybern., 2022

Relation Regularized Scene Graph Generation.
IEEE Trans. Cybern., 2022

Flow-Edge Guided Unsupervised Video Object Segmentation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Progressive Meta-Learning With Curriculum.
IEEE Trans. Circuits Syst. Video Technol., 2022

Action-Centric Relation Transformer Network for Video Question Answering.
IEEE Trans. Circuits Syst. Video Technol., 2022

UAV-Satellite View Synthesis for Cross-View Geo-Localization.
IEEE Trans. Circuits Syst. Video Technol., 2022

Modeling Two-Stream Correspondence for Visual Sound Separation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Universal Weighting Metric Learning for Cross-Modal Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Divergence-Agnostic Unsupervised Domain Adaptation by Adversarial Attacks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Semantic guided knowledge graph for large-scale zero-shot learning.
J. Vis. Commun. Image Represent., 2022

Comprehensive Framework of Early and Late Fusion for Image-Sentence Retrieval.
IEEE Multim., 2022

Semantic Enhanced Knowledge Graph for Large-Scale Zero-Shot Learning.
CoRR, 2022

Visual Commonsense-aware Representation Network for Video Captioning.
CoRR, 2022

Thunder: Thumbnail based Fast Lightweight Image Denoising Network.
CoRR, 2022

Practical No-box Adversarial Attacks with Training-free Hybrid Image Transformation.
CoRR, 2022

Structure-Aware Semantic-Aligned Network for Universal Cross-Domain Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Multimodal Disentanglement Variational AutoEncoders for Zero-Shot Cross-Modal Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

A Lower Bound of Hash Codes' Performance.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Free-Lunch for Cross-Domain Few-Shot Learning: Style-Aware Episodic Training with Robust Contrastive Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Alleviating Style Sensitivity then Adapting: Source-free Domain Adaptation for Medical Image Segmentation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Point to Rectangle Matching for Image Text Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Rethinking Open-World Object Detection in Autonomous Driving Scenarios.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

ARRA: Absolute-Relative Ranking Attack against Image Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DHHN: Dual Hierarchical Hybrid Network for Weakly-Supervised Audio-Visual Video Parsing.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Global-Local Cross-View Fisher Discrimination for View-Invariant Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Non-Autoregressive Cross-Modal Coherence Modelling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Support-Set Based Multi-Modal Representation Enhancement for Video Captioning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Meta Distribution Alignment for Generalizable Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Fine-Grained Predicates Learning for Scene Graph Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Semi-supervised Video Paragraph Grounding with Contrastive Encoder.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TVT: Three-Way Vision Transformer through Multi-Modal Hypersphere Learning for Zero-Shot Sketch-Based Image Retrieval.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Cross-Modal Hybrid Feature Fusion for Image-Sentence Matching.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Zero-shot Cross-modal Retrieval by Assembling AutoEncoder and Generative Adversarial Network.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Inductive Structure Consistent Hashing via Flexible Semantic Calibration.
IEEE Trans. Neural Networks Learn. Syst., 2021

Radial Graph Convolutional Network for Visual Question Generation.
IEEE Trans. Neural Networks Learn. Syst., 2021

Half-Quadratic Minimization for Unsupervised Feature Selection on Incomplete Data.
IEEE Trans. Neural Networks Learn. Syst., 2021

Large Factor Image Super-Resolution With Cascaded Convolutional Neural Networks.
IEEE Trans. Multim., 2021

Deep Collaborative Discrete Hashing With Semantic-Invariant Structure Construction.
IEEE Trans. Multim., 2021

Interclass-Relativity-Adaptive Metric Learning for Cross-Modal Matching and Beyond.
IEEE Trans. Multim., 2021

Exploiting Subspace Relation in Semantic Labels for Cross-Modal Hashing.
IEEE Trans. Knowl. Data Eng., 2021

On Both Cold-Start and Long-Tail Recommendation with Social Data.
IEEE Trans. Knowl. Data Eng., 2021

Salience-Guided Iterative Asymmetric Mutual Hashing for Fast Person Re-Identification.
IEEE Trans. Image Process., 2021

Adversarial Attack Against Urban Scene Segmentation for Autonomous Vehicles.
IEEE Trans. Ind. Informatics, 2021

Remote Sensing Image Super-Resolution via Mixed High-Order Attention Network.
IEEE Trans. Geosci. Remote. Sens., 2021

Deep Fuzzy Hashing Network for Efficient Image Retrieval.
IEEE Trans. Fuzzy Syst., 2021

Adversarial Energy Disaggregation.
Trans. Data Sci., 2021

Adaptive Component Embedding for Domain Adaptation.
IEEE Trans. Cybern., 2021

Multi-Branch Networks for Video Super-Resolution With Dynamic Reconstruction Strategy.
IEEE Trans. Circuits Syst. Video Technol., 2021

Arbitrary-View Human Action Recognition: A Varying-View RGB-D Action Dataset.
IEEE Trans. Circuits Syst. Video Technol., 2021

View-invariant action recognition via Unsupervised AttentioN Transfer (UANT).
Pattern Recognit., 2021

Arbitrary-view human action recognition via novel-view action generation.
Pattern Recognit., 2021

Lightweight dynamic conditional GAN with pyramid attention for text-to-image synthesis.
Pattern Recognit., 2021

Maximum Density Divergence for Domain Adaptation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Reducing bias to source samples for unsupervised domain adaptation.
Neural Networks, 2021

Adaptive Square Attack: Fooling Autonomous Cars With Adversarial Traffic Signs.
IEEE Internet Things J., 2021

Fusing functional connectivity with network nodal information for sparse network pattern learning of functional brain networks.
Inf. Fusion, 2021

Heterogeneous data fusion for predicting mild cognitive impairment conversion.
Inf. Fusion, 2021

Adversarial Energy Disaggregation for Non-intrusive Load Monitoring.
CoRR, 2021

Staircase Sign Method for Boosting Adversarial Attacks.
CoRR, 2021

Hybrid Fusion with Intra- and Cross-Modality Attention for Image-Recipe Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Scene Text Image Super-Resolution via Parallelly Contextual Attention Network.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multi-scale Dynamic Network for Temporal Action Detection.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Graph Convolutional Hourglass Networks for Skeleton-Based Action Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

SKANet: Structured Knowledge-Aware Network for Visual Dialog.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Attention-Based Relation Reasoning Network for Video-Text Retrieval.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Combine Early and Late Fusion Together: A Hybrid Fusion Framework for Image-Text Matching.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

From General to Specific: Informative Scene Graph Generation via Balance Adjustment.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Stage Aggregated Transformer Network for Temporal Language Localization in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Enhancing Audio-Visual Association with Self-Supervised Curriculum Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

RSGNet: Relation based Skeleton Graph Network for Crowded Scenes Pose Estimation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Kernel Attention Network for Single Image Super-Resolution.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Exploiting Web Images for Multi-Output Classification: From Category to Subcategories.
IEEE Trans. Neural Networks Learn. Syst., 2020

Cross-Modal Attention With Semantic Consistence for Image-Text Matching.
IEEE Trans. Neural Networks Learn. Syst., 2020

Efficient Supervised Discrete Multi-View Hashing for Large-Scale Multimedia Search.
IEEE Trans. Multim., 2020

Towards Automatic Construction of Diverse, High-Quality Image Datasets.
IEEE Trans. Knowl. Data Eng., 2020

Temporal Reasoning Graph for Activity Recognition.
IEEE Trans. Image Process., 2020

A Context Knowledge Map Guided Coarse-to-Fine Action Recognition.
IEEE Trans. Image Process., 2020

Graph Convolutional Network Hashing.
IEEE Trans. Cybern., 2020

Ternary Adversarial Networks With Self-Supervision for Zero-Shot Cross-Modal Retrieval.
IEEE Trans. Cybern., 2020

Bidirectional Discrete Matrix Factorization Hashing for Image Search.
IEEE Trans. Cybern., 2020

A Survey of Human Action Analysis in HRI Applications.
IEEE Trans. Circuits Syst. Video Technol., 2020

Deep quantization generative networks.
Pattern Recognit., 2020

Play and rewind: Context-aware video temporal action proposals.
Pattern Recognit., 2020

The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Hierarchical LSTMs with Adaptive Attention for Visual Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Impulsive synchronization of coupled delayed neural networks with actuator saturation and its application to image encryption.
Neural Networks, 2020

Similarity preserving feature generating networks for zero-shot learning.
Neurocomputing, 2020

Unified Binary Generative Adversarial Network for Image Retrieval and Compression.
Int. J. Comput. Vis., 2020

Patch-wise++ Perturbation for Adversarial Targeted Attacks.
CoRR, 2020

Dual ResGCN for Balanced Scene GraphGeneration.
CoRR, 2020

Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.
CoRR, 2020

Data-Driven Spatio-Temporal Analysis via Multi-Modal Zeitgebers and Cognitive Load in VR.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2020

Correlated Features Synthesis and Alignment for Zero-shot Cross-modal Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

3D Self-Attention for Unsupervised Video Quantization.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

DMCR-GAN: Adversarial Denoising for Monte Carlo Renderings with Residual Attention Networks and Hierarchical Features Modulation of Auxiliary Buffers.
Proceedings of the SA '20: SIGGRAPH Asia 2020 Technical Communications, 2020

Graph-based variational auto-encoder for generalized zero-shot learning.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Self-supervised adversarial learning for cross-modal retrieval.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

EvoGAN: an evolutionary GAN for face aging and rejuvenation.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Temporal Denoising Mask Synthesis Network for Learning Blind Video Temporal Consistency.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Learning Optimization-based Adversarial Perturbations for Attacking Sequential Recognition Models.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

KTN: Knowledge Transfer Network for Multi-person DensePose Estimation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

One-shot Scene Graph Generation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Lab2Pix: Label-Adaptive Generative Adversarial Network for Unsupervised Image Synthesis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Bottom-up and Top-down: Bidirectional Additive Net for Edge Detection.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

CC-LSTM: Cross and Conditional Long-Short Time Memory for Video Captioning.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Ocean: A Dual Learning Approach For Generalized Zero-Shot Sketch-Based Image Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Patch-Wise Attack for Fooling Deep Neural Network.
Proceedings of the Computer Vision - ECCV 2020, 2020

What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Universal Weighting Metric Learning for Cross-Modal Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Searching for Actions on the Hyperbole.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Embedding and predicting the event at early stage.
World Wide Web, 2019

From Deterministic to Generative: Multimodal Stochastic RNNs for Video Captioning.
IEEE Trans. Neural Networks Learn. Syst., 2019

Heterogeneous Domain Adaptation Through Progressive Alignment.
IEEE Trans. Neural Networks Learn. Syst., 2019

Hierarchical Multi-Clue Modelling for POI Popularity Prediction with Heterogeneous Tourist Information.
IEEE Trans. Knowl. Data Eng., 2019

More is Better: Precise and Detailed Image Captioning Using Online Positive Recall and Missing Concepts Mining.
IEEE Trans. Image Process., 2019

Scalable Zero-Shot Learning via Binary Visual-Semantic Embeddings.
IEEE Trans. Image Process., 2019

Locality Preserving Joint Transfer for Domain Adaptation.
IEEE Trans. Image Process., 2019

Collective Reconstructive Embeddings for Cross-Modal Hashing.
IEEE Trans. Image Process., 2019

Transfer Independently Together: A Generalized Framework for Domain Adaptation.
IEEE Trans. Cybern., 2019

Describing Video With Attention-Based Bidirectional LSTM.
IEEE Trans. Cybern., 2019

Towards Accurate Georeferenced Video Search With Camera Field of View Modeling.
IEEE Trans. Circuits Syst. Video Technol., 2019

Fusion by synthesizing: A multi-view deep neural network for zero-shot recognition.
Signal Process., 2019

Order-aware convolutional pooling for video based action recognition.
Pattern Recognit., 2019

Binary Multi-View Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Web-based SBLR method of multimedia tools for computer-aided drawing.
Multim. Tools Appl., 2019

Cross-domain facial expression recognition via an intra-category common feature and inter-category Distinction feature fusion network.
Neurocomputing, 2019

Special Issue of APWeb-WAIM 2019.
Data Sci. Eng., 2019

Cooperative Cross-Stream Network for Discriminative Action Representation.
CoRR, 2019

MetaMixUp: Learning Adaptive Interpolation Policy of MixUp with Meta-Learning.
CoRR, 2019

A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition.
CoRR, 2019

Statistical Karyotype Analysis Using CNN and Geometric Optimization.
IEEE Access, 2019

Residual Graph Convolutional Networks for Zero-Shot Learning.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Generative Reconstructive Hashing for Incomplete Video Analysis.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Time-aware Session Embedding for Click-Through-Rate Prediction.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learnable Aggregating Net with Diversity Learning for Video Question Answering.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Attention Transfer (ANT) Network for View-invariant Action Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Adaptive Multi-Path Aggregation for Human DensePose Estimation in the Wild.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Deep Recurrent Quantization for Generating Sequential Binary Codes.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Beyond Product Quantization: Deep Progressive Quantization for Image Retrieval.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Sequence-To-Sequence Domain Adaptation Network for Robust Text Image Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exact Adversarial Attack to Image Captioning via Structured Output Learning With Latent Variables.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Template-Based Math Word Problem Solvers with Recursive Neural Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Structured Two-Stream Attention Network for Video Question Answering.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Deliberate Attention Networks for Image Captioning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Perceptual Pyramid Adversarial Networks for Text-to-Image Synthesis.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

MR-NET: Exploiting Mutual Relation for Visual Relationship Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Video Sequence Indexing.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Principal Component Analysis.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Near-Duplicate Retrieval.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Multidimensional Scaling.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Dimensionality Reduction.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Personalized semantic trajectory privacy preservation through trajectory reconstruction.
World Wide Web, 2018

Augmented keyword search on spatial entity databases.
VLDB J., 2018

Exploring Auxiliary Context: Discrete Semantic Transfer Hashing for Scalable Image Retrieval.
IEEE Trans. Neural Networks Learn. Syst., 2018

Video Captioning by Adversarial LSTM.
IEEE Trans. Image Process., 2018

Hashing with Angular Reconstructive Embeddings.
IEEE Trans. Image Process., 2018

Recognition and Detection of Two-Person Interactive Actions Using Automatically Selected Skeleton Features.
IEEE Trans. Hum. Mach. Syst., 2018

View-Consistent MeshFlow for Stereoscopic Video Stabilization.
IEEE Trans. Computational Imaging, 2018

One-shot learning based pattern transition map for action early recognition.
Signal Process., 2018

Trajectory Simplification: An Experimental Study and Quality Analysis.
Proc. VLDB Endow., 2018

Robust discrete code modeling for supervised hashing.
Pattern Recognit., 2018

A Survey on Learning to Hash.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Unsupervised Deep Hashing with Similarity-Adaptive and Discrete Optimization.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Stroke-based stylization by learning sequential drawing examples.
J. Vis. Commun. Image Represent., 2018

Deep appearance and motion learning for egocentric activity recognition.
Neurocomputing, 2018

Learning binary codes with local and inner data structure.
Neurocomputing, 2018

The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers.
CoRR, 2018

GraphCAR: Content-aware Multimedia Recommendation with Graph Autoencoder.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

DT-Zheng: digital twin method for Zheng musical instrument.
Proceedings of the SIGGRAPH Asia 2018 Posters, Tokyo, Japan, December 04-07, 2018, 2018

Cumulative Nets for Edge Detection.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Video-based Person Re-identification via Self-Paced Learning and Deep Reinforcement Learning Framework.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Pseudo Transfer with Marginalized Corrupted Attribute for Zero-shot Learning.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Large-scale RGB-D Database for Arbitrary-view Human Action Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Visual Spatial Attention Network for Relationship Detection.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Examine before You Answer: Multi-task Learning with Adaptive-attentions for Multiple-choice VQA.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Feature Reconstruction by Laplacian Eigenmaps for Efficient Instance Search.
Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018

From Pixels to Objects: Cubic Visual Attention for Visual Question Answering.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Dual Conditional GANs for Face Aging and Rejuvenation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Coarse-to-fine Image Co-segmentation with Intra and Inter Rank Constraints.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Person Re-identification Using Two-Stage Convolutional Neural Network.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Unpaired Image-to-Image Translation from Shared Deep Space.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

A Graph-Theoretic Fusion Framework for Unsupervised Entity Resolution.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Continuous Proximity Detection via Predictive Safe Region Construction.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Generative Domain-Migration Hashing for Sketch-to-Image Retrieval.
Proceedings of the Computer Vision - ECCV 2018, 2018

Highly-Economized Multi-view Binary Compression for Scalable Image Clustering.
Proceedings of the Computer Vision - ECCV 2018, 2018

TBN: Convolutional Neural Network with Ternary Inputs and Binary Weights.
Proceedings of the Computer Vision - ECCV 2018, 2018

MathDQN: Solving Arithmetic Word Problems via Deep Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Deep Region Hashing for Generic Instance Search from Images.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Binary Generative Adversarial Networks for Image Retrieval.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Distributed shortest path query processing on dynamic road networks.
VLDB J., 2017

Compact Indexing and Judicious Searching for Billion-Scale Microblog Retrieval.
ACM Trans. Inf. Syst., 2017

Processing Long Queries Against Short Text: Top-<i>k</i> Advertisement Matching in News Stream Applications.
ACM Trans. Inf. Syst., 2017

Targeted Advertising in Public Transportation Systems with Quantitative Evaluation.
ACM Trans. Inf. Syst., 2017

Asymmetric Binary Coding for Image Search.
IEEE Trans. Multim., 2017

Video Captioning With Attention-Based LSTM and Semantic Consistency.
IEEE Trans. Multim., 2017

Discrete Nonnegative Spectral Clustering.
IEEE Trans. Knowl. Data Eng., 2017

IF-Matching: Towards Accurate Map-Matching with Information Fusion.
IEEE Trans. Knowl. Data Eng., 2017

Bilinear Optimized Product Quantization for Scalable Visual Content Analysis.
IEEE Trans. Image Process., 2017

Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval.
IEEE Trans. Image Process., 2017

Hierarchical Latent Concept Discovery for Video Event Detection.
IEEE Trans. Image Process., 2017

Robust Web Image Annotation via Exploring Multi-Facet and Structural Knowledge.
IEEE Trans. Image Process., 2017

Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation.
IEEE Trans. Image Process., 2017

Semi-Paired Discrete Hashing: Learning Latent Hash Codes for Semi-Paired Cross-View Retrieval.
IEEE Trans. Cybern., 2017

Temporal Pyramid Pooling-Based Convolutional Neural Network for Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2017

Beyond Frame-level CNN: Saliency-Aware 3-D CNN With LSTM for Video Action Recognition.
IEEE Signal Process. Lett., 2017

Compositional Model Based Fisher Vector Coding for Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Editorial: Good practices in multimedia modeling.
Neurocomputing, 2017

Structured Learning of Binary Codes with Column Generation for Optimizing Ranking Measures.
Int. J. Comput. Vis., 2017

Towards Automatic Construction of Diverse, High-quality Image Dataset.
CoRR, 2017

From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning.
CoRR, 2017

Deep Region Hashing for Efficient Large-scale Instance Search from Images.
CoRR, 2017

Classification by Retrieval: Binarizing Data and Classifiers.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Event Early Embedding: Predicting Event Volume Dynamics at Early Stage.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Unifying Multi-Source Social Media Data for Personalized Travel Route Planning.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

POI Popularity Prediction via Hierarchical Fusion of Multiple Social Clues.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Cosmetic-vis: sample-based 3D facial editor for cosmetic medical visualization.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2017

Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark Search.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Efficient Binary Coding for Subspace-based Query-by-Image Video Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

A System for Spatiotemporal Anomaly Localization in Surveillance Videos.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adversarial Cross-Modal Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Deep Asymmetric Pairwise Hashing.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Two Birds One Stone: On both Cold-Start and Long-Tail Recommendation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Local Deep Descriptors in Bag-of-Words for Image Retrieval.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Adaptively Attending to Visual Attributes and Linguistic Knowledge for Captioning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

CFM@MediaEval 2017 Retrieving Diverse Social Images Task via Re-ranking and Hierarchical Clustering.
Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

BMC@MediaEval 2017 Multimedia Satellite Task via Regression Random Forest.
Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Attribute hashing for zero-shot image retrieval.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Asymmetric sparse hashing.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Unsupervised cross-modal retrieval through adversarial learning.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Preserving-Ignoring Transformation Based Index for Approximate k Nearest Neighbor Search.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

IF-Matching: Towards Accurate Map-Matching with Information Fusion.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Leveraging Weak Semantic Relevance for Complex Video Event Classification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

WebPainter: Collaborative Stroke-Based Rendering Through HTML5 and WebGL.
Proceedings of the E-Learning and Games - 11th International Conference, 2017

Matrix Tri-Factorization with Manifold Regularizations for Zero-Shot Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Multi-attention Network for One Shot Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Movie Fill in the Blank with Adaptive Temporal Attention and Description Update.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Deep Semantic Indexing Using Convolutional Localization Network with Region-Based Visual Attention for Image Database.
Proceedings of the Databases Theory and Applications, 2017

A Deep Approach for Multi-modal User Attribute Modeling.
Proceedings of the Databases Theory and Applications, 2017

Efficient Supervised Hashing via Exploring Local and Inner Data Structure.
Proceedings of the Databases Theory and Applications, 2017

Jointly Learning Attentions with Semantic Cross-Modal Correlation for Visual Question Answering.
Proceedings of the Databases Theory and Applications, 2017

An Integrated Model for Effective Saliency Prediction.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Web-Based Semantic Fragment Discovery for On-Line Lingual-Visual Similarity.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Event Video Mashup: From Hundreds of Videos to Minutes of Skeleton.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Scalable Video Event Retrieval by Visual State Binary Embedding.
IEEE Trans. Multim., 2016

A Distance-Computation-Free Search Scheme for Binary Code Databases.
IEEE Trans. Multim., 2016

Web Video Event Recognition by Semantic Analysis From Ubiquitous Documents.
IEEE Trans. Image Process., 2016

Dual Diversified Dynamical Gaussian Process Latent Variable Model for Video Repairing.
IEEE Trans. Image Process., 2016

Optimized Graph Learning Using Partial Tags and Multiple Features for Image and Video Annotation.
IEEE Trans. Image Process., 2016

A Fast Optimization Method for General Binary Code Learning.
IEEE Trans. Image Process., 2016

Robust Cross-view Hashing for Multimedia Retrieval.
IEEE Signal Process. Lett., 2016

Face image classification by pooling raw features.
Pattern Recognit., 2016

Robust regression based face recognition with fast outlier removal.
Multim. Tools Appl., 2016

Binary Subspace Coding for Query-by-Image Video Retrieval.
CoRR, 2016

Hi Detector, What's Wrong with that Object? Identifying Irregular Object From Images by Modelling the Detection Score Distribution.
CoRR, 2016

Learning Binary Codes and Binary Weights for Efficient Classification.
CoRR, 2016

Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering.
CoRR, 2016

Structured Learning of Binary Codes with Column Generation.
CoRR, 2016

Where to Focus: Query Adaptive Matching for Instance Retrieval Using Convolutional Feature Maps.
CoRR, 2016

Bidirectional Long-Short Term Memory for Video Description.
CoRR, 2016

Cross-modal Retrieval with Label Completion.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Attention-based LSTM with Semantic Consistency for Videos Captioning.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Quartet-net Learning for Visual Instance Retrieval.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Bidirectional Long-Short Term Memory for Video Description.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Zero-Shot Hashing via Transferring Supervised Knowledge.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Discriminant Cross-modal Hashing.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

A Unified Framework for Discrete Spectral Clustering.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

What's Wrong with That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Graph-without-cut: An Ideal Graph Learning for Image Segmentation.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Tag Features for Geo-Aware Image Classification.
IEEE Trans. Multim., 2015

Optimized Cartesian K-Means.
IEEE Trans. Knowl. Data Eng., 2015

Hashing on Nonlinear Manifolds.
IEEE Trans. Image Process., 2015

Multitask Spectral Clustering by Exploring Intertask Correlation.
IEEE Trans. Cybern., 2015

Robust Discrete Spectral Hashing for Large-Scale Image Semantic Indexing.
IEEE Trans. Big Data, 2015

Max-margin adaptive model for complex video pattern recognition.
Multim. Tools Appl., 2015

Temporal Pyramid Pooling Based Convolutional Neural Networks for Action Recognition.
CoRR, 2015

Geographical Constraint and Temporal Similarity Modeling for Point-of-Interest Recommendation.
Proceedings of the Web Information Systems Engineering - WISE 2015, 2015

UQMG @ TRECVID 2015: Instance Search.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Zero-shot Image Categorization by Image Correlation Exploration.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Learning Binary Codes for Maximum Inner Product Search.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Supervised Discrete Hashing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Optimal graph learning with partial tags and multiple features for image and video annotation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Guest Editorial Special Section on Socio-Mobile Media Analysis and Retrieval.
IEEE Trans. Multim., 2014

Effectively Indexing the Multidimensional Uncertain Objects.
IEEE Trans. Knowl. Data Eng., 2014

On the Influence Propagation of Web Videos.
IEEE Trans. Knowl. Data Eng., 2014

SK-LSH: An Efficient Index Structure for Approximate Nearest Neighbor Search.
Proc. VLDB Endow., 2014

Hashing for Similarity Search: A Survey.
CoRR, 2014

Face Identification with Second-Order Pooling.
CoRR, 2014

Face Image Classification by Pooling Raw Features.
CoRR, 2014

WeMash: An Online System for Web Video Mashup.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Optimized Distances for Binary Code Ranking.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

UQ-DKE's Participation at MediaEval 2014 Placing Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

2013
Effective transfer tagging from image to video.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Sparse hashing for fast multimedia search.
ACM Trans. Inf. Syst., 2013

Video-to-Shot Tag Propagation by Graph Sparse Group Lasso.
IEEE Trans. Multim., 2013

Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval.
IEEE Trans. Multim., 2013

Discriminative Nonnegative Spectral Clustering with Out-of-Sample Extension.
IEEE Trans. Knowl. Data Eng., 2013

VChunkJoin: An Efficient Algorithm for Edit Similarity Joins.
IEEE Trans. Knowl. Data Eng., 2013

Self-taught dimensionality reduction on the high-dimensional small-sized data.
Pattern Recognit., 2013

Local image tagging via graph regularized joint group sparsity.
Pattern Recognit., 2013

Personalized query evaluation in ring-based P2P networks.
Inf. Sci., 2013

Near-duplicate video retrieval: Current research and future trends.
ACM Comput. Surv., 2013

Inter-media hashing for large-scale retrieval from heterogeneous data sources.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Robust Semantic Video Indexing by Harvesting Web Images.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Linear cross-modal hashing for efficient multimedia search.
Proceedings of the ACM Multimedia Conference, 2013

Presenting diverse location views with real-time near-duplicate photo elimination.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

2012
Automatic tagging by exploring tag information capability and correlation.
World Wide Web, 2012

Quick identification of near-duplicate video sequences with cut signature.
World Wide Web, 2012

Guest editorial: content, concept and context mining in social media.
World Wide Web, 2012

Web and Personal Image Annotation by Mining Label Correlation With Relaxed Visual Graph Embedding.
IEEE Trans. Image Process., 2012

Dimensionality reduction by Mixed Kernel Canonical Correlation Analysis.
Pattern Recognit., 2012

Extracting representative motion flows for effective video retrieval.
Multim. Tools Appl., 2012

Introducing Cloud Computing Topics in Curricula.
J. Inf. Syst. Educ., 2012

Discovering areas of interest with geo-tagged images and check-ins.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Effective Data Density Estimation in Ring-Based P2P Networks.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

2011
Mining multi-tag association for image tagging.
World Wide Web, 2011

Correlation-based retrieval for heavily changed near-duplicate videos.
ACM Trans. Inf. Syst., 2011

Exploring Distributional Discrepancy for Multidimensional Point Set Retrieval.
IEEE Trans. Multim., 2011

UQMSG Experiments for TRECVID 2011.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Effective data co-reduction for multimedia similarity search.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Video-to-shot tag allocation by weighted sparse group lasso.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Transfer tagging from image to video.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Multiple feature hashing for real-time large scale near-duplicate video retrieval.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

l<sub>2, 1</sub>-Norm Regularized Discriminative Feature Selection for Unsupervised Learning.
Proceedings of the IJCAI 2011, 2011

Discovering popular routes from trajectories.
Proceedings of the 27th International Conference on Data Engineering, 2011

Probabilistic Image Tagging with Tags Expanded By Text-Based Search.
Proceedings of the Database Systems for Advanced Applications, 2011

Efficient Histogram-Based Similarity Search in Ultra-High Dimensional Space.
Proceedings of the Database Systems for Advanced Applications, 2011

Tag localization with spatial correlations and joint group sparsity.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Tagging Image with Informative and Correlative Tags.
Proceedings of the Web Technologies and Applications - 13th Asia-Pacific Web Conference, 2011

Nonnegative Spectral Clustering with Discriminative Regularization.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Mining near-duplicate graph for cluster-based reranking of web video search results.
ACM Trans. Inf. Syst., 2010

Practical Online Near-Duplicate Subsequence Detection for Continuous Video Streams.
IEEE Trans. Multim., 2010

Searching trajectories by locations: an efficiency study.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Distributed Cache Indexing for Efficient Subspace Skyline Computation in P2P Networks.
Proceedings of the Database Systems for Advanced Applications, 2010

Efficient and Continuous Near-duplicate Video Detection.
Proceedings of the Advances in Web Technologies and Applications, 2010

2009
Video Sequence Indexing.
Proceedings of the Encyclopedia of Database Systems, 2009

Principal Component Analysis.
Proceedings of the Encyclopedia of Database Systems, 2009

Multidimensional Scaling.
Proceedings of the Encyclopedia of Database Systems, 2009

Dimensionality Reduction.
Proceedings of the Encyclopedia of Database Systems, 2009

Speed up interactive image retrieval.
VLDB J., 2009

Instance optimal query processing in spatial networks.
VLDB J., 2009

Bounded coordinate system indexing for real-time video clip search.
ACM Trans. Inf. Syst., 2009

Effective and Efficient Query Processing for Video Subsequence Identification.
IEEE Trans. Knowl. Data Eng., 2009

Hybrid information retrieval policies based on cooperative cache in mobile P2P networks.
Frontiers Comput. Sci. China, 2009

Monitoring path nearest neighbor in road networks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

A Novel Content Distribution Mechanism in DHT Networks.
Proceedings of the NETWORKING 2009, 2009

Interactive near-duplicate video retrieval and detection.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

High-dimensional indexing with oriented cluster representation for multimedia databases.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Online Near-Duplicate Video Clip Detection and Retrieval: An Accurate and Fast System.
Proceedings of the 25th International Conference on Data Engineering, 2009

Processing Group Nearest Group Query.
Proceedings of the 25th International Conference on Data Engineering, 2009

Instant Advertising in Mobile Peer-to-Peer Networks.
Proceedings of the 25th International Conference on Data Engineering, 2009

Hybrid Retrieval Mechanisms in Vehicle-Based P2P Networks.
Proceedings of the Computational Science, 2009

Dimension-Specific Search for Multimedia Retrieval.
Proceedings of the Database Systems for Advanced Applications, 2009

Video Annotation System Based on Categorizing and Keyword Labelling.
Proceedings of the Database Systems for Advanced Applications, 2009

Efficient information retrieval in mobile peer-to-peer networks.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Large-scale Video Sequence Indexing: Impacts, Ideas and Trends.
Proceedings of the Database Technologies 2009, 2009

2008
A multi-resolution surface distance model for <i>k</i>-NN query processing.
VLDB J., 2008

Batch Nearest Neighbor Search for Video Retrieval.
IEEE Trans. Multim., 2008

Localized Co-Occurrence Model for Fast Approximate Search in 3D Structure Databases.
IEEE Trans. Knowl. Data Eng., 2008

Challenges and techniques for effective and efficient similarity search in large video databases.
Proc. VLDB Endow., 2008

Discovery of convoys in trajectory databases.
Proc. VLDB Endow., 2008

Achieving Effective Multi-term Queries for Fast DHT Information Retrieval.
Proceedings of the Web Information Systems Engineering, 2008

Distribution-based similarity measures for multi-dimensional point set retrieval applications.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Locality condensation: a new dimensionality reduction method for image retrieval.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Convoy Queries in Spatio-Temporal Databases.
Proceedings of the 24th International Conference on Data Engineering, 2008

A Hybrid Prediction Model for Moving Objects.
Proceedings of the 24th International Conference on Data Engineering, 2008

2007
An adaptive and dynamic dimensionality reduction method for high-dimensional indexing.
VLDB J., 2007

Capture local information in shape representation.
Multim. Tools Appl., 2007

A New Similarity Measure for Near Duplicate Video Clip Detection.
Proceedings of the Advances in Data and Web Management, 2007

UQLIPS: A Real-time Near-duplicate Video Clip Detection System.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

The University of Queensland at TRECVID 2007 Search Task.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Dimensionality reduction for dimension-specific search.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Statistical summarization of content features for fast near-duplicate video detection.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Dynamic Batch Nearest Neighbor Search in Video Retrieval.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Multi-source Skyline Query Processing in Road Networks.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Mining Trajectory Patterns Using Hidden Markov Models.
Proceedings of the Data Warehousing and Knowledge Discovery, 9th International Conference, 2007

Dual Dimensionality Reduction for Efficient Video Similarity Search.
Proceedings of the Data Warehousing and Knowledge Discovery, 9th International Conference, 2007

Efficient Similarity Search by Summarization in Large Video Database.
Proceedings of the Database Technologies 2007. Proceedings of the Eighteenth Australasian Database Conference, 2007

Selectivity Estimation by Batch-Query based Histogram and Parametric Method.
Proceedings of the Database Technologies 2007. Proceedings of the Eighteenth Australasian Database Conference, 2007

2006
Indexing and Integrating Multiple Features for WWW Images.
World Wide Web, 2006

A Multiresolution Terrain Model for Efficient Visualization Query Processing.
IEEE Trans. Knowl. Data Eng., 2006

Hierarchical Indexing Structure for Efficient Similarity Search in Video Retrieval.
IEEE Trans. Knowl. Data Eng., 2006

Toward Efficient Multifeature Query Processing.
IEEE Trans. Knowl. Data Eng., 2006

ICICLE: A semantic-based retrieval system for WWW images.
Multim. Syst., 2006

Exploring composite acoustic features for efficient music similarity query.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

SaveRF: Towards Efficient Relevance Feedback Search.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Surface k-NN Query Processing.
Proceedings of the 22nd International Conference on Data Engineering, 2006

3D Protein Structure Matching by Patch Signatures.
Proceedings of the Database and Expert Systems Applications, 17th International Conference, 2006

2005
Semantic Caching for Multiresolution Spatial Query Processing in Mobile Environments.
Proceedings of the Advances in Spatial and Temporal Databases, 9th International Symposium, 2005

Towards Effective Indexing for Very Large Video Sequence Database.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Indexing Text and Visual Features for WWW Images.
Proceedings of the Web Technologies Research and Development - APWeb 2005, 7th Asia-Pacific Web Conference, Shanghai, China, March 29, 2005

Exploring Bit-Difference for Approximate KNN Search in High-dimensional Databases.
Proceedings of the Database Technologies 2005, 2005

2004
Efficient Semantic-Based Content Search in P2P Network.
IEEE Trans. Knowl. Data Eng., 2004

LDC: Enabling Search By Partial Distance In A Hyper-Dimensional Space.
Proceedings of the 20th International Conference on Data Engineering, 2004

Adaptive Quantization of the High-Dimensional Data for Efficient KNN Processing.
Proceedings of the Database Systems for Advances Applications, 2004

Diagonal Ordering: A New Approach to High-Dimensional KNN Processing.
Proceedings of the Database Technologies 2004, 2004

2003
An Adaptive and Efficient Dimensionality Reduction Algorithm for High-Dimensional Indexing.
Proceedings of the 19th International Conference on Data Engineering, 2003

2001
Finding Similar Images Quickly Using Object Shapes.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

2000
Finding semantically related images in the WWW.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Giving meanings to WWW images.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000


  Loading...