Qi Tian

Affiliations:
  • Huawei Noah's Ark Lab
  • University of Texas at San Antonio, Department of Computer Science, TX, USA (2002-2019)
  • Microsoft Research Asia, Beijing, China (2008-2009)
  • University of Illinois at Urbana-Champaign, Champaign, IL, USA (PhD 2002)
  • Drexel University, Philadelphia, PA, USA (until 1996)
  • Tsinghua University, Department of Electronic Engineering, Beijing, China (until 1992)


According to our database, Qi Tian authored at least 911 papers between 1994 and 2024.

Bibliography

2024
One-Bit Supervision for Image Classification: Problem, Solution, and Beyond.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

MgSvF: Multi-Grained Slow versus Fast Framework for Few-Shot Class-Incremental Learning.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Consensus Synergizes With Memory: A Simple Approach for Anomaly Segmentation in Urban Scenes.
IEEE Trans. Circuits Syst. Video Technol., February, 2024

Room-Object Entity Prompting and Reasoning for Embodied Referring Expression.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

Explanatory Object Part Aggregation for Zero-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

Efficient Supervised Graph Embedding Hashing for large-scale cross-media retrieval.
Pattern Recognit., January, 2024

Towards Codebook-Free Deep Probabilistic Quantization for Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

Accurate Fine-Grained Object Recognition with Structure-Driven Relation Graph Networks.
Int. J. Comput. Vis., January, 2024

Structure Aware Multi-Graph Network for Multi-Modal Emotion Recognition in Conversations.
IEEE Trans. Multim., 2024

Multi-Granularity Matching Transformer for Text-Based Person Search.
IEEE Trans. Multim., 2024

GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting.
CoRR, 2024

Towards 3D Molecule-Text Interpretation in Language Models.
CoRR, 2024

ChatterBox: Multi-round Multimodal Referring and Grounding.
CoRR, 2024

Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering.
CoRR, 2024

Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models.
CoRR, 2024

Advancing Incremental Few-Shot Semantic Segmentation via Semantic-Guided Relation Alignment and Adaptation.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

2023
Semantic-Guided Information Alignment Network for Fine-Grained Image Recognition.
IEEE Trans. Circuits Syst. Video Technol., November, 2023

CIPS-3D++: End-to-End Real-Time High-Resolution 3D-Aware GANs for GAN Inversion and Stylization.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

GAIA-Universe: Everything is Super-Netify.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Learnable Distribution Calibration for Few-Shot Class-Incremental Learning.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Self-Supervised Tumor Segmentation With Sim2Real Adaptation.
IEEE J. Biomed. Health Informatics, September, 2023

PSLT: A Light-Weight Vision Transformer With Ladder Self-Attention and Progressive Shift.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Regularized Differentiable Architecture Search.
IEEE Embed. Syst. Lett., September, 2023

A Survey on Label-Efficient Deep Image Segmentation: Bridging the Gap Between Weak Supervision and Dense Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Conformer: Local Features Coupling Global Representations for Recognition and Detection.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

General Greedy De-Bias Learning.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Feature Distillation in Deep Attention Network Against Adversarial Examples.
IEEE Trans. Neural Networks Learn. Syst., July, 2023

Exploring the diversity and invariance in yourself for visual pre-training task.
Pattern Recognit., July, 2023

HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Self-Regulated Learning for Egocentric Video Activity Anticipation.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation.
Int. J. Comput. Vis., June, 2023

HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis.
IEEE Trans. Circuits Syst. Video Technol., May, 2023

Seed the Views: Hierarchical Semantic Alignment for Contrastive Representation Learning.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Entity-Enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Space-Time Cascaded Processing-Based Adaptive Transient Interference Mitigation for Compact HFSWR.
Remote. Sens., February, 2023

User Behavior Simulation for Search Result Re-ranking.
ACM Trans. Inf. Syst., January, 2023

A Real-Time Global Inference Network for One-Stage Referring Expression Comprehension.
IEEE Trans. Neural Networks Learn. Syst., 2023

Semi-Supervised Contrastive Learning With Similarity Co-Calibration.
IEEE Trans. Multim., 2023

Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Multim., 2023

Deep Graph Convolutional Quantization Networks for Image Retrieval.
IEEE Trans. Multim., 2023

Discrete Robust Matrix Factorization Hashing for Large-Scale Cross-Media Retrieval.
IEEE Trans. Knowl. Data Eng., 2023

Efficient Fine-Grained Object Recognition in High-Resolution Remote Sensing Images From Knowledge Distillation to Filter Grafting.
IEEE Trans. Geosci. Remote. Sens., 2023

M²NAS: Joint Neural Architecture Optimization System With Network Transmission.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2023

Query-Efficient Black-Box Adversarial Attack With Customized Iteration and Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Accurate medium-range global weather forecasting with 3D neural networks.
Nat., 2023

VoxSeP: semi-positive voxels assist self-supervised 3D medical segmentation.
Multim. Syst., 2023

Preface to the Special Issue on Multimodal Learning Integrated with Pre-training Techniques.
Int. J. Softw. Informatics, 2023

Correction to: Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation.
Int. J. Comput. Vis., 2023

Preliminary Study on Incremental Learning for Large Language Model-based Recommender Systems.
CoRR, 2023

When Parameter-efficient Tuning Meets General-purpose Vision-language Models.
CoRR, 2023

Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views.
CoRR, 2023

Boosting Segment Anything Model Towards Open-Vocabulary Learning.
CoRR, 2023

Segment Any 3D Gaussians.
CoRR, 2023

Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model.
CoRR, 2023

GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions.
CoRR, 2023

AiluRus: A Scalable ViT Framework for Dense Prediction.
CoRR, 2023

GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors.
CoRR, 2023

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering.
CoRR, 2023

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models.
CoRR, 2023

Computation-efficient Deep Learning for Computer Vision: A Survey.
CoRR, 2023

A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems.
CoRR, 2023

Human Motion Generation: A Survey.
CoRR, 2023

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners.
CoRR, 2023

Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models.
CoRR, 2023

Joint Channel Estimation and Feedback with Masked Token Transformers in Massive MIMO Systems.
CoRR, 2023

Exploring Effective Mask Sampling Modeling for Neural Image Compression.
CoRR, 2023

ControlVideo: Training-free Controllable Text-to-Video Generation.
CoRR, 2023

Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided Relation Alignment and Adaptation.
CoRR, 2023

Mode Approximation Makes Good Multimodal Prompts.
CoRR, 2023

Visual Tuning.
CoRR, 2023

Segment Anything in 3D with NeRFs.
CoRR, 2023

Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism.
CoRR, 2023

SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval.
CoRR, 2023

Learning Transferable Pedestrian Representation from Multimodal Information Supervision.
CoRR, 2023

Multi-modal Prompting for Low-Shot Temporal Action Localization.
CoRR, 2023

LION: Implicit Vision Prompt Tuning.
CoRR, 2023

R-Tuning: Regularized Prompt Tuning in Open-Set Scenarios.
CoRR, 2023

Rethinking Visual Prompt Learning as Masked Visual Token Modeling.
CoRR, 2023

Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding.
CoRR, 2023

Constraint and Union for Partially-Supervised Temporal Sentence Grounding.
CoRR, 2023

Session Search with Pre-trained Graph Classification Model.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Learning to Parameterize Visual Attributes for Open-set Fine-grained Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Segment Anything in 3D with NeRFs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Parameter-efficient Tuning of Large-scale Multimodal Foundation Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

VioLET: Vision-Language Efficient Tuning with Collaborative Multi-modal Gradients.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CFGL-LCR: A Counterfactual Graph Learning Framework for Legal Case Retrieval.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation.
Proceedings of the International Conference on Machine Learning, 2023

Continual Vision-Language Representation Learning with Off-Diagonal Information.
Proceedings of the International Conference on Machine Learning, 2023

Progressively Compressed Auto-Encoder for Self-supervised Representation Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The KFIoU Loss for Rotated Object Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

HiViT: A Simpler and More Efficient Design of Hierarchical Vision Transformer.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

USAGE: A Unified Seed Area Generation Paradigm for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Focus on Your Target: A Dual Teacher-Student Framework for Domain-adaptive Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Federated Domain Generalization with Generalization Adjustment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adapting Shortcut with Normalizing Flow: An Efficient Tuning Framework for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Integrally Pre-Trained Transformer Pyramid Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Visual Recognition by Request.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Being Comes from Not-Being: Open-Vocabulary Text-to-Motion Generation with Wordless Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Distilling Vision-Language Pre-Training to Collaborate with Weakly-Supervised Temporal Action Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Fine-Grained Retrieval Prompt Tuning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

DE-net: Dynamic Text-Guided Image Editing Adversarial Networks.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Low-Light Video Enhancement with Synthetic Event Guidance.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
A Novel Multi-Sample Generation Method for Adversarial Attacks.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Position-Aware Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2022

Filter Sketch for Network Pruning.
IEEE Trans. Neural Networks Learn. Syst., 2022

Hierarchical Semantic Graph Reasoning for Train Component Detection.
IEEE Trans. Neural Networks Learn. Syst., 2022

Deep Enhanced Weakly-Supervised Hashing With Iterative Tag Refinement.
IEEE Trans. Multim., 2022

Learning Representation on Optimized High-Order Manifold for Visual Classification.
IEEE Trans. Multim., 2022

Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact Visual Representations.
IEEE Trans. Multim., 2022

Deep Shape-Aware Person Re-Identification for Overcoming Moderate Clothing Changes.
IEEE Trans. Multim., 2022

Feature Calibration Network for Occluded Pedestrian Detection.
IEEE Trans. Intell. Transp. Syst., 2022

Curiosity-Driven Salient Object Detection With Fragment Attention.
IEEE Trans. Image Process., 2022

Visible-Infrared Person Re-Identification With Modality-Specific Memory Network.
IEEE Trans. Image Process., 2022

Loss Re-Scaling VQA: Revisiting the Language Prior Problem From a Class-Imbalance View.
IEEE Trans. Image Process., 2022

Big-Hypergraph Factorization Neural Network for Survival Prediction From Whole Slide Image.
IEEE Trans. Image Process., 2022

Disentangling Task-Oriented Representations for Unsupervised Domain Adaptation.
IEEE Trans. Image Process., 2022

SSL++: Improving Self-Supervised Learning by Mitigating the Proxy Task-Specificity Problem.
IEEE Trans. Image Process., 2022

Camera-Based Batch Normalization: An Effective Distribution Alignment Method for Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2022

Large-Scale Spatio-Temporal Person Re-Identification: Algorithms and Benchmark.
IEEE Trans. Circuits Syst. Video Technol., 2022

Adaptive Spatial Location With Balanced Loss for Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2022

DEF-Net: A Face Aging Model by Using Different Emotional Learnings.
IEEE Trans. Circuits Syst. Video Technol., 2022

Spontaneous Speech Emotion Recognition Using Multiscale Deep Convolutional LSTM.
IEEE Trans. Affect. Comput., 2022

Searching Towards Class-Aware Generators for Conditional Generative Adversarial Networks.
IEEE Signal Process. Lett., 2022

Actionness-Guided Transformer for Anchor-Free Temporal Action Localization.
IEEE Signal Process. Lett., 2022

Meta-Learning Paradigm and CosAttn for Streamer Action Recognition in Live Video.
IEEE Signal Process. Lett., 2022

Progressive privileged knowledge distillation for online action detection.
Pattern Recognit., 2022

Exploring rich intermediate representations for reconstructing 3D shapes from 2D images.
Pattern Recognit., 2022

Effective full-scale detection for salient object based on condensing-and-filtering network.
Pattern Recognit., 2022

Fine-Grained Video Captioning via Graph-based Multi-Granularity Interaction Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Pose-Guided Representation Learning for Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Symbiotic Graph Neural Networks for 3D Skeleton-Based Human Action Recognition and Motion Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

TapLab: A Fast Framework for Semantic Video Segmentation Tapping Into Compressed-Domain Knowledge.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Scalable NAS with factorizable architectural parameters.
Neurocomputing, 2022

Special issue on cross-modal retrieval and analysis.
Int. J. Multim. Inf. Retr., 2022

GhostNets on Heterogeneous Devices via Cheap Operations.
Int. J. Comput. Vis., 2022

Network Adjustment: Channel and Block Search Guided by Resource Utilization Ratio.
Int. J. Comput. Vis., 2022

Weight-Sharing Neural Architecture Search: A Battle to Shrink the Optimization Gap.
ACM Comput. Surv., 2022

Prototype-guided Cross-task Knowledge Distillation for Large-scale Models.
CoRR, 2022

Integrally Pre-Trained Transformer Pyramid Networks.
CoRR, 2022

Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast.
CoRR, 2022

OhMG: Zero-shot Open-vocabulary Human Motion Generation.
CoRR, 2022

See Blue Sky: Deep Image Dehaze Using Paired and Unpaired Training Images.
CoRR, 2022

Towards a Unified View on Visual Parameter-Efficient Transfer Learning.
CoRR, 2022

Motion-inductive Self-supervised Object Discovery in Videos.
CoRR, 2022

T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up.
CoRR, 2022

Prompt-Matched Semantic Segmentation.
CoRR, 2022

Pro-tuning: Unified Prompt Tuning for Vision Tasks.
CoRR, 2022

A Survey on Label-efficient Deep Segmentation: Bridging the Gap between Weak Supervision and Dense Prediction.
CoRR, 2022

Masked Autoencoders are Robust Data Augmentors.
CoRR, 2022

HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling.
CoRR, 2022

HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval.
CoRR, 2022

CenterNet++ for Object Detection.
CoRR, 2022

Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers.
CoRR, 2022

Deep Class Incremental Learning from Decentralized Data.
CoRR, 2022

Global or Local: Constructing Personalized Click Models for Web Search.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Fast Dynamic Radiance Fields with Time-Aware Neural Voxels.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

Fine-Grained Semantically Aligned Vision-Language Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ConfounderGAN: Protecting Image Data Privacy with Causal Confounder.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Finding the Host from the Lesion by Iteratively Mining the Registration Graph.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Search-oriented Micro-video Captioning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Boundary-Enhanced Self-supervised Learning for Brain Structure Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Unsupervised Ensemble Distillation for Multi-Organ Segmentation.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Bag of Instances Aggregation Boosts Self-supervised Distillation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

FedSkip: Combatting Statistical Heterogeneity with Federated Skip Aggregation.
Proceedings of the IEEE International Conference on Data Mining, 2022

MVP: Multimodality-Guided Visual Pre-training.
Proceedings of the Computer Vision - ECCV 2022, 2022

Cornerformer: Purifying Instances for Corner-Based Detectors.
Proceedings of the Computer Vision - ECCV 2022, 2022

Active Pointly-Supervised Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining.
Proceedings of the Computer Vision - ECCV 2022, 2022

TAPE: Task-Agnostic Prior Embedding for Image Restoration.
Proceedings of the Computer Vision - ECCV 2022, 2022

Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction.
Proceedings of the Computer Vision - ECCV 2022, 2022

Domain-Conditioned Normalization for Test-Time Domain Generalization.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Vibration-Based Uncertainty Estimation for Learning from Limited Supervision.
Proceedings of the Computer Vision - ECCV 2022, 2022

SdAE: Self-distillated Masked Autoencoder.
Proceedings of the Computer Vision - ECCV 2022, 2022

Swin-Unet: Unet-Like Pure Transformer for Medical Image Segmentation.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

HyperDet3D: Learning a Scene-conditioned 3D Object Detector.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

One-bit Active Query with Contrastive Pairs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Contextual Similarity Distillation for Asymmetric Image Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Partial Class Activation Attention for Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Domain-Agnostic Prior for Transfer Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DeeCap: Dynamic Early Exiting for Efficient Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Learn by Jointly Optimizing Neural Architecture and Weights.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DATA: Domain-Aware and Task-Aware Self-supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Efficient and Scalable Implicit Graph Neural Networks with Virtual Equilibrium.
Proceedings of the IEEE International Conference on Big Data, 2022

Can Semantic Labels Assist Self-Supervised Visual Representation Learning?
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

SiamTrans: Zero-Shot Multi-Frame Image Restoration with Pre-trained Siamese Transformers.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Universal-to-Specific Framework for Complex Action Recognition.
IEEE Trans. Multim., 2021

Graph Regularized Encoder-Decoder Networks for Image Representation Learning.
IEEE Trans. Multim., 2021

Progressive Unsupervised Person Re-Identification by Tracklet Association With Spatio-Temporal Regularization.
IEEE Trans. Multim., 2021

Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data.
IEEE Trans. Multim., 2021

Collaborative Image Relevance Learning for Visual Re-Ranking.
IEEE Trans. Multim., 2021

A Novel Multi-task Tensor Correlation Neural Network for Facial Attribute Prediction.
ACM Trans. Intell. Syst. Technol., 2021

Deep Relation Embedding for Cross-Modal Retrieval.
IEEE Trans. Image Process., 2021

Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection.
IEEE Trans. Image Process., 2021

BiSPL: Bidirectional Self-Paced Learning for Recognition From Web Data.
IEEE Trans. Image Process., 2021

Interaction-Integrated Network for Natural Language Moment Localization.
IEEE Trans. Image Process., 2021

Conversational Image Search.
IEEE Trans. Image Process., 2021

An End-to-End Foreground-Aware Network for Person Re-Identification.
IEEE Trans. Image Process., 2021

Multiscale Spatio-Temporal Graph Neural Networks for 3D Skeleton-Based Motion Prediction.
IEEE Trans. Image Process., 2021

Multi-View Gait Image Generation for Cross-View Gait Recognition.
IEEE Trans. Image Process., 2021

Beyond Universal Person Re-Identification Attack.
IEEE Trans. Inf. Forensics Secur., 2021

Long-Term Video Question Answering via Multimodal Hierarchical Memory Attentive Networks.
IEEE Trans. Circuits Syst. Video Technol., 2021

Cascaded Regression Tracking: Towards Online Hard Distractor Discrimination.
IEEE Trans. Circuits Syst. Video Technol., 2021

Age Estimation Using Aging/Rejuvenation Features With Device-Edge Synergy.
IEEE Trans. Circuits Syst. Video Technol., 2021

Semantic-Guided Pixel Sampling for Cloth-Changing Person Re-Identification.
IEEE Signal Process. Lett., 2021

Diverse part attentive network for video-based person re-identification.
Pattern Recognit. Lett., 2021

3D-GAT: 3D-Guided adversarial transform network for person re-identification in unseen domains.
Pattern Recognit., 2021

Partially-Connected Neural Architecture Search for Reduced Computational Redundancy.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Learning Part-based Convolutional Features for Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Harmonized Multimodal Learning with Gaussian Process Latent Variable Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

3D sketching for 3D object retrieval.
Multim. Tools Appl., 2021

Cyclic CNN: Image Classification With Multiscale and Multilocation Contexts.
IEEE Internet Things J., 2021

Real-time semantic segmentation via sequential knowledge distillation.
Neurocomputing, 2021

Progressive DARTS: Bridging the Optimization Gap for NAS in the Wild.
Int. J. Comput. Vis., 2021

Multi-agent Communication with Graph Information Bottleneck under Limited Bandwidth.
CoRR, 2021

Exploring Complicated Search Spaces with Interleaving-Free Sampling.
CoRR, 2021

NeuSample: Neural Sample Field for Efficient View Synthesis.
CoRR, 2021

Consensus Synergizes with Memory: A Simple Approach for Anomaly Segmentation in Urban Scenes.
CoRR, 2021

Semantic-Aware Generation for Self-Supervised Visual Representation Learning.
CoRR, 2021

DVCFlow: Modeling Information Flow Towards Human-like Video Captioning.
CoRR, 2021

DocScanner: Robust Document Image Rectification with Progressive Learning.
CoRR, 2021

CIPS-3D: A 3D-Aware Generator of GANs Based on Conditionally-Independent Pixel Synthesis.
CoRR, 2021

Rectifying the Shortcut Learning of Background: Shared Object Concentration for Few-Shot Image Recognition.
CoRR, 2021

Training Compact CNNs for Image Classification using Dynamic-coded Filter Fusion.
CoRR, 2021

Fast Batch Nuclear-norm Maximization and Minimization for Robust Domain Adaptation.
CoRR, 2021

Bag of Instances Aggregation Boosts Self-supervised Learning.
CoRR, 2021

Multi-dataset Pretraining: A Unified Model for Semantic Segmentation.
CoRR, 2021

Large-Scale Spatio-Temporal Person Re-identification: Algorithm and Benchmark.
CoRR, 2021

What Is Considered Complete for Visual Recognition?
CoRR, 2021

Location-Sensitive Visual Recognition with Cross-IOU Loss.
CoRR, 2021

Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization.
CoRR, 2021

Spatiotemporal Transformer for Video-based Person Re-identification.
CoRR, 2021

Unsupervised Domain Adaptation for Image Classification via Structure-Conditioned Adversarial Learning.
CoRR, 2021

Handwritten Chinese Font Generation with Collaborative Stroke Refinement.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Appending Adversarial Frames for Universal Video Attack.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Rectifying the Shortcut Learning of Background for Few-Shot Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Semi-Autoregressive Image Captioning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Cross-modal Self-Supervised Learning for Lip Reading: When Contrastive Learning meets Adversarial Training.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Towards Fast and High-Quality Sign Language Production.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Towards Multiple Black-boxes Attack via Adversarial Example Generation Network.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Analysis and Applications of Class-wise Robustness in Adversarial Training.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss.
Proceedings of the 38th International Conference on Machine Learning, 2021

Skeleton Graph Scattering Networks for 3D Skeleton-based Human Motion Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Omni-GAN: On the Secrets of cGANs and Beyond.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Differentiable Convolution Search for Point Cloud Processing.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Foreground Activation Maps for Weakly Supervised Object Localization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Divide and Conquer for Single-frame Temporal Action Localization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Greedy Gradient Ensemble for Robust Visual Question Answering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Visformer: The Vision-friendly Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Shape Self-Correction for Unsupervised Point Cloud Understanding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Pixel Difference Networks for Efficient Edge Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

UnrealPerson: An Adaptive Pipeline Towards Costless Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

CondenseNet V2: Sparse Feature Reactivation for Deep Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

A Fourier-Based Framework for Domain Generalization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards Compact CNNs via Collaborative Compression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ESAD: End-to-end Semi-supervised Anomaly Detection.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Dual Distribution Alignment Network for Generalizable Person Re-Identification.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
EGroupNet: A Feature-enhanced Network for Age Estimation with Novel Age Group Schemes.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Adaptive Hashing With Sparse Matrix Factorization.
IEEE Trans. Neural Networks Learn. Syst., 2020

Sample Balancing for Deep Learning-Based Visual Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2020

Neighborhood Pyramid Preserving Hashing.
IEEE Trans. Multim., 2020

Adversarial Training Towards Robust Multimedia Recommender System.
IEEE Trans. Knowl. Data Eng., 2020

Tensor Multi-Task Learning for Person Re-Identification.
IEEE Trans. Image Process., 2020

Learning to Align via Wasserstein for Person Re-Identification.
IEEE Trans. Image Process., 2020

A Multi-Scale Spatial-Temporal Attention Model for Person Re-Identification in Videos.
IEEE Trans. Image Process., 2020

Multi-View Image Classification With Visual, Semantic and View Consistency.
IEEE Trans. Image Process., 2020

Adaptive Graph Representation Learning for Video Person Re-Identification.
IEEE Trans. Image Process., 2020

Group-Group Loss-Based Global-Regional Feature Learning for Vehicle Re-Identification.
IEEE Trans. Image Process., 2020

Adaptive MultiScale Segmentations for Hyperspectral Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2020

Multiview Semantic Representation for Visual Recognition.
IEEE Trans. Cybern., 2020

Discrete Semantic Alignment Hashing for Cross-Media Retrieval.
IEEE Trans. Cybern., 2020

Porn Streamer Recognition in Live Video Streaming via Attention-Gated Multimodal Deep Features.
IEEE Trans. Circuits Syst. Video Technol., 2020

Small Object Detection in Unmanned Aerial Vehicle Images Using Feature Fusion and Scaling-Based Single Shot Detector With Spatial Context Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2020

Node-Sensitive Graph Fusion via Topo-Correlation for Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2020

A Survey of Open-World Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2020

Multi-view adaptive semi-supervised feature selection with the self-paced learning.
Signal Process., 2020

Fast discrete cross-modal hashing with semantic consistency.
Neural Networks, 2020

A half-precision compressive sensing framework for end-to-end person re-identification.
Neural Comput. Appl., 2020

Efficient discrete supervised hashing for large-scale cross-modal retrieval.
Neurocomputing, 2020

E²BoWs: An end-to-end Bag-of-Words model via deep convolutional neural network for image retrieval.
Neurocomputing, 2020

Style-adaptive photo aesthetic rating via convolutional neural networks and multi-task learning.
Neurocomputing, 2020

CcNet: A cross-connected convolutional network for segmenting retinal vessels using multi-scale features.
Neurocomputing, 2020

The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline.
Int. J. Comput. Vis., 2020

Hadamard Matrix Guided Online Hashing.
Int. J. Comput. Vis., 2020

Accelerated CPU-GPUs implementations for quaternion polar harmonic transform of color images.
Future Gener. Comput. Syst., 2020

Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses.
CoRR, 2020

ESAD: End-to-end Deep Semi-supervised Anomaly Detection.
CoRR, 2020

Hierarchical Semantic Aggregation for Contrastive Representation Learning.
CoRR, 2020

Omni-GAN: On the Secrets of cGANs and Beyond.
CoRR, 2020

Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact Visual Representations.
CoRR, 2020

Privileged Knowledge Distillation for Online Action Detection.
CoRR, 2020

Can Semantic Labels Assist Self-Supervised Visual Representation Learning?
CoRR, 2020

Center-wise Local Image Mixture For Contrastive Representation Learning.
CoRR, 2020

Loss-rescaling VQA: Revisiting Language Prior Problem from a Class-imbalance View.
CoRR, 2020

Learning Task-oriented Disentangled Representations for Unsupervised Domain Adaptation.
CoRR, 2020

Dual Distribution Alignment Network for Generalizable Person Re-Identification.
CoRR, 2020

GOLD-NAS: Gradual, One-Level, Differentiable.
CoRR, 2020

Searching towards Class-Aware Generators for Conditional Generative Adversarial Networks.
CoRR, 2020

ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Medical Image Segmentation.
CoRR, 2020

Distilling Object Detectors with Task Adaptive Regularization.
CoRR, 2020

Constraining Temporal Relationship for Action Localization.
CoRR, 2020

Widening and Squeezing: Towards Accurate and Efficient QNNs.
CoRR, 2020

Disassembling the Dataset: A Camera Alignment Mechanism for Multiple Tasks in Person Re-identification.
CoRR, 2020

Filter Sketch for Network Pruning.
CoRR, 2020

Latency-Aware Differentiable Neural Architecture Search.
CoRR, 2020

Attribute Mix: Semantic Data Augmentation for Fine Grained Recognition.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Self-Adaptively Learning to Demoiré from Focused and Defocused Image Pairs.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

One-bit Supervision for Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Deep Multimodal Neural Architecture Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Discernible Image Compression.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Context-Aware Multi-View Summarization Network for Image-Text Matching.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Towards More Explainability: Concept Knowledge Mining Network for Event Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cascade Grouped Attention Network for Referring Expression Segmentation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Attacking Image Captioning Towards Accuracy-Preserving Target Words Removal.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Structured Latent Variable Recurrent Network With Stochastic Attention For Generating Weibo Comments.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Polar Relative Positional Encoding for Video-Language Segmentation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search.
Proceedings of the 8th International Conference on Learning Representations, 2020

Cross-VAE: Towards Disentangling Expression from Identity For Human Faces.
Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

Rethinking the Distribution Gap of Person Re-identification with Camera-Based Batch Normalization.
Proceedings of the Computer Vision - ECCV 2020, 2020

Bottom-Up Temporal Action Localization with Mutual Regularization.
Proceedings of the Computer Vision - ECCV 2020, 2020

Social Adaptive Module for Weakly-Supervised Group Activity Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

Circumventing Outliers of AutoAugment with Knowledge Distillation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Large-Scale Few-Shot Learning via Multi-modal Knowledge Discovery.
Proceedings of the Computer Vision - ECCV 2020, 2020

Reinforced Axial Refinement Network for Monocular 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Wavelet-Based Dual-Branch Network for Image Demoiréing.
Proceedings of the Computer Vision - ECCV 2020, 2020

Video Super-Resolution with Recurrent Structure-Detail Network.
Proceedings of the Computer Vision - ECCV 2020, 2020

Interpretable Visual Reasoning via Probabilistic Formulation Under Natural Supervision.
Proceedings of the Computer Vision - ECCV 2020, 2020

Corner Proposal Network for Anchor-Free, Two-Stage Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

FTL: A Universal Framework for Training Low-Bit DNNs via Feature Transfer.
Proceedings of the Computer Vision - ECCV 2020, 2020

API-Net: Robust Generative Classifier via a Single Discriminator.
Proceedings of the Computer Vision - ECCV 2020, 2020

CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing.
Proceedings of the Computer Vision - ECCV 2020, 2020

Extract and Merge: Superpixel Segmentation with Regional Attributes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Select Base Classes for Few-Shot Classification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Rethinking Performance Estimation in Neural Architecture Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

CARS: Continuous Evolution for Efficient Neural Architecture Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cross-Domain Detection via Graph-Induced Prototype Alignment.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Label Decoupling Framework for Salient Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Transformation GAN for Unsupervised Image Synthesis and Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Semi-Supervised Assessor of Neural Architectures.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Polishing Decision-Based Adversarial Noise With a Customized Sampling.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Noise-Aware Fully Webly Supervised Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Joint Demosaicing and Denoising With Self Guidance.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

GradNet Image Denoising.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Unsupervised Person Re-Identification via Softened Similarity Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Projection & Probability-Driven Black-Box Attack.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Video Super-Resolution With Temporal Group Attention.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Creating Something From Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

GhostNet: More Features From Cheap Operations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Temporal Co-Attention Models for Unsupervised Video Action Localization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Gradually Vanishing Bridge for Adversarial Domain Adaptation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AdderNet: Do We Really Need Multiplications in Deep Learning?
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Frequency Domain Compact 3D Convolutional Neural Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Unsupervised Image Super-Resolution with an Indirect Supervised Path.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Network Adjustment: Channel Search Guided by FLOPs Utilization Ratio.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Single Camera Training for Person Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Adversarial Domain Adaptation with Domain Mixup.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Discovering Latent Topics by Gaussian Latent Dirichlet Allocation and Spectral Clustering.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Deep Scalable Supervised Quantization by Self-Organizing Map.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Search Result Reranking with Visual and Structure Information Sources.
ACM Trans. Inf. Syst., 2019

Semantically Modeling of Object and Context for Categorization.
IEEE Trans. Neural Networks Learn. Syst., 2019

Two Birds With One Stone: A Coupled Poisson Deconvolution for Detecting and Describing Topics From Multimodal Web Data.
IEEE Trans. Neural Networks Learn. Syst., 2019

Editorial: Booming of Neural Networks and Learning Systems.
IEEE Trans. Neural Networks Learn. Syst., 2019

Personalized Recommendation of Social Images by Constructing a User Interest Tree With Deep Features and Tag Trees.
IEEE Trans. Multim., 2019

Effective Image Retrieval via Multilinear Multi-Index Fusion.
IEEE Trans. Multim., 2019

Unsupervised and Semi-Supervised Image Classification With Weak Semantic Consistency.
IEEE Trans. Multim., 2019

SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning.
IEEE Trans. Multim., 2019

GLAD: Global-Local-Alignment Descriptor for Scalable Person Re-Identification.
IEEE Trans. Multim., 2019

Improving Object Retrieval Quality by Integration of Similarity Propagation and Query Expansion.
IEEE Trans. Multim., 2019

Deep Representation Learning With Part Loss for Person Re-Identification.
IEEE Trans. Image Process., 2019

Unifying Sum and Weighted Aggregations for Efficient Yet Effective Image Representation Computation.
IEEE Trans. Image Process., 2019

Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning.
IEEE Trans. Image Process., 2019

Automatic Ensemble Diffusion for 3D Shape and Image Retrieval.
IEEE Trans. Image Process., 2019

Multiview, Few-Labeled Object Categorization by Predicting Labels With View Consistency.
IEEE Trans. Cybern., 2019

Increasing Interpretation of Web Topic Detection via Prototype Learning From Sparse Poisson Deconvolution.
IEEE Trans. Cybern., 2019

Weak to Strong Detector Learning for Simultaneous Classification and Localization.
IEEE Trans. Circuits Syst. Video Technol., 2019

An Adaptive Multi-Projection Metric Learning for Person Re-Identification Across Non-Overlapping Cameras.
IEEE Trans. Circuits Syst. Video Technol., 2019

Online latent semantic hashing for cross-media retrieval.
Pattern Recognit., 2019

Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Regularized Diffusion Process on Bidirectional Context for Object Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Exploiting weak mask representation with convolutional neural networks for accurate object tracking.
Multim. Tools Appl., 2019

Cloud Detection Using Super Pixel Classification and Semantic Segmentation.
J. Comput. Sci. Technol., 2019

DR²-Net: Deep Residual Reconstruction Network for image compressive sensing.
Neurocomputing, 2019

Semi-supervised feature selection analysis with structured multi-view sparse regularization.
Neurocomputing, 2019

Editorial for the ICMR 2018 special issue.
Int. J. Multim. Inf. Retr., 2019

Contextual modeling on auxiliary points for robust image reranking.
Frontiers Comput. Sci., 2019

Hybrid feature-based analysis of video's affective content using protagonist detection.
Expert Syst. Appl., 2019

Scalable NAS with Factorizable Architectural Parameters.
CoRR, 2019

Stabilizing DARTS with Amended Gradient Estimation on Architectural Parameters.
CoRR, 2019

Unsupervised Image Super-Resolution with an Indirect Supervised Path.
CoRR, 2019

RNAS: Architecture Ranking for Powerful Networks.
CoRR, 2019

Data Augmentation Revisited: Rethinking the Distribution Gap between Clean and Augmented Data.
CoRR, 2019

Adaptive Graph Representation Learning for Video Person Re-identification.
CoRR, 2019

Multimodal Unified Attention Networks for Vision-and-Language Interactions.
CoRR, 2019

PC-DARTS: Partial Channel Connections for Memory-Efficient Differentiable Architecture Search.
CoRR, 2019

Defending Adversarial Attacks by Correcting logits.
CoRR, 2019

Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval.
CoRR, 2019

Handwritten Chinese Font Generation with Collaborative Stroke Refinement.
CoRR, 2019

Adversarial Attack and Defense on Point Sets.
CoRR, 2019

Discrete Robust Supervised Hashing for Cross-Modal Retrieval.
IEEE Access, 2019

Det2Seg: A Two-Stage Approach for Road Object Segmentation from 3D Point Clouds.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Information Competing Process for Learning Diversified Representations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Structured Stochastic Recurrent Network for Linguistic Video Prediction.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multimodal Dialog System: Generating Responses via Adaptive Decoders.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Data Priming Network for Automatic Check-Out.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Fast Non-Local Neural Networks with Spectral Residual Learning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Video-Based Cross-Modal Recipe Retrieval.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Adversarial Learning for Content-Based Image Retrieval.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Dense Temporal Convolution Network for Sign Language Translation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

PCPCAD: Proposal Complementary Action Detector.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Two-Stream Video Classification with Cross-Modality Attention.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Accelerate CNN via Recursive Bayesian Pruning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multinomial Distribution Learning for Effective Neural Architecture Search.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Co-Evolutionary Compression for Unpaired Image Translation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Dynamic Points Agglomeration for Hierarchical Point Sets Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Global-Local Temporal Representations for Video Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Universal Perturbation Attack Against Image Retrieval.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CenterNet: Keypoint Triplets for Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Progressive Differentiable Architecture Search: Bridging the Depth Gap Between Search and Evaluation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Data-Free Learning of Student Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning to Learn Image Classifiers With Visual Analogy.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Variational Convolutional Neural Network Pruning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Deep Modular Co-Attention Networks for Visual Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Modeling Point Clouds With Self-Attention and Gumbel Subset Sampling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Iterative Reorganization With Weak Spatial Constraints: Solving Arbitrary Jigsaw Puzzles for Unsupervised Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Channel-Wise Interactions for Binary Convolutional Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Deep Fitting Degree Scoring Network for Monocular 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Towards Visual Feature Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Structural Relational Reasoning of Point Clouds.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Context-Aware Ranking by Constructing a Virtual Environment for Reinforcement Learning.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Related Attention Network for Person Re-Identification.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

2018
Adaptive Sliding Mode Control of Robot Manipulator Based on Second Order Approximation Accuracy and Decomposed Fuzzy Compensator.
Wirel. Pers. Commun., 2018

Structured Weak Semantic Space Construction for Visual Categorization.
IEEE Trans. Neural Networks Learn. Syst., 2018

Image-Specific Classification With Local and Global Discriminations.
IEEE Trans. Neural Networks Learn. Syst., 2018

Object Categorization Using Class-Specific Representations.
IEEE Trans. Neural Networks Learn. Syst., 2018

Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval.
IEEE Trans. Multim., 2018

Multiview Label Sharing for Visual Representations and Classifications.
IEEE Trans. Multim., 2018

GLA: Global-Local Attention for Image Description.
IEEE Trans. Multim., 2018

AutoBD: Automated Bi-Level Description for Scalable Fine-Grained Visual Categorization.
IEEE Trans. Image Process., 2018

Sequential Video VLAD: Training the Aggregation Locally and Temporally.
IEEE Trans. Image Process., 2018

A General Framework for Linear Distance Preserving Hashing.
IEEE Trans. Image Process., 2018

Assessing Image Retrieval Quality at the First Glance.
IEEE Trans. Image Process., 2018

Retrieval Oriented Deep Feature Learning With Complementary Supervision Mining.
IEEE Trans. Image Process., 2018

Iterative Graph Seeking for Object Tracking.
IEEE Trans. Image Process., 2018

Incremental Codebook Adaptation for Visual Representation and Categorization.
IEEE Trans. Cybern., 2018

Pooling the Convolutional Layers in Deep ConvNets for Video Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2018

Image Class Prediction by Joint Object, Context, and Background Modeling.
IEEE Trans. Circuits Syst. Video Technol., 2018

Learning Affective Features With a Hybrid Deep Model for Audio-Visual Emotion Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2018

Bundled Local Features for Image Representation.
IEEE Trans. Circuits Syst. Video Technol., 2018

Multi-type attributes driven multi-camera person re-identification.
Pattern Recognit., 2018

Blind image quality prediction by exploiting multi-level deep representations.
Pattern Recognit., 2018

Improving context-sensitive similarity via smooth neighborhood for object retrieval.
Pattern Recognit., 2018

Collaborative Index Embedding for Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

SIFT Meets CNN: A Decade Survey of Instance Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Multi-Task Learning with Low Rank Attribute Embedding for Multi-Camera Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Deep hashing with top similarity preserving for image retrieval.
Multim. Tools Appl., 2018

A novel two-stream saliency image fusion CNN architecture for person re-identification.
Multim. Syst., 2018

Pseudo-positive regularization for deep person re-identification.
Multim. Syst., 2018

Multimedia analysis with collective intelligence.
J. Vis. Commun. Image Represent., 2018

Birds of a feather flock together: Visual representation with scale and class consistency.
Inf. Sci., 2018

Image-level classification by hierarchical structure learning with visual and semantic similarities.
Inf. Sci., 2018

Haze removal method for natural restoration of images with sky.
Neurocomputing, 2018

Region similarity arrangement for large-scale image retrieval.
Neurocomputing, 2018

A two-step approach to describing web topics via probable keywords and prototype images from background-removed similarities.
Neurocomputing, 2018

Aggregating hierarchical binary activations for image retrieval.
Neurocomputing, 2018

Network Compression via Recursive Bayesian Pruning.
CoRR, 2018

Domain-Invariant Adversarial Learning for Unsupervised Domain Adaption.
CoRR, 2018

Phase Collaborative Network for Multi-Phase Medical Imaging Segmentation.
CoRR, 2018

DropFilter: Dropout for Convolutions.
CoRR, 2018

A Novel Multi-Task Tensor Correlation Neural Network for Facial Attribute Prediction.
CoRR, 2018

Learning Affective Features Based on VIP for Video Affective Content Analysis.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

On the Large-Scale Transferability of Convolutional Neural Networks.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2018

Scalable Bag of Selected Deep Features for Visual Instance Retrieval.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Comprehensive Distance-Preserving Autoencoders for Cross-Modal Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Cross-modal Moment Localization in Videos.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Cascaded Feature Augmentation with Diffusion for Image Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Saliency Guided Shallow Convolutional Neural Network for Traffic Signs Retrieval.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

UHCL-Darknet: An OpenCL-based Deep Neural Network Framework for Heterogeneous Multi-/Many-core Clusters.
Proceedings of the 47th International Conference on Parallel Processing, 2018

Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline).
Proceedings of the Computer Vision - ECCV 2018, 2018

Collaborative Deep Reinforcement Learning for Multi-object Tracking.
Proceedings of the Computer Vision - ECCV 2018, 2018

The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking.
Proceedings of the Computer Vision - ECCV 2018, 2018

Person Transfer GAN to Bridge Domain Gap for Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-Cue Correlation Filters for Robust Visual Tracking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep Hashing via Discrepancy Minimization.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Zigzag Learning for Weakly Supervised Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Relevance Estimation with Multiple Information Sources on Search Engine Result Pages.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Super-Pixel Cloud Detection Using Hierarchical Fusion CNN.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Image Captioning Based on Adaptive Balancing Loss.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
Enhancing Person Re-identification in a Self-Trained Subspace.
ACM Trans. Multim. Comput. Commun. Appl., 2017

Fine-Grained Image Classification via Low-Rank Sparse Coding With General and Class-Specific Codebooks.
IEEE Trans. Neural Networks Learn. Syst., 2017

Personalized Social Image Recommendation Method Based on User-Image-Tag Model.
IEEE Trans. Multim., 2017

Picking Neural Activations for Fine-Grained Recognition.
IEEE Trans. Multim., 2017

Cross-Modal Retrieval Using Multiordered Discriminative Structured Subspace Learning.
IEEE Trans. Multim., 2017

Trip Outfits Advisor: Location-Oriented Clothing Recommendation.
IEEE Trans. Multim., 2017

Inferring Emotional Tags From Social Images With User Demographics.
IEEE Trans. Multim., 2017

Guest Editorial: Large-Scale Multimedia Data Retrieval, Classification, and Understanding.
IEEE Trans. Multim., 2017

Novel Visual and Statistical Image Features for Microblogs News Verification.
IEEE Trans. Multim., 2017

GIFT: Towards Scalable 3D Shape Retrieval.
IEEE Trans. Multim., 2017

Road Recognition From Remote Sensing Imagery Using Incremental Learning.
IEEE Trans. Intell. Transp. Syst., 2017

Part-Based Deep Hashing for Large-Scale Person Re-Identification.
IEEE Trans. Image Process., 2017

Codebook Guided Feature-Preserving for Recognition-Oriented Image Retargeting.
IEEE Trans. Image Process., 2017

LEGO-MM: LEarning Structured Model by Probabilistic loGic Ontology Tree for MultiMedia.
IEEE Trans. Image Process., 2017

Multimodal Similarity Gaussian Process Latent Variable Model.
IEEE Trans. Image Process., 2017

Robust ImageGraph: Rank-Level Feature Fusion for Image Search.
IEEE Trans. Image Process., 2017

Coherent Semantic-Visual Indexing for Large-Scale Image Retrieval in the Cloud.
IEEE Trans. Image Process., 2017

Geometric Hypergraph Learning for Visual Tracking.
IEEE Trans. Cybern., 2017

Contextual Exemplar Classifier-Based Image Representation for Classification.
IEEE Trans. Circuits Syst. Video Technol., 2017

Multiview Hessian Semisupervised Sparse Feature Selection for Multimedia Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2017

Attributes driven tracklet-to-tracklet person re-identification using latent prototypes space mapping.
Pattern Recognit., 2017

A human motion feature based on semi-supervised learning of GMM.
Multim. Syst., 2017

Guest Editorial: Intermediate representation for vision and multimedia applications.
J. Vis. Commun. Image Represent., 2017

Interpretation of users' feedback via swarmed particles for content-based image retrieval.
Inf. Sci., 2017

Image classification by search with explicitly and implicitly semantic representations.
Inf. Sci., 2017

Local residual similarity for image re-ranking.
Inf. Sci., 2017

Hierarchical deep semantic representation for visual categorization.
Neurocomputing, 2017

Neighborhood geometry based feature matching for geostationary satellite remote sensing image.
Neurocomputing, 2017

Towards Reversal-Invariant Image Representation.
Int. J. Comput. Vis., 2017

Supervised Coarse-to-Fine Semantic Hashing for cross-media retrieval.
Digit. Signal Process., 2017

Beyond Part Models: Person Retrieval with Refined Part Pooling.
CoRR, 2017

Learning to Learn Image Classifiers with Informative Visual Analogy.
CoRR, 2017

E²BoWs: An End-to-End Bag-of-Words Model via Deep Convolutional Neural Network.
CoRR, 2017

Recent Advance in Content-based Image Retrieval: A Literature Survey.
CoRR, 2017

Ensemble of Part Detectors for Simultaneous Classification and Localization.
CoRR, 2017

Deep Representation Learning with Part Loss for Person Re-Identification.
CoRR, 2017

Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

One-Shot Fine-Grained Instance Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

First International ACM Thematic Workshops 2017.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Deep Supervised Quantization by Self-Organizing Map.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Enhancing Micro-video Understanding by Harnessing External Sounds.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Large-scale person re-identification as retrieval.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

SORT: Second-Order Response Transform for Visual Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Pose-Driven Deep Convolutional Model for Person Re-identification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Multimodal Gaussian Process Latent Variable Models with Harmonization.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Ensemble Diffusion for Retrieval.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Multi-index fusion via similarity matrix pooling for image retrieval.
Proceedings of the IEEE International Conference on Communications, 2017

Person Re-identification in the Wild.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Video-Based Person Re-identification by Deep Feature Guided Pooling.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Scalable Person Re-identification on Supervised Smoothed Manifold.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Structured Multi-view Supervised Feature Selection Algorithm Research.
Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017

Automatic density clustering with multiple kernels for high-dimension bioinformatics data.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

Image Caption with Global-Local Attention.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Regularized Diffusion Process for Visual Retrieval.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Multidimensional Scaling on Multiple Input Distance Matrices.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Scalable Object Retrieval with Compact Image Representation from Generic Object Regions.
ACM Trans. Multim. Comput. Commun. Appl., 2016

A Boosting Approach to Exploit Instance Correlations for Multi-Instance Classification.
IEEE Trans. Neural Networks Learn. Syst., 2016

Saliency-Aware Nonparametric Foreground Annotation Based on Weakly Labeled Data.
IEEE Trans. Neural Networks Learn. Syst., 2016

Image Retargeting for Preserving Robust Local Feature: Application to Mobile Visual Search.
IEEE Trans. Multim., 2016

Democratic Diffusion Aggregation for Image Retrieval.
IEEE Trans. Multim., 2016

Coarse-to-Fine Description for Fine-Grained Visual Categorization.
IEEE Trans. Image Process., 2016

Fused One-vs-All Features With Semantic Alignments for Fine-Grained Visual Categorization.
IEEE Trans. Image Process., 2016

Simple Techniques Make Sense: Feature Pooling and Normalization for Image Classification.
IEEE Trans. Circuits Syst. Video Technol., 2016

Making Residual Vector Distribution Uniform for Distinctive Image Representation.
IEEE Trans. Circuits Syst. Video Technol., 2016

Scalable Feature Matching by Dual Cascaded Scalar Quantization for Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Guest Editorial: Large-Scale Multimedia Content Analysis on Social Media.
Multim. Tools Appl., 2016

Special issue: When social media meets physical world.
Multim. Syst., 2016

Boosted random contextual semantic space based representation for visual recognition.
Inf. Sci., 2016

Large-scale video copy retrieval with temporal-concentration SIFT.
Neurocomputing, 2016

Socio-mobile landmark recognition using local features with adaptive region selection.
Neurocomputing, 2016

Face database generation based on text-video correlation.
Neurocomputing, 2016

Semantic consistency hashing for cross-modal retrieval.
Neurocomputing, 2016

Incorporating visual adjectives for image classification.
Neurocomputing, 2016

Fine-residual VLAD for image retrieval.
Neurocomputing, 2016

Accurate Image Search with Multi-Scale Contextual Evidences.
Int. J. Comput. Vis., 2016

Good Practice in CNN Feature Transfer.
CoRR, 2016

Person Re-identification in the Wild.
CoRR, 2016

Coarse2Fine: Two-Layer Fusion For Image Retrieval.
CoRR, 2016

Sparse Matrix Based Hashing for Approximate Nearest Neighbor Search.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Category Aggregation Among Region Proposals for Object Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Automatic Endmember Extraction Using Pixel Purity Index for Hyperspectral Imagery.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Linear Distance Preserving Pseudo-Supervised and Unsupervised Hashing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Exploiting Hierarchical Activations of Neural Network for Image Retrieval.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Region similarity arrangement for image retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Towards temporal adaptive representation for video action recognition.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Creating Descriptive Visual Word Tree for Tag Ranking of Social Image.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Adaptively Weighted Graph Fusion for Image Retrieval.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

MARS: A Video Benchmark for Large-Scale Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2016, 2016

Geometric Neural Phrase Pooling: Modeling the Spatial Co-occurrence of Neurons.
Proceedings of the Computer Vision - ECCV 2016, 2016

Deep Attributes Driven Multi-camera Person Re-identification.
Proceedings of the Computer Vision - ECCV 2016, 2016

Smooth Neighborhood Structure Mining on Multiple Affinity Graphs with Applications to Context-Sensitive Similarity.
Proceedings of the Computer Vision - ECCV 2016, 2016

Cascaded Interactional Targeting Network for Egocentric Video Analysis.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Picking Deep Filter Responses for Fine-Grained Image Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

InterActive: Inter-Layer Activeness Propagation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

DisturbLabel: Regularizing CNN on the Loss Layer.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Fast Image Retrieval: Query Pruning and Early Termination.
IEEE Trans. Multim., 2015

Retargeting Semantically-Rich Photos.
IEEE Trans. Multim., 2015

Cross Indexing With Grouplets.
IEEE Trans. Multim., 2015

Fine-Grained Image Search.
IEEE Trans. Multim., 2015

Uniting Keypoints: Local Visual Information Fusion for Large-Scale Image Search.
IEEE Trans. Multim., 2015

Understanding Blooming Human Groups in Social Networks.
IEEE Trans. Multim., 2015

When Location Meets Social Multimedia: A Survey on Vision-Based Recognition and Mining for Geo-Social Multimedia Analytics.
ACM Trans. Intell. Syst. Technol., 2015

BSIFT: Toward Data-Independent Codebook for Large Scale Image Search.
IEEE Trans. Image Process., 2015

Full-Space Local Topology Extraction for Cross-Modal Retrieval.
IEEE Trans. Image Process., 2015

Beyond Explicit Codebook Generation: Visual Representation Using Implicitly Transferred Codebooks.
IEEE Trans. Image Process., 2015

Polar Embedding for Aurora Image Retrieval.
IEEE Trans. Image Process., 2015

Heterogeneous Graph Propagation for Large-Scale Web Image Search.
IEEE Trans. Image Process., 2015

Angular-Similarity-Preserving Binary Signatures for Linear Subspaces.
IEEE Trans. Image Process., 2015

Image Annotation by Latent Community Detection and Multikernel Learning.
IEEE Trans. Image Process., 2015

An Attribute-Assisted Reranking Model for Web Image Search.
IEEE Trans. Image Process., 2015

Image Search Reranking With Hierarchical Topic Awareness.
IEEE Trans. Cybern., 2015

Feature representation for statistical-learning-based object detection: A review.
Pattern Recognit., 2015

Semantic-Aware Co-Indexing for Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Visual word expansion and BSIFT verification for large-scale image search.
Multim. Syst., 2015

Tensor index for large scale image retrieval.
Multim. Syst., 2015

Multi-order visual phrase for scalable partial-duplicate visual search.
Multim. Syst., 2015

Fast large-scale object retrieval with binary quantization.
J. Electronic Imaging, 2015

Binary feature from intensity quantization and weakly spatial contextual coding for image search.
Inf. Sci., 2015

Image classification using boosted local features with random orientation and location selection.
Inf. Sci., 2015

Joint image representation and classification in random semantic spaces.
Neurocomputing, 2015

Cluster-sensitive Structured Correlation Analysis for Web cross-modal retrieval.
Neurocomputing, 2015

Image re-ranking with an alternating optimization.
Neurocomputing, 2015

A survey of recent advances in visual feature detection.
Neurocomputing, 2015

Visual Topic Network: Building better image representations for images in social media.
Comput. Vis. Image Underst., 2015

Person Re-identification Meets Image Search.
CoRR, 2015

Orientational Spatial Part Modeling for Fine-Grained Visual Categorization.
Proceedings of the 2015 IEEE International Conference on Mobile Services, MS 2015, New York City, NY, USA, June 27, 2015

Augmented Feature Fusion for Image Retrieval System.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Image Classification and Retrieval are ONE.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Hierarchical Encoding of Binary Descriptors for Image Matching.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Fast Democratic Aggregation and Query Fusion for Image Search.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Heterogeneous Semantic Level Features Fusion for Action Recognition.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Exploring feature space with semantic attributes.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Rank-aware graph fusion with contextual dissimilarity measurement for image retrieval.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Fine-grained visual categorization with fine-tuned segmentation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Scalable local feature matching without visual codebook training.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Scalable Person Re-identification: A Benchmark.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

RIDE: Reversal Invariant Descriptor Enhancement.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Multi-Task Learning with Low Rank Attribute Embedding for Person Re-Identification.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Similarity Gaussian Process Latent Variable Model for Multi-modal Data Analysis.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Interaction part mining: A mid-level approach for fine-grained action recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Query-adaptive late fusion for image search and person re-identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Coloring image search with coupled multi-index.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Fusing feature and similarity for multimodal search.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Salient target detection in hyperspectral images using spectral saliency.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

2014
Towards Codebook-Free: Scalable Cascaded Hashing for Mobile Image Search.
IEEE Trans. Multim., 2014

A Prior-Free Weighting Scheme for Binary Code Ranking.
IEEE Trans. Multim., 2014

Lp-Norm IDF for Scalable Image Retrieval.
IEEE Trans. Image Process., 2014

Coupled Binary Embedding for Large-Scale Image Retrieval.
IEEE Trans. Image Process., 2014

Scalable Similarity Search With Topology Preserving Hashing.
IEEE Trans. Image Process., 2014

USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval.
IEEE Trans. Image Process., 2014

Cascade Category-Aware Visual Search.
IEEE Trans. Image Process., 2014

Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation.
IEEE Trans. Image Process., 2014

Spatial Pooling of Heterogeneous Features for Image Classification.
IEEE Trans. Image Process., 2014

Cross-Indexing of Binary SIFT Codes for Large-Scale Image Search.
IEEE Trans. Image Process., 2014

Contextual Hashing for Large-Scale Image Search.
IEEE Trans. Image Process., 2014

Learning Cascaded Shared-Boost Classifiers for Part-Based Object Detection.
IEEE Trans. Image Process., 2014

Batch-Orthogonal Locality-Sensitive Hashing for Angular Similarity.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Special issue on contextual vision computing.
Mach. Vis. Appl., 2014

Interactive ads recommendation with contextual search on product topic space.
Multim. Tools Appl., 2014

Real-time motion data annotation via action string.
Comput. Animat. Virtual Worlds, 2014

Encoding Spatial Context for Large-Scale Partial-Duplicate Web Image Retrieval.
J. Comput. Sci. Technol., 2014

Salient region detection for complex background images using integrated features.
Inf. Sci., 2014

Object categorization in sub-semantic space.
Neurocomputing, 2014

Recognizing human group action by layered model with multiple cues.
Neurocomputing, 2014

Online MIL tracking with instance-level semi-supervised learning.
Neurocomputing, 2014

Indexing heterogeneous features with superimages.
Int. J. Multim. Inf. Retr., 2014

Embedding Multi-Order Spatial Clues for Scalable Visual Matching and Retrieval.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2014

ObjectPatchNet: Towards scalable and semantic image annotation and retrieval.
Comput. Vis. Image Underst., 2014

Fast and accurate near-duplicate image search with affinity propagation on the ImageWeb.
Comput. Vis. Image Underst., 2014

Exploiting local linear geometric structure for identifying correct matches.
Comput. Vis. Image Underst., 2014

Guest Editorial: Special issue on large scale multimedia semantic indexing.
Comput. Vis. Image Underst., 2014

Social-oriented visual image search.
Comput. Vis. Image Underst., 2014

Multimedia search reranking: A literature survey.
ACM Comput. Surv., 2014

Seeing the Big Picture: Deep Embedding with Contextual Evidences.
CoRR, 2014

Discriminative coupled dictionary hashing for fast cross-media retrieval.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Perception-Guided Multimodal Feature Fusion for Photo Aesthetics Assessment.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Salable Image Search with Reliable Binary Code.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Personalized Visual Vocabulary Adaption for Social Image Retrieval.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Social Embedding Image Distance Learning.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Fused one-vs-all mid-level features for fine-grained visual categorization.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Superimage: Packing Semantic-Relevant Images for Indexing and Retrieval.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Scalable Image Search with Multiple Index Tables.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Cross-media hashing with kernel regression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Mobile visual search via hierarchical sparse coding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

A Stereo-Vision-Assisted model for depth map super-resolution.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Hamming embedding with fragile bits for image search.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Max-SIFT: Flipping invariant descriptors for Web logo search.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Attribute prediction with long-range interactions via path coding.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Evaluation on the Impact of Image Quality on Image Retrieval.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Search by Detection: Object-Level Feature for Image Retrieval.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Visual reranking with improved image graph.
Proceedings of the IEEE International Conference on Acoustics, 2014

Pipelining Localized Semantic Features for Fine-Grained Action Recognition.
Proceedings of the Computer Vision - ECCV 2014, 2014

Bayes Merging of Multiple Vocabularies for Scalable Image Retrieval.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Packing and Padding: Coupled Multi-index for Accurate Image Retrieval.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Cross-Scale Cost Aggregation for Stereo Matching.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Orientational Pyramid Matching for Recognizing Indoor Scenes.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Semi-supervised Relational Topic Model for Weakly Annotated Image Recognition in Social Media.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Hybrid-Indexing Multi-type Features for Large-Scale Image Search.
Proceedings of the Computer Vision - ACCV 2014, 2014

Similarity-Preserving Binary Signature for Linear Subspaces.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
SIFT match verification by geometric coding for large-scale partial-duplicate web image search.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Learning to Photograph: A Compositional Perspective.
IEEE Trans. Multim., 2013

Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search.
IEEE Trans. Image Process., 2013

Discovering Discriminative Graphlets for Aerial Image Categories Recognition.
IEEE Trans. Image Process., 2013

High-Order Local Spatial Context Modeling by Spatialized Random Forest.
IEEE Trans. Image Process., 2013

Image classification using Harr-like transformation of local features with coding residuals.
Signal Process., 2013

Weakly supervised codebook learning by iterative label propagation with graph quantization.
Signal Process., 2013

Image classification using spatial pyramid robust sparse coding.
Pattern Recognit. Lett., 2013

Laplacian affine sparse coding with tilt and orientation consistency for image classification.
J. Vis. Commun. Image Represent., 2013

Sparse representations for image and video analysis.
J. Vis. Commun. Image Represent., 2013

A semantic feature for human motion retrieval.
Comput. Animat. Virtual Worlds, 2013

Beyond visual features: A weak semantic image representation using exemplar classifiers for classification.
Neurocomputing, 2013

Interactive social group recommendation for Flickr photos.
Neurocomputing, 2013

Mining spatiotemporal video patterns towards robust action retrieval.
Neurocomputing, 2013

COGE: A Novel Binary Feature Descriptor Exploring Anisotropy and Non-uniformity.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

A Novel Binary Feature from Intensity Difference Quantization between Random Sample of Points.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Distribution-Aware Locality Sensitive Hashing.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Social Visual Image Ranking for Web Image Search.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Improved binary feature matching through fusion of hamming distance and fragile bit weight.
Proceedings of the 3rd ACM international workshop on Interactive multimedia on mobile & portable devices, 2013

Undo the codebook bias by linear transformation for visual applications.
Proceedings of the ACM Multimedia Conference, 2013

Topology preserving hashing for similarity search.
Proceedings of the ACM Multimedia Conference, 2013

Beyond bag of words: image representation in sub-semantic space.
Proceedings of the ACM Multimedia Conference, 2013

Stereotime: a wireless 2D and 3D switchable video communication system.
Proceedings of the ACM Multimedia Conference, 2013

Locality preserving verification for image search.
Proceedings of the ACM Multimedia Conference, 2013

Static saliency vs. dynamic saliency: a comparative study.
Proceedings of the ACM Multimedia Conference, 2013

What are the distance metrics for local features?
Proceedings of the ACM Multimedia Conference, 2013

Scale based region growing for scene text detection.
Proceedings of the ACM Multimedia Conference, 2013

Object coding on the semantic graph for scene classification.
Proceedings of the ACM Multimedia Conference, 2013

Learning attribute-aware dictionary for image classification and search.
Proceedings of the International Conference on Multimedia Retrieval, 2013

Image search reranking with multi-latent topical graph.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Feature normalization for part-based image classification.
Proceedings of the IEEE International Conference on Image Processing, 2013

Improving scene classification with weakly spatial symmetry information.
Proceedings of the IEEE International Conference on Image Processing, 2013

Multi-order visual phrase for scalable image search.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Scalable mobile search with binary phrase.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Min-Max Hash for Jaccard Similarity.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Multimedia LEGO: Learning Structured Model by Probabilistic Logic Ontology Tree.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Hierarchical Part Matching for Fine-Grained Visual Categorization.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Lp-Norm IDF for Large Scale Image Search.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Binary Code Ranking with Weighted Hamming Distance.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Multiview Face Recognition: From TensorFace to V-TensorFace and K-TensorFace.
IEEE Trans. Syst. Man Cybern. Part B, 2012

S³MKL: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications.
IEEE Trans. Multim., 2012

Learning Semantics From Multimedia Web Resources: An Introduction to the Special Issue.
IEEE Trans. Multim., 2012

Visually Summarizing Web Pages Through Internal and External Images.
IEEE Trans. Multim., 2012

Context-Aware Semi-Local Feature Detector.
ACM Trans. Intell. Syst. Technol., 2012

Introduction to the Special Section on Intelligent Multimedia Systems and Technology Part II.
ACM Trans. Intell. Syst. Technol., 2012

Principal Visual Word Discovery for Automatic License Plate Detection.
IEEE Trans. Image Process., 2012

Discriminant Learning Through Multiple Principal Angles for Visual Recognition.
IEEE Trans. Image Process., 2012

Task-Dependent Visual-Codebook Compression.
IEEE Trans. Image Process., 2012

Image Annotation by Input-Output Structural Grouping Sparsity.
IEEE Trans. Image Process., 2012

Intelligent photo clustering with user interaction and distance metric learning.
Pattern Recognit. Lett., 2012

Intelligent multimedia interactivity.
Pattern Recognit. Lett., 2012

Tactic analysis based on real-world ball trajectory in soccer video.
Pattern Recognit., 2012

Exploring Context and Content Links in Social Media: A Latent Space Method.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Nearest-neighbor method using multiple neighborhood similarities for social media data mining.
Neurocomputing, 2012

A Boosting, Sparsity-Constrained Bilinear Model for Object Recognition.
IEEE Multim., 2012

Exploring tag relevance for image tag re-ranking.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Super-Bit Locality-Sensitive Hashing.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Scalar quantization for large scale image search.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Spatial pooling of heterogeneous features for image applications.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Image tag re-ranking by coupled probability transition.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Embedding spatial context information into inverted file for large-scale image retrieval.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Query expansion enhancement by fast binary matching.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Correlated attribute transfer with multi-task graph-guided fusion.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Attribute-assisted reranking for web image retrieval.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Visual query attributes suggestion.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Binary SIFT: towards efficient feature matching verification for image search.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Human Daily Action Analysis with Multi-view and Color-Depth Data.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Multi-feature metric learning with knowledge transfer among semantics and social tagging.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Context aware topic model for scene recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Graph-guided sparse reconstruction for region tagging.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Weakly supervised sparse coding with geometric consistency pooling.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Mining flickr landmarks by modeling reconstruction sparsity.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Less is More: Efficient 3-D Object Retrieval With Query View Selection.
IEEE Trans. Multim., 2011

Introduction to the special issue on intelligent multimedia systems and technology.
ACM Trans. Intell. Syst. Technol., 2011

Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications.
IEEE Trans. Image Process., 2011

Latent visual context learning for web image applications.
Pattern Recognit., 2011

Building descriptive and discriminative visual codebook for large-scale image applications.
Multim. Tools Appl., 2011

Personalization in multimedia retrieval: A survey.
Multim. Tools Appl., 2011

Vocabulary Hierarchy Optimization and Transfer for Scalable Image Search.
IEEE Multim., 2011

Modeling spatial and semantic cues for large-scale near-duplicated image retrieval.
Comput. Vis. Image Underst., 2011

ObjectBook construction for large-scale semantic-aware image retrieval.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

Semi-automatic Flickr Group Suggestion.
Proceedings of the Advances in Multimedia Modeling, 2011

Large scale image search with geometric coding.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning to judge image search results.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Picture-in-picture copy detection using spatial coding techniques.
Proceedings of the 2011 ACM international workshop on Automated media analysis and production for novel TV services, 2011

Learning heterogeneous data for hierarchical web video classification.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Spatial pooling for transformation invariant image representation.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Human group activity analysis with fusion of motion and appearance information.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

One step beyond bags of features: Visual categorization using components.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Probabilistic indexing of media sequences.
Proceedings of the ICIMCS 2011, 2011

Image classification by non-negative sparse coding, low-rank and sparse decomposition.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Locality-sensitive support vector machine by exploring local correlation and global regularization.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Towards cross-category knowledge propagation for learning visual concepts.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Spatial-DiscLDA for visual recognition.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Geometric ℓp-norm feature pooling for image classification.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Efficient ℓp-norm multiple feature metric learning for image categorization.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Effective image representation based on bi-layer visual codebook.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010
Affective Visualization and Retrieval for Music Video.
IEEE Trans. Multim., 2010

Constructing Concept Lexica With Small Semantic Gaps.
IEEE Trans. Multim., 2010

Correlation-Based Feature Selection and Regression.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

AdVR: Linking Ad Video with Products or Service.
Proceedings of the Advances in Multimedia Modeling, 2010

Large scale partially duplicated web image retrieval.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Spatial coding for large scale partial-duplicate web image search.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Building contextual visual vocabulary for large-scale image applications.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Multi-label boosting for image annotation by structural grouping sparsity.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

S3MKL: scalable semi-supervised multiple kernel learning for image data mining.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Nearest-neighbor classification using unlabeled data for real world image application.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

The use of non-conventional methods for content analysis and understanding: panel overview.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Real-world trajectory extraction for attack pattern analysis in soccer video.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Learning to photograph.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Canonical Image Selection by Visual Context Learning.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Interactive Web Video Advertising with Context Analysis and Search.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Multiple Kernel Learning with High Order Kernels.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Action Recognition Using Spatial-Temporal Context.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Group Activity Recognition by Gaussian Processes Estimation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Multiple instance learning using visual phrases for object classification.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Annotate Wikipedia with Flickr images: concepts and case study.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Interactive service recommendation based on ad concept hierarchy.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Visual topic model for web image annotation.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Large scale partial-duplicate image retrieval with bi-space quantization and geometric consistency.
Proceedings of the IEEE International Conference on Acoustics, 2010

Building pair-wise visual word tree for efficient image re-ranking.
Proceedings of the IEEE International Conference on Acoustics, 2010

Cross-database age estimation based on transfer learning.
Proceedings of the IEEE International Conference on Acoustics, 2010

Multi-level trajectory modeling for video copy detection.
Proceedings of the IEEE International Conference on Acoustics, 2010

Latent visual context analysis for image re-ranking.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Music video affective understanding using feature importance analysis.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Image Classification Using Spatial Pyramid Coding and Visual Word Reweighting.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
Integration of Context and Content for Multimedia Management: An Introduction to the Special Issue.
IEEE Trans. Multim., 2009

Discriminant Subspace Analysis: An Adaptive Approach for Image Classification.
IEEE Trans. Multim., 2009

Improved concept similarity measuring in the visual domain.
Proceedings of the 2009 IEEE International Workshop on Multimedia Signal Processing, 2009

Descriptive visual words and visual phrases for image applications.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Multimedia content analysis: model-based approaches vs. data-driven approaches.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

The 1st workshop on large-scale multimedia retrieval and mining (LS-MMRM'09).
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Visual ContextRank for web image re-ranking.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

Refining image retrieval using one-class classification.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A lexica family with small semantic gap.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

What can visual content analysis do for text based image search?
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Utilizing affective analysis for efficient movie browsing.
Proceedings of the International Conference on Image Processing, 2009

Category sensitive codebook construction for object category recognition.
Proceedings of the International Conference on Image Processing, 2009

Visual block link analysis for image re-ranking.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

2008
Adaptive discriminant analysis for microarray-based classification.
ACM Trans. Knowl. Discov. Data, 2008

Semantic Subspace Projection and Its Applications in Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2008

Distance Learning for Similarity Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Signal Processing for Applications in Healthcare Systems.
EURASIP J. Adv. Signal Process., 2008

Similarity Matching in Computer Vision and Multimedia.
Comput. Vis. Image Underst., 2008

Personalized MTV Affective Analysis Using User Profile.
Proceedings of the Advances in Multimedia Information Processing, 2008

i.MTV: an integrated system for MTV affective analysis.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Learning object from small and imbalanced dataset with Boost-BFKO.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Affective MTV analysis based on arousal and valence features.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

What are the high-level concepts with small semantic gaps?
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Integrating Discriminant and Descriptive Information for Dimension Reduction and Classification.
IEEE Trans. Circuits Syst. Video Technol., 2007

Interactive Semisupervised Learning for Microarray Analysis.
IEEE ACM Trans. Comput. Biol. Bioinform., 2007

Learning Microarray Gene Expression Data by Hybrid Discriminant Analysis.
IEEE Multim., 2007

FADA: An Efficient Dimension Reduction Scheme for Image Classification.
Proceedings of the Advances in Multimedia Information Processing, 2007

Feature selection using principal feature analysis.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Personalized multimedia retrieval: the new trend?
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Interactive Boosting for Image Classification.
Proceedings of the Multiple Classifier Systems, 7th International Workshop, 2007

Semantic Analysis and Personalization for Mobile Media Applications.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

i.Boosting for Image Classification.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Integrating Relevance Feedback in Boosting for Content-Based Image Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2007

Two-Dimensional Adaptive Discriminant Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Semantic retrieval of video - review of research on video retrieval in meetings, movies and broadcast news, and sports.
IEEE Signal Process. Mag., 2006

Learning image manifolds by semantic subspace projection.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Adaptive Discriminant Projection for Content-based Image Retrieval.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Toward Intelligent Use of Semantic Information on Subspace Discovery for Image Retrieval.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A New Study on Distance Metrics as Similarity Measurement.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Self-supervised Learning Framework for Classifying Microarray Gene Expression Data.
Proceedings of the Computational Science, 2006

Constructing Descriptive and Discriminant Features for Face Classification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Toward Robust Distance Metric Analysis for Similarity Estimation.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Self-supervised learning based on discriminative nonlinear features for image classification.
Pattern Recognit., 2005

Boosting Multiple Classifiers Constructed by Hybrid Discriminant Analysis.
Proceedings of the Multiple Classifier Systems, 6th International Workshop, 2005

Video Object Boundary Reconstruction by 2-Pass Voting.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Neighborhood issue in single-frame image super-resolution.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Hybrid PCA and LDA Analysis of Microarray Gene Expression Data.
Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, 2005

2004
Visualization and User-Modeling for Browsing Personal Photo Libraries.
Int. J. Comput. Vis., 2004

Complete Performance Graphs in Probabilistic Information Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A new analysis of the value of unlabeled data in semi-supervised learning for image retrieval.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Parameterized discriminant analysis for image classification.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Toward an improved error metric.
Proceedings of the 2004 International Conference on Image Processing, 2004

Learning based on kernel discriminant-EM algorithm for image classification.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Robust Error Metric Analysis for Noise Estimation in Image Indexing.
Proceedings of MDDE '04, 2004

2003
Evaluation of salient point techniques.
Image Vis. Comput., 2003

2002
Content-Based Visualization and Retrieval for Image Libraries.
PhD thesis, 2002

PDH: a human-centric interface for image libraries.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Visualization, Estimation and User-Modeling for Interactive Browsing of Image Libraries.
Proceedings of the Image and Video Retrieval, International Conference, 2002

2001
Image retrieval using wavelet-based salient points.
J. Electronic Imaging, 2001

Content-based image retrieval using wavelet-based salient points.
Proceedings of the Storage and Retrieval for Media Databases 2001, 2001

Display Optimization for Image Browsing.
Proceedings of the Multimedia Databases and Image Communication, 2001

Spatial Visualization For Content-Based Image Retrieval.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

2000
Integrating Unlabeled Images for Image Retrieval Based on Relevance Feedback.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Incorporate Discriminant Analysis with EM Algorithm in Image Retrieval.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Update Relevant Image Weights for Content-Based Image Retrieval using Support Vector Machines.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Combine User Defined Region-of-Interest and Spatial Layout for Image Retrieval.
Proceedings of the 2000 International Conference on Image Processing, 2000

Incorporate Support Vector Machines to Content-Based Image Retrieval with Relevant Feedback.
Proceedings of the 2000 International Conference on Image Processing, 2000

Discriminant-EM Algorithm with Application to Image Retrieval.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

1994
Error criteria analysis and robust data fusion.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

