Xian-Sheng Hua

Orcid: 0000-0002-8232-5049

Affiliations:
  • Alibaba DAMO Academy, Artificial Intelligence Center, Hangzhou, China


According to our database1, Xian-Sheng Hua authored at least 502 papers between 2001 and 2024.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2016, "For contributions to multimedia content analysis and image search".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
HARR: Learning Discriminative and High-Quality Hash Codes for Image Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., May, 2024

DIOR: Learning to Hash With Label Noise Via Dual Partition and Contrastive Learning.
IEEE Trans. Knowl. Data Eng., April, 2024

Messages are Never Propagated Alone: Collaborative Hypergraph Neural Network for Time-Series Forecasting.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Study on Spatio-Temporal Patterns of Commuting under Adverse Weather Events: Case Study of Typhoon In-Fa.
ISPRS Int. J. Geo Inf., February, 2024

CLEAR: Cluster-Enhanced Contrast for Self-Supervised Graph Representation Learning.
IEEE Trans. Neural Networks Learn. Syst., January, 2024

Criterion-based Heterogeneous Collaborative Filtering for Multi-behavior Implicit Recommendation.
ACM Trans. Knowl. Discov. Data, January, 2024

An Evaluation of Large Language Models in Bioinformatics Research.
CoRR, 2024

Learning with Imbalanced Noisy Data by Preventing Bias in Sample Selection.
CoRR, 2024

2023
Optimizing traffic efficiency via a reinforcement learning approach based on time allocation.
Int. J. Mach. Learn. Cybern., October, 2023

Urban Traffic Light Control via Active Multi-Agent Communication and Supply-Demand Modeling.
IEEE Trans. Knowl. Data Eng., April, 2023

TSNAdb v2.0: The Updated Version of Tumor-specific Neoantigen Database.
Genom. Proteom. Bioinform., April, 2023

A Survey on Deep Hashing Methods.
ACM Trans. Knowl. Discov. Data, January, 2023

Single Person Dense Pose Estimation via Geometric Equivariance Consistency.
IEEE Trans. Multim., 2023

Boosting Robust Learning Via Leveraging Reusable Samples in Noisy Web Data.
IEEE Trans. Multim., 2023

FECANet: Boosting Few-Shot Semantic Segmentation With Feature-Enhanced Context-Aware Network.
IEEE Trans. Multim., 2023

Toward Effective Domain Adaptive Retrieval.
IEEE Trans. Image Process., 2023

Hierarchical Graph Pattern Understanding for Zero-Shot Video Object Segmentation.
IEEE Trans. Image Process., 2023

Learning comprehensive global features in person re-identification: Ensuring discriminativeness of more local regions.
Pattern Recognit., 2023

Smart Community Networks and Systems.
IEEE Netw., 2023

Hierarchical Graph Pattern Understanding for Zero-Shot VOS.
CoRR, 2023

Proposal-Level Unsupervised Domain Adaptation for Open World Unbiased Detector.
CoRR, 2023

CoCo: A Coupled Contrastive Framework for Unsupervised Domain Adaptive Graph Classification.
CoRR, 2023

PastNet: Introducing Physical Inductive Biases for Spatio-temporal Video Prediction.
CoRR, 2023

DANCE: Learning A Domain Adaptive Framework for Deep Hashing.
Proceedings of the ACM Web Conference 2023, 2023

IDEA: An Invariant Perspective for Efficient Domain Adaptive Image Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Anatomy-Aware Lymph Node Detection in Chest CT Using Implicit Station Stratification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023 Workshops, 2023

CoCo: A Coupled Contrastive Framework for Unsupervised Domain Adaptive Graph Classification.
Proceedings of the International Conference on Machine Learning, 2023

Contextual Convolutional Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dynamic Hypergraph Structure Learning for Traffic Flow Forecasting.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Prototypical Mixing and Retrieval-based Refinement for Label Noise-resistant Image Retrieval.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Random Boxes Are Open-world Object Detectors.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Guest Editorial: Learning From Noisy Multimedia Data.
IEEE Trans. Multim., 2022

Apparel-Invariant Feature Learning for Person Re-Identification.
IEEE Trans. Multim., 2022

Progressive Transfer Learning.
IEEE Trans. Image Process., 2022

Graph Convolutional Dictionary Selection With L₂<sub>, </sub>ₚ Norm for Video Summarization.
IEEE Trans. Image Process., 2022

Offline-Online Associated Camera-Aware Proxies for Unsupervised Person Re-Identification.
IEEE Trans. Image Process., 2022

Dense Semantics-Assisted Networks for Video Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Centerness-Aware Network for Temporal Action Proposal.
IEEE Trans. Circuits Syst. Video Technol., 2022

Dynamic supervisor for cross-dataset object detection.
Neurocomputing, 2022

Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution.
CoRR, 2022

Box-supervised Instance Segmentation with Level Set Evolution.
CoRR, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.
CoRR, 2022

Online Convolutional Re-parameterization.
CoRR, 2022

Disentangled Representation Learning for Text-Video Retrieval.
CoRR, 2022

Multiphysical graph neural network (MP-GNN) for COVID-19 drug design.
Briefings Bioinform., 2022

Molecular persistent spectral image (Mol-PSI) representation for machine learning models in drug design.
Briefings Bioinform., 2022

Towards Counterfactual Image Manipulation via CLIP.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DEAL: An Unsupervised Domain Adaptive Framework for Graph-level Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Token Embeddings Alignment for Cross-Modal Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

HEART: Towards Effective Hash Codes under Label Noise.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Improved Deep Unsupervised Hashing via Prototypical Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CoHOZ: Contrastive Multimodal Prompt Tuning for Hierarchical Open-set Zero-shot Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Pursuing Knowledge Consistency: Supervised Hierarchical Contrastive Learning for Facial Action Unit Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Effective Opportunistic Esophageal Cancer Screening Using Noncontrast CT Imaging.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

DeepCRC: Colorectum and Colorectal Cancer Segmentation in CT Scans via Deep Colorectal Coordinate Transform.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

RemixFormer: A Transformer Model for Precision Skin Tumor Differential Diagnosis via Multi-modal Imaging and Non-imaging Data.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Thoracic Lymph Node Segmentation in CT Imaging via Lymph Node Station Stratification and Size Encoding.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

TGNN: A Joint Semi-supervised Framework for Graph-level Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

On Non-Random Missing Labels in Semi-Supervised Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Dynamic Hypergraph Convolutional Network.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

DualGraph: Improving Semi-supervised Graph Classification via Dual Contrastive Learning.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Identifying Hard Noise in Long-Tailed Sample Distribution.
Proceedings of the Computer Vision - ECCV 2022, 2022

Beyond a Video Frame Interpolator: A Space Decoupled Learning Approach to Continuous Image Transition.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking IoU-based Optimization for Single-stage 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Delving into Details: Synopsis-to-Detail Networks for Video Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Box-Supervised Instance Segmentation with Level Set Evolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unleashing the Potential of Adaptation Models via Go-getting Domain Labels.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Balanced and Hierarchical Relation Learning for One-shot Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CDAD: A Common Daily Action Dataset with Collected Hard Negative Samples.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Meta Convolutional Neural Networks for Single Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unpaired Cartoon Image Synthesis via Gated Cycle Mapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Online Convolutional Reparameterization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Homography Loss for Monocular 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dense Learning based Semi-Supervised Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SP-ViT: Learning 2D Spatial Priors for Vision Transformers.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Cross-Domain Empirical Risk Minimization for Unbiased Long-Tailed Classification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Active Boundary Loss for Semantic Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
DecorIn: An Automatic Method for Plane-Based Decorating.
IEEE Trans. Vis. Comput. Graph., 2021

Self-Adaptive Neural Module Transformer for Visual Question Answering.
IEEE Trans. Multim., 2021

Joint Auction-Coalition Formation Framework for Communication-Efficient Federated Learning in UAV-Enabled Internet of Vehicles.
IEEE Trans. Intell. Transp. Syst., 2021

Towards Federated Learning in UAV-Enabled Internet of Vehicles: A Multi-Dimensional Contract-Matching Approach.
IEEE Trans. Intell. Transp. Syst., 2021

Towards Fine-Grained Human Pose Transfer With Detail Replenishing Network.
IEEE Trans. Image Process., 2021

Unsupervised Discrete Hashing With Affinity Similarity.
IEEE Trans. Image Process., 2021

Coarse-to-Fine Semantic Alignment for Cross-Modal Moment Localization.
IEEE Trans. Image Process., 2021

Spatial likelihood voting with self-knowledge distillation for weakly supervised object detection.
Image Vis. Comput., 2021

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification.
CoRR, 2021

PANet: Perspective-Aware Network with Dynamic Receptive Fields and Self-Distilling Supervision for Crowd Counting.
CoRR, 2021

Density-Based Clustering with Kernel Diffusion.
CoRR, 2021

Improving 3D Object Detection with Channel-wise Transformer.
CoRR, 2021

Aug3D-RPN: Improving Monocular 3D Object Detection by Synthetic Images with Virtual Depth.
CoRR, 2021

Criterion-based Heterogeneous Collaborative Filtering for Multi-behavior Implicit Recommendation.
CoRR, 2021

Attention-guided Temporal Coherent Video Object Matting.
CoRR, 2021

Discriminative-Generative Dual Memory Video Anomaly Detection.
CoRR, 2021

Half-Real Half-Fake Distillation for Class-Incremental Semantic Segmentation.
CoRR, 2021

Active Boundary Loss for Semantic Segmentation.
CoRR, 2021

Graph-Induced Contrastive Learning for Intra-Camera Supervised Person Re-Identification.
IEEE Access, 2021

Towards Precise Intra-camera Supervised Person Re-Identification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

$\alpha$-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Large-scale vehicle trajectory reconstruction with camera sensing network.
Proceedings of the ACM MobiCom '21: The 27th Annual International Conference on Mobile Computing and Networking, 2021

Attention-guided Temporally Coherent Video Object Matting.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Pairwise VLAD Interaction Network for Video Question Answering.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Statistical Approach to Mining Semantic Similarity for Deep Unsupervised Hashing.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

CIMON: Towards High-quality Hash Codes.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Graph Contrastive Clustering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Self-Regulation for Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Transporting Causal Mechanisms for Unsupervised Domain Adaptation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improving 3D Object Detection with Channel-wise Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

3D Local Convolutional Neural Networks for Gait Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dense Interaction Learning for Video-based Person Re-identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Counterfactual Zero-Shot and Open-Set Visual Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Counterfactual VQA: A Cause-Effect Look at Language Bias.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Revisiting Knowledge Distillation: An Inheritance and Exploration Framework.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Distilling Causal Effect of Data in Class-Incremental Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Partial Person Re-Identification With Part-Part Correspondence Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SpineOne: A One-Stage Detection Framework for Degenerative Discs and Vertebrae.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

Traffic Flow Prediction with Vehicle Trajectories.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Category Dictionary Guided Unsupervised Domain Adaptation for Object Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Asynchronous Teacher Guided Bit-wise Hard Mining for Online Hashing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Camera-Aware Proxies for Unsupervised Person Re-Identification.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Concentrated Local Part Discovery With Fine-Grained Part Representation for Person Re-Identification.
IEEE Trans. Multim., 2020

SIF: Self-Inspirited Feature Learning for Person Re-Identification.
IEEE Trans. Image Process., 2020

Deep Saliency Hashing for Fine-Grained Retrieval.
IEEE Trans. Image Process., 2020

Decouple co-adaptation: Classifier randomization for person re-identification.
Neurocomputing, 2020

Learning to Generate Content-Aware Dynamic Detectors.
CoRR, 2020

FGAGT: Flow-Guided Adaptive Graph Tracking.
CoRR, 2020

CIMON: Towards High-quality Hash Codes.
CoRR, 2020

Apparel-invariant Feature Learning for Apparel-changed Person Re-identification.
CoRR, 2020

Deep Robust Clustering by Contrastive Learning.
CoRR, 2020

Salvage Reusable Samples from Noisy Data for Robust Learning.
CoRR, 2020

Stable Learning via Causality-based Feature Rectification.
CoRR, 2020

A Survey on Deep Hashing Methods.
CoRR, 2020

Dynamic Spatio-Temporal Graph-Based CNNs for Traffic Flow Prediction.
IEEE Access, 2020

Incentive Mechanism Design for Federated Learning in the Internet of Vehicles.
Proceedings of the 92nd IEEE Vehicular Technology Conference, 2020

Causal Intervention for Weakly-Supervised Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Interventional Few-Shot Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Bridging the Web Data and Fine-Grained Visual Recognition via Alleviating Label Noise and Domain Mismatch.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

CRSSC: Salvage Reusable Samples from Noisy Data for Robust Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

PyRetri: A PyTorch-based Library for Unsupervised Image Retrieval by Deep Convolutional Neural Networks.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Landmarks Detection with Anatomical Constraints for Total Hip Arthroplasty Preoperative Measurements.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Weakly Supervised Organ Localization with Attention Maps Regularized by Local Area Reconstruction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Training Liver Vessel Segmentation Deep Neural Networks on Noisy Labels from Contrast CT Imaging.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

Optimizing Filter-bank Canonical Correlation Analysis for fast response SSVEP Brain-Computer Interface (BCI).
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

MaCAR: Urban Traffic Light Control via Active Multi-agent Communication and Action Rectification.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Adversarial Mutual Information for Text Generation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Communication-Efficient Federated Learning in UAV-enabled IoV: A Joint Auction-Coalition Approach.
Proceedings of the IEEE Global Communications Conference, 2020

Multi-Dimensional Contract-Matching for Federated Learning in UAV-Enabled Internet of Vehicles.
Proceedings of the IEEE Global Communications Conference, 2020

Feature Pyramid Transformer.
Proceedings of the Computer Vision - ECCV 2020, 2020

Momentum Batch Normalization for Deep Learning with Small Batch Size.
Proceedings of the Computer Vision - ECCV 2020, 2020

Gradient Centralization: A New Optimization Technique for Deep Neural Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Boosting Semantic Human Matting With Coarse Annotations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Structure Aware Single-Stage 3D Object Detection From Point Cloud.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

SLV: Spatial Likelihood Voting for Weakly Supervised Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

CPR-GCN: Conditional Partial-Residual Graph Convolutional Network in Automated Anatomical Labeling of Coronary Arteries.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Second-Order Camera-Aware Color Transformation for Cross-Domain Person Re-identification.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Part-Aware Attention Network for Person Re-identification.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

HoMM: Higher-Order Moment Matching for Unsupervised Domain Adaptation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-level Similarity Perception Network for Person Re-identification.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Panoramic Background Image Generation for PTZ Cameras.
IEEE Trans. Image Process., 2019

Foreground Gating and Background Refining Network for Surveillance Object Detection.
IEEE Trans. Image Process., 2019

Sharp Attention Network via Adaptive Sampling for Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2019

Towards Self-similarity Consistency and Feature Discrimination for Unsupervised Domain Adaptation.
CoRR, 2019

Discriminative Coronary Artery Tracking via 3D CNN in Cardiac CT Angiography.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Automated Segmentation Of Pulmonary Lobes Using Coordination-Guided Deep Neural Networks.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Volume R-CNN: Unified Framework for CT Object Detection and Instance Segmentation.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Progressive Transfer Learning for Person Re-identification.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Homocentric Hypersphere Feature Embedding for Person Re-Identification.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Dynamic Anchor Feature Selection for Single-Shot Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Attribute-Driven Feature Disentangling and Temporal Aggregation for Video Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Quantization Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Concept Detection based on Multi-label Classification and Image Captioning Approach - DAMO at ImageCLEF 2019.
Proceedings of the Working Notes of CLEF 2019, 2019

A Multi-Task Learning Framework for Extracting Bacteria Biotope Information.
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019

2018
Video Content Structure.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

DeepProduct: Mobile Product Search With Portable Deep Features.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Multi-Task Vehicle Detection With Region-of-Interest Voting.
IEEE Trans. Image Process., 2018

Deep Active Learning for Video-based Person Re-identification.
CoRR, 2018

Dynamic Spatio-temporal Graph-based CNNs for Traffic Prediction.
CoRR, 2018

Video2Shop: Exactly Matching Clothes in Videos to Online Shopping Images.
CoRR, 2018

The City Brain: Towards Real-Time Search for the Real-World.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Local Convolutional Neural Networks for Person Re-Identification.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: Multimodal-2 (Cross-Modal Translation).
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: Multimodal-1 (Multimodal Reasoning).
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Challenges and Practices of Large Scale Visual Intelligence in the Real-World.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Previewer for Multi-Scale Object Detector.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Extracting Privileged Information from Untagged Corpora for Classifier Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

An Adversarial Approach to Hard Triplet Generation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Global Versus Localized Generative Adversarial Nets.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Foreground Gated Network for Surveillance Object Detection.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
Exploiting Web Images for Dataset Construction: A Domain Robust Approach.
IEEE Trans. Multim., 2017

Two-Stage Friend Recommendation Based on Network Alignment and Series Expansion of Probabilistic Topic Model.
IEEE Trans. Multim., 2017

Video eCommerce++: Toward Large Scale Online Video Advertising.
IEEE Trans. Multim., 2017

A new web-supervised method for image dataset constructions.
Neurocomputing, 2017

Refining Image Categorization by Exploiting Web Images and General Corpus.
CoRR, 2017

Spatio-Temporal AutoEncoder for Video Anomaly Detection.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Stylized Adversarial AutoEncoder for Image Generation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Layout Style Modeling for Automating Banner Design.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Learning Feature Embedding with Strong Neural Activations for Fine-Grained Retrieval.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Deep Siamese Network with Multi-level Similarity Perception for Person Re-identification.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Spatiotemporal Multi-Task Network for Human Activity Understanding.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Social Friend Recommendation Based on Multiple Network Correlation.
IEEE Trans. Multim., 2016

Extracting Visual Knowledge from the Internet: Making Sense of Image Data.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

A Domain Robust Approach For Image Dataset Construction.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Video eCommerce: Towards Online Video Advertising.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Deep CTR Prediction in Display Advertising.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Automatic image dataset construction with multiple textual metadata.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

2015
Introduction to the Special Section on Visual Computing in the Cloud: Fundamentals and Applications.
IEEE Trans. Circuits Syst. Video Technol., 2015

Learning Visual Semantic Relationships for Efficient Visual Retrieval.
IEEE Trans. Big Data, 2015

TapTell: Interactive visual search for mobile task recommendation.
J. Vis. Commun. Image Represent., 2015

Social Friend Recommendation Based on Network Correlation and Feature Co-Clustering.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Automatic Preview Frame Selection for Online Videos.
Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

SAPPHIRE: an always-on context-aware computer vision system for portable devices.
Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

Prajna: Towards Recognizing Whatever You Want from Images without Image Labeling.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Exploiting On-Device Image Classification for Energy Efficiency in Ambient-Aware Systems.
Proceedings of the Mobile Cloud Visual Media Computing - From Interaction to Service, 2015

Cloud-Based Mobile Experience Sharing Through Automatic Multimedia Blogging.
Proceedings of the Mobile Cloud Visual Media Computing - From Interaction to Service, 2015

2014
Community Discovery from Social Media by Low-Rank Matrix Recovery.
ACM Trans. Intell. Syst. Technol., 2014

Regularized Tree Partitioning and Its Application to Unsupervised Image Segmentation.
IEEE Trans. Image Process., 2014

Social Image Tagging With Diverse Semantics.
IEEE Trans. Cybern., 2014

Trinary-Projection Trees for Approximate Nearest Neighbor Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Typicality ranking: beyond accuracy for video semantic annotation.
Multim. Tools Appl., 2014

Special section on learning from multiple evidences for large scale multimedia analysis.
Comput. Vis. Image Underst., 2014

Image tag refinement by regularized latent Dirichlet allocation.
Comput. Vis. Image Underst., 2014

Multifold Concept Relationships Metrics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Pushing Image Recognition in the Real World: Towards Recognizing Millions of Entities.
Proceedings of the First International Workshop on Internet-Scale Multimedia Management, 2014

Plant identification with noisy web data.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

L2, 0 constrained sparse dictionary selection for video summarization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Mining knowledge from clicks: MSR-Bing image retrieval challenge.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Tell me what.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

2013
Near-lossless semantic video summarization and its applications to video analysis.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Image retrieval with query-adaptive hashing.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Searching for images by video.
Int. J. Multim. Inf. Retr., 2013

Hybrid Affinity Propagation.
CoRR, 2013

Towards next generation multimedia recommendation systems.
Proceedings of the ACM Multimedia Conference, 2013

Clickage: towards bridging semantic and intent gaps via mining click logs of search engines.
Proceedings of the ACM Multimedia Conference, 2013

2012
Guest editorial: content, concept and context mining in social media.
World Wide Web, 2012

Special Issue on Subspace and Manifold Learning for Image and Video Indexing and Search.
IEEE Trans. Syst. Man Cybern. Part B, 2012

A unified context model for web image retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2012

ImageSense: Towards contextual image advertising.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Correction to "Bayesian Visual Reranking".
IEEE Trans. Multim., 2012

Bridging the Semantic Gap via Functional Brain Imaging.
IEEE Trans. Multim., 2012

Ranking Model Adaptation for Domain-Specific Search.
IEEE Trans. Knowl. Data Eng., 2012

Introduction to the Special Section on Intelligent Multimedia Systems and Technology Part II.
ACM Trans. Intell. Syst. Technol., 2012

Intelligent photo clustering with user interaction and distance metric learning.
Pattern Recognit. Lett., 2012

Flickr Distance: A Relationship Measure for Visual Concepts.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Ensemble Manifold Regularization.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Social media mining and search.
Multim. Tools Appl., 2012

A comprehensive representation scheme for video semantic ontology and its applications in semantic concept detection.
Neurocomputing, 2012

Assistive tagging: A survey of multimedia tagging with human-computer joint exploration.
ACM Comput. Surv., 2012

Interactive mobile visual search for social activities completion using query image contextual model.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Color filter for image search.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Scalable similar image search by joint indices.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Tag filtering based on similar compatible principle.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Image search results refinement via outlier detection using deep contexts.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Elements of Visual Concept Analysis.
Proceedings of the Multimedia Analysis, Processing and Communications, 2011

Contextual Video Recommendation by Multimodal Relevance and User Feedback.
ACM Trans. Inf. Syst., 2011

Tag Tagging: Towards More Descriptive Keywords of Image Content.
IEEE Trans. Multim., 2011

Object Retrieval Using Visual Query Context.
IEEE Trans. Multim., 2011

Bayesian Visual Reranking.
IEEE Trans. Multim., 2011

Image Retagging Using Collaborative Tag Propagation.
IEEE Trans. Multim., 2011

Semi-Automatic Tagging of Photo Albums via Exemplar Selection and Tag Inference.
IEEE Trans. Multim., 2011

Interactive Image Search by Color Map.
ACM Trans. Intell. Syst. Technol., 2011

Active learning in multimedia annotation and retrieval: A survey.
ACM Trans. Intell. Syst. Technol., 2011

Introduction to the special issue on intelligent multimedia systems and technology.
ACM Trans. Intell. Syst. Technol., 2011

Assemble New Object Detector With Few Examples.
IEEE Trans. Image Process., 2011

Image Decomposition With Multilabel Context: Algorithms and Applications.
IEEE Trans. Image Process., 2011

Contextual Bag-of-Words for Visual Categorization.
IEEE Trans. Circuits Syst. Video Technol., 2011

A transductive multi-label learning approach for video concept detection.
Pattern Recognit., 2011

PLBP: An effective local binary patterns texture descriptor with pyramid representation.
Pattern Recognit., 2011

Content-based tag processing for Internet social images.
Multim. Tools Appl., 2011

Interactive multimedia computing.
Multim. Syst., 2011

Interactive browsing via diversified visual summarization for image search results.
Multim. Syst., 2011

Clip-based hierarchical representation for near-duplicate video detection.
Int. J. Comput. Math., 2011

Visual Content Identification and Search.
IEEE Multim., 2011

WonderWhat: real-time event determination from photos.
Proceedings of the 20th International Conference on World Wide Web, 2011

Graph-cut based tag enrichment.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

FAMER: Making Multi-Instance Learning Better and Faster.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Tap-to-search: Interactive and contextual visual search on mobile devices.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

Community Discovery from Movie and Its Application to Poster Generation.
Proceedings of the Advances in Multimedia Modeling, 2011

Modeling social strength in social media community via kernel-based learning.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

TapTell: understanding visual intents on-the-go.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Video-based image retrieval.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Hybrid image summarization.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Web-scale image search by color sketch.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Multimedia tagging: past, present and future.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Internet multimedia advertising: techniques and technologies.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Contextual image search.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

<i>StoryImaging</i>: a media-rich presentation system for textual stories.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

The role of attractiveness in web image search.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Million-scale near-duplicate video retrieval system.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards Optimal Discriminating Order for Multiclass Classification.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Predicting occupation via human clothing and contexts.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Tag-Based Social Image Search: Toward Relevant and Diverse Results.
Proceedings of the Social Media Modeling and Computing., 2011

2010
Joint Learning of Labels and Distance Metric.
IEEE Trans. Syst. Man Cybern. Part B, 2010

Visual query suggestion: Towards capturing user intent in internet image search.
ACM Trans. Multim. Comput. Commun. Appl., 2010

Towards a Relevant and Diverse Search of Social Images.
IEEE Trans. Multim., 2010

In-Image Accessibility Indication.
IEEE Trans. Multim., 2010

Image Classification With Kernelized Spatial-Context.
IEEE Trans. Multim., 2010

Accessible image search for colorblindness.
ACM Trans. Intell. Syst. Technol., 2010

Active Reranking for Web Image Search.
IEEE Trans. Image Process., 2010

Typicality-Based Visual Search Reranking.
IEEE Trans. Circuits Syst. Video Technol., 2010

Contextual Internet Multimedia Advertising.
Proc. IEEE, 2010

GameSense: game-like in-image advertising.
Multim. Tools Appl., 2010

AdOn: toward contextual overlay in-video advertising.
Multim. Syst., 2010

Visual quality assessment for web videos.
J. Vis. Commun. Image Represent., 2010

Metric learning with feature decomposition for image categorization.
Neurocomputing, 2010

Interactive image search by 2D semantic map.
Proceedings of the 19th International Conference on World Wide Web, 2010

Retagging social images based on visual and semantic consistency.
Proceedings of the 19th International Conference on World Wide Web, 2010

Image search by concept map.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Effective music tagging through advanced statistical modeling.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Social Image Search with Diverse Relevance Ranking.
Proceedings of the Advances in Multimedia Modeling, 2010

Dynamic Video Collage.
Proceedings of the Advances in Multimedia Modeling, 2010

Visual Reranking with Local Learning Consistency.
Proceedings of the Advances in Multimedia Modeling, 2010

Tagging tags.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Real-time large scale near-duplicate web video retrieval.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Image retagging.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Large-scale robust visual codebook construction.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Melog.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

ACM workshop on mobile cloud media computing.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Bridging low-level features and high-level semantics via fMRI brain imaging for video classification.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Learning to combine multi-resolution spatially-weighted co-occurrence matrices for image representation.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Compact projection: Simple and efficient near neighbor search with practical memory requirements.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Optimizing kd-trees for scalable visual descriptor indexing.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Content-aware Ranking for visual search.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Contextual image retrieval model.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Scalable clip-based near-duplicate video detection with ordinal measure.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2009
Video Content Structure.
Proceedings of the Encyclopedia of Database Systems, 2009

Video collage: presenting a video sequence using a single image.
Vis. Comput., 2009

Correlative Linear Neighborhood Propagation for Video Annotation.
IEEE Trans. Syst. Man Cybern. Part B, 2009

Scale-Invariant Visual Language Modeling for Object Categorization.
IEEE Trans. Multim., 2009

Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation.
IEEE Trans. Multim., 2009

Unified Video Annotation via Multigraph Learning.
IEEE Trans. Circuits Syst. Video Technol., 2009

VideoSense: A Contextual In-Video Advertising System.
IEEE Trans. Circuits Syst. Video Technol., 2009

Multigraph-Based Query-Independent Learning for Video Search.
IEEE Trans. Circuits Syst. Video Technol., 2009

Video semantic analysis based on structure-sensitive anisotropic manifold ranking.
Signal Process., 2009

Multi-video synopsis for video representation.
Signal Process., 2009

Combining global, regional and contextual features for automatic image annotation.
Pattern Recognit., 2009

Two-Dimensional Multilabel Active Learning with an Efficient Online Adaptation Model for Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Graph-based semi-supervised learning with multiple labels.
J. Vis. Commun. Image Represent., 2009

Semi-supervised kernel density estimation for video annotation.
Comput. Vis. Image Underst., 2009

Introduction to computer vision and image understanding the special issue on video analysis.
Comput. Vis. Image Underst., 2009

Learning to tag.
Proceedings of the 18th International Conference on World Wide Web, 2009

Tag ranking.
Proceedings of the 18th International Conference on World Wide Web, 2009

Gamesense.
Proceedings of the 18th International Conference on World Wide Web, 2009

Query sampling for ranking learning in web search.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Concept representation based video indexing.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Accommodating colorblind users in image search.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

CrowdReranking: exploring multiple search engines for visual search reranking.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

AdOn: an intelligent overlay video advertising system.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Graph-Based Pairwise Learning to Rank for Video Search.
Proceedings of the Advances in Multimedia Modeling, 2009

Multiple-Instance Active Learning for Image Categorization.
Proceedings of the Advances in Multimedia Modeling, 2009

Tag refinement by regularized LDA.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Accessible image search.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

NLVS: a near-lossless video summarization system.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Near-lossless video summarization.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Learning semantic distance from community-tagged media collection.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Smart batch tagging of photo albums.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Robust Distance Metric Learning with Auxiliary Knowledge.
Proceedings of the IJCAI 2009, 2009

Summarizing tagged image collections by cross-media representativeness voting.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Efficient image and video re-coloring for colorblindness.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Tag quality improvement for social images.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Boost search relevance for tag-based social image retrieval.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

MSRA-MM 2.0: A Large-Scale Web Multimedia Dataset.
Proceedings of the ICDM Workshops 2009, 2009

Image Search Result Summarization with Informative Priors.
Proceedings of the Computer Vision, 2009

Active Video Annotation.
Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

Image/Video Semantic Analysis by Semi-Supervised Learning.
Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

2008
Content-Based Multimedia Retrieval.
Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

Correlative multilabel video annotation with temporal kernels.
ACM Trans. Multim. Comput. Commun. Appl., 2008

Video Annotation Based on Kernel Linear Neighborhood Propagation.
IEEE Trans. Multim., 2008

Multi-Layer Multi-Instance Learning for Video Concept Detection.
IEEE Trans. Multim., 2008

Media Content Analysis.
Scholarpedia, 2008

Structure and event mining in sports video with efficient mosaic.
Multim. Tools Appl., 2008

Optimizing Training Set Construction for Video Semantic Classification.
EURASIP J. Adv. Signal Process., 2008

MSRA atT TRECVID 2008: High-Level Feature Extraction and Automatic Search.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

When multimedia advertising meets the new Internet era.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Free-Shaped Video Collage.
Proceedings of the Advances in Multimedia Modeling, 2008

MILC<sup>2</sup>: A Multi-Layer Multi-Instance Learning Approach to Video Concept Detection.
Proceedings of the Advances in Multimedia Modeling, 2008

Flickr distance.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Study on the combination of video concept detectors.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Bayesian video search reranking.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Contextual in-image advertising.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

ImageSense.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Annotating personal albums via web mining.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Finding image exemplars using fast sparse affinity propagation.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Online multi-label active annotation: towards large-scale content-based video search.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Transductive multi-label learning for video concept detection.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Optimizing video search reranking via minimum incremental information loss.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Collaborative learning for image and video annotation.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Graph-based semi-supervised learning with multi-label.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Optimized video scene segmentation.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Transductive video annotation via local learnable kernel classifier.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Automatic video annotation through search and mining.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Query-independent learning for video search.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Learning to video search rerank via pseudo preference feedback.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Unbiased active learning for image retrieval.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Smart video player.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Augmented tree partitioning for interactive image segmentation.
Proceedings of the International Conference on Image Processing, 2008

Video<sup>M</sup>: Multi-video Synopsis.
Proceedings of the Workshops Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Maximum Margin Clustering with Pairwise Constraints.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Joint multi-label multi-instance learning for image classification.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Normalized tree partitioning for image segmentation.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

A joint appearance-spatial distance for kernel-based image categorization.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Two-Dimensional Active Learning for image classification.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Coherent image annotation by learning semantic distance.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Modeling and Mining of Users' Capture Intention for Home Videos.
IEEE Trans. Multim., 2007

Home Video Visual Quality Assessment With Spatiotemporal Factors.
IEEE Trans. Circuits Syst. Video Technol., 2007

Interactive Video Annotation by Multi-Concept Multi-Modality Active Learning.
Int. J. Semantic Comput., 2007

MSRA-USTC-SJTU at TRECVID 2007: High-Level Feature Extraction and Search.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

VideoReach: an online video recommendation system.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

RMulti-Concept Multi-Modality Active Learning for Interactive Video Annotation.
Proceedings of the First IEEE International Conference on Semantic Computing (ICSC 2007), 2007

Kernel-Based Linear Neighborhood Propagation for Semantic Video Annotation.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Object-Sensitive Query Analysis for Video Search.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

An Efficient Automatic Video Shot Size Annotation Scheme.
Proceedings of the Advances in Multimedia Modeling, 2007

Video Histogram: A Novel Video Signature for Efficient Web Video Duplicate Detection.
Proceedings of the Advances in Multimedia Modeling, 2007

Refining video annotation by exploiting pairwise concurrent relation.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Optimizing multi-graph learning: towards a unified video annotation scheme.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Structure-sensitive manifold ranking for video concept detection.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Typicality ranking via semi-supervised multiple-instance learning.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Correlative multi-label video annotation.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

VideoSense: a contextual video advertising system.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

VideoSense: towards effective online video advertising.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Video collage.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Video search re-ranking via multi-graph propagation.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Multi-layer multi-instance kernel for video concept detection.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Building a comprehensive ontology to refine video concept detection.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Multi-modality web video categorization.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

A Novel Multiple Instance Learning Approach for Image Retrieval Based on Adaboost Feature Selection.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Video Collage: A Novel Presentation of Video Sequence.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Multi-Graph Semi-Supervised Learning for Video Semantic Feature Extraction.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Lazy Learning Based Efficient Video Annotation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Anisotropic Manifold Ranking for Video Annotation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Beyond Accuracy: Typicality Ranking for Video Annotation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Transductive Inference with Hierarchical Clustering for Video Annotation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

EMS: Energy Minimization Based Video Scene Segmentation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Temporally Consistent Gaussian Random Field for Video Semantic Analysis.
Proceedings of the International Conference on Image Processing, 2007

An Interactive Video Annotation Frameowrk with Multiple Modalities.
Proceedings of the IEEE International Conference on Acoustics, 2007

On Real-Time Detecting Duplicate Web Videos.
Proceedings of the IEEE International Conference on Acoustics, 2007

Concurrent Multiple Instance Learning for Image Categorization.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Online video recommendation based on multimodal fusion and relevance feedback.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
Photo2Video - A System for Automatically Converting Photographic Series Into Video.
IEEE Trans. Circuits Syst. Video Technol., 2006

Microsoft Research Asia TRECVID 2006 High-Level Feature Extraction and Rushes Exploitation.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

A semi-supervised incremental learning framework for sports video view classification.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

Manifold-ranking based video concept detection on large database and feature pool.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Automatic video annotation by semi-supervised learning with kernel density estimation.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

To construct optimal training set for video annotation.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Towards content-based relevance ranking for video search.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Interactive video authoring and sharing based on two-layer templates.
Proceedings of the 1st ACM international workshop on Human-centered multimedia, 2006

Efficient semantic annotation method for indexing large personal video database.
Proceedings of the 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2006

Automatic video annotation based on co-adaptation and label correction.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Enhanced Semi-Supervised Learning for Automatic Video Annotation.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Video Annotation by Active Learning and Semi-Supervised Ensembling.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Probabilistic Multimodality Fusion for Event based Home Photo Clustering.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Automatic Video Genre Categorization using Hierarchical SVM.
Proceedings of the International Conference on Image Processing, 2006

Semi-Supervised Kernel Regression.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

An Automatic Video Semantic Annotation Scheme Based on Combination of Complementary Predictors.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Video Annotation by Active Learning and Cluster Tuning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

2005
A generic framework of user attention model and its application in video summarization.
IEEE Trans. Multim., 2005

Natural video browsing.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Spatio-temporal quality assessment for home videos.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Tracking users' capture intention: a novel complementary view for home video content analysis.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Intention-based home video browsing.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

To learn representativeness of video frames.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

LazyCut: content-aware template-based video authoring.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Personal media sharing and authoring on the web.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Video booklet: a natural video searching and browsing interface.
Proceedings of the 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

Tracking concept drifting with an online-optimized incremental learning framework.
Proceedings of the 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

Semi-automatic video annotation based on active learning with multiple complementary predictors.
Proceedings of the 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

Online End Detection for Live-Broadcast Sports TV Programs.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Camera notes.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Video booklet.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Robust learning-based TV commercial detection.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Efficient video mosaicing based on motion analysis.
Proceedings of the 2005 International Conference on Image Processing, 2005

2004
An automatic performance evaluation protocol for video text detection algorithms.
IEEE Trans. Circuits Syst. Video Technol., 2004

Optimization-based automated home video editing system.
IEEE Trans. Circuits Syst. Video Technol., 2004

An Online Learning Framework for Sports Video View Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

An Attention-Based Decision Fusion Scheme for Multimedia Information Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Online Play Segmentation for Broadcasted American Football TV Programs.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

An online-optimized incremental learning framework for video semantic classification.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Automatically converting otograic series into video.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Automatic music video generation based on temporal pattern analysis.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

P-Karaoke: personalized karaoke system.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Content and transformation effect matching for automated home video editing.
Proceedings of the 2004 International Conference on Image Processing, 2004

Robust video signature based on ordinal measure.
Proceedings of the 2004 International Conference on Image Processing, 2004

2003
Photo2Video.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

AVE: automated home video editing.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Content based photograph slide show with incidental music.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

2002
Efficient video text recognition using multiple frame integration.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
Automatic location of text in video frames.
Proceedings of the 2001 ACM workshops on Multimedia: multimedia information retrieval, Ottawa, ON, Canada, September 30, 2001

A Video Text Detection And Recognition System.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Automatic Performance Evaluation for Video Text Detection.
Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR 2001), 2001


  Loading...