Ling-Yu Duan

CoRR, May, 2025

Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior.

IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

Bridging the Source-to-Target Gap for Cross-Domain Person Re-identification with Intermediate Domains.

Int. J. Comput. Vis., January, 2025

Beyond Entropy: Region Confidence Proxy for Wild Test-Time Adaptation.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Adaptive Gradient Quantization with Bit Allocation for Distributed Deep Learning.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Which Tasks Should Be Compressed Together? A Causal Discovery Approach for Efficient Multi-Task Representation Compression.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

HARDer-Net: Hardness-Guided Discrimination Network for 3D Early Activity Prediction.

IEEE Trans. Circuits Syst. Video Technol., December, 2024

Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics.

IEEE Trans. Pattern Anal. Mach. Intell., 2024

Amodal Segmentation for Laparoscopic Surgery Video Instruments.

CoRR, 2024

Coding for Intelligence from the Perspective of Category.

CoRR, 2024

Transferable Adversarial Attacks on SAM and Its Downstream Models.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

A Unified Image Compression Method for Human Perception and Multiple Vision Tasks.

Proceedings of the Computer Vision - ECCV 2024, 2024

LEAD: Exploring Logit Space Evolution for Model Selection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Seeing Dark Videos via Self-Learned Bottleneck Neural Representation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Coarse-to-fine Disentangling Demoiréing Framework for Recaptured Screen Images.

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

PS-Net: human perception-guided segmentation network for EM cell membrane.

Bioinform., August, 2023

Background Scene Recovery From an Image Looking Through Colored Glass.

IEEE Trans. Multim., 2023

Purifying Low-Light Images via Near-Infrared Enlightened Image.

IEEE Trans. Multim., 2023

Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning.

IEEE Trans. Multim., 2023

Benchmarking Single-Image Reflection Removal Algorithms.

IEEE Trans. Pattern Anal. Mach. Intell., 2023

Modeling Uncertain Feature Representation for Domain Generalization.

CoRR, 2023

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Exploring Model Transferability through the Lens of Potential Energy.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Switchable Representation Learning Framework with Self-Compatibility.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

$A^3$-FKG: Attentive Attribute-Aware Fashion Knowledge Graph for Outfit Preference Prediction.

IEEE Trans. Multim., 2022

Astute Video Transmission for Geographically Dispersed Devices in Visual IoT Systems.

IEEE Trans. Mob. Comput., 2022

Intrinsic Performance Influence-based Participant Contribution Estimation for Horizontal Federated Learning.

ACM Trans. Intell. Syst. Technol., 2022

Towards Low Light Enhancement With RAW Images.

IEEE Trans. Image Process., 2022

Disentangled Feature Learning Network and a Comprehensive Benchmark for Vehicle Re-Identification.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Switchable Representation Learning Framework with Self-compatibility.

CoRR, 2022

Bridging the Source-to-target Gap for Cross-domain Person Re-Identification with Intermediate Domains.

CoRR, 2022

Nonlinear Multi-Model Reuse.

Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Collaborative Scalable Visual Compression for Human-Centered Videos.

Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Uncertainty Modeling for Out-of-Distribution Generalization.

Proceedings of the Tenth International Conference on Learning Representations, 2022

mc-BEiT: Multi-choice Discretization for Image BERT Pre-training.

Proceedings of the Computer Vision - ECCV 2022, 2022

Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neighborhood Consensus Contrastive Learning for Backward-Compatible Representation.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Attribute-wise Explainable Fashion Compatibility Modeling.

ACM Trans. Multim. Comput. Commun. Appl., 2021

Market2Dish: Health-aware Food Recommendation.

ACM Trans. Multim. Comput. Commun. Appl., 2021

Pose-Normalized and Appearance-Preserved Street-to-Shop Clothing Image Generation and Feature Learning.

IEEE Trans. Multim., 2021

Towards Coding for Human and Machine Vision: Scalable Face Image Coding.

IEEE Trans. Multim., 2021

Dual-Refinement: Joint Label and Feature Refinement for Unsupervised Domain Adaptive Person Re-Identification.

IEEE Trans. Image Process., 2021

Hierarchical Connectivity-Centered Clustering for Unsupervised Domain Adaptation on Person Re-Identification.

IEEE Trans. Image Process., 2021

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding.

IEEE Trans. Circuits Syst. Video Technol., 2021

Towards Large-Scale Object Instance Search: A Multi-Block N-Ary Trie.

IEEE Trans. Circuits Syst. Video Technol., 2021

Face Image Reflection Removal.

Int. J. Comput. Vis., 2021

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics.

CoRR, 2021

Person Retrieval with Conv-Transformer.

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Federated Learning for Non-IID Data via Unified Feature Learning and Optimization Objective Alignment.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Single Image Reflection Removal With Absorption Effect.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generalizable Person Re-Identification With Relevance-Aware Mixture of Experts.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Person30K: A Dual-Meta Generalization Network for Person Re-Identification.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm.

IEEE Trans. Multim., 2020

Iterative Local-Global Collaboration Learning Towards One-Shot Video Person Re-Identification.

IEEE Trans. Image Process., 2020

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics.

IEEE Trans. Image Process., 2020

Toward Intelligent Sensing: Intermediate Deep Feature Compression.

IEEE Trans. Image Process., 2020

CoRRN: Cooperative Reflection Removal Network.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Skeleton-Based Online Action Prediction Using Scale Selection Network.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Feature Boosting Network For 3D Pose Estimation.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Deep Variational and Structural Hashing.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

JDNet: A Joint-Learning Distilled Network for Mobile Visual Food Recognition.

IEEE J. Sel. Top. Signal Process., 2020

Key-Point Sequence Lossless Compression for Intelligent Video Analysis.

Tushar Shankar Shinde

Hongkai Xiong

IEEE Multim., 2020

Network Update Compression for Federated Learning.

Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Pose-native Network Architecture Search for Multi-person Human Pose Estimation.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Disentangled Feature Learning Network for Vehicle Re-Identification.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

An Emerging Coding Paradigm VCM: A Scalable Coding Approach Beyond Feature and Signal.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Towards Coding For Human And Machine Vision: A Scalable Image Coding Approach.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Extending Hashing Towards Fast Re-Identification.

Proceedings of the IEEE International Conference on Image Processing, 2020

Data Representation in Hybrid Coding Framework for Feature Maps Compression.

Proceedings of the IEEE International Conference on Image Processing, 2020

Deep Product Quantization Module for Efficient Image Retrieval.

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

Classes Matter: A Fine-Grained Adversarial Approach to Cross-Domain Semantic Segmentation.

Proceedings of the Computer Vision - ECCV 2020, 2020

HARD-Net: Hardness-AwaRe Discrimination Network for 3D Early Activity Prediction.

Proceedings of the Computer Vision - ECCV 2020, 2020

FHDe²Net: Full High Definition Demoireing Network.

Proceedings of the Computer Vision - ECCV 2020, 2020

What Does Plate Glass Reveal About Camera Calibration?

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Reflection Scene Separation From a Single Image.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Codebook-Free Compact Descriptor for Scalable Visual Search.

IEEE Trans. Multim., 2019

Unified Spatio-Temporal Attention Networks for Action Recognition in Videos.

IEEE Trans. Multim., 2019

Embedding Adversarial Learning for Vehicle Re-Identification.

IEEE Trans. Image Process., 2019

Robust Distracter-Resistive Tracker via Learning a Multi-Component Discriminative Dictionary.

IEEE Trans. Circuits Syst. Video Technol., 2019

Multi-scale Optimal Fusion model for single image dehazing.

Signal Process. Image Commun., 2019

Learning to remove reflections from windshield images.

Ce Wang

Boxin Shi

Signal Process. Image Commun., 2019

Deep Residual Network Based HEVC Compressed Videos Enhancement (in Chinese).

Xiaoyi He

Sreyasee Das Bhattacharjee

Weiyao Lin

Computer Science (计算机科学), 2019

Front-End Smart Visual Sensing and Back-End Intelligent Analysis: A Unified Infrastructure for Economizing the Visual System of City Brain.

IEEE J. Sel. Areas Commun., 2019

Toward Knowledge as a Service Over Networks: A Deep Learning Model Communication Paradigm.

IEEE J. Sel. Areas Commun., 2019

IDeRs: Iterative dehazing method for single remote sensing image.

Inf. Sci., 2019

AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond.

IEEE Multim., 2019

Compact Descriptors for Video Analysis: The Emerging MPEG Standard.

IEEE Multim., 2019

DeepShoe: An improved Multi-Task View-invariant CNN for street-to-shop shoe retrieval.

Comput. Vis. Image Underst., 2019

Hard-Aware Fashion Attribute Classification.

CoRR, 2019

Signal-Independent Separable KLT by Offline Training for Video Coding.

IEEE Access, 2019

Toward Intelligent Visual Sensing and Low-cost Analysis: A Collaborative Computing Approach.

Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

See Through the Windshield from Surveillance Camera.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Adaptive Feature Fusion via Graph Neural Network for Person Re-identification.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Market2Dish: A Health-aware Food Recommendation System.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Lossy Intermediate Deep Learning Feature Compression and Evaluation.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Few-Shot and Many-Shot Fusion Learning in Mobile Visual Food Recognition.

Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

From Market to Dish: Multi-ingredient Image Recognition for Personalized Recipe Recommendation.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Learning to Remove Reflections for Text Images.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Towards Digital Retina in Smart Cities: A Model Generation, Utilization and Communication Paradigm.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Incorporating Category Taxonomy in Deep Reinforcement Learning Based Image Hashing.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Denoising Adversarial Networks for Rain Removal and Reflection Removal.

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Fashion Recommendation on Street Images.

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

SPLINE-Net: Sparse Photometric Stereo Through Lighting Interpolation and Normal Estimation Networks.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning to Jointly Generate and Separate Reflections.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sampling Wisely: Deep Image Embedding by Top-K Precision Optimization.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Mop Moiré Patterns Using MopNet.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Separable KLT for Intra Coding in Versatile Video Coding (VVC).

Proceedings of the Data Compression Conference, 2019

VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Towards Accurate One-Stage Object Detection With AP-Loss.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exploring Object Relation in Mean Teacher for Cross-Domain Detection.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Data-Driven Lightweight Interest Point Selection for Large-Scale Visual Search.

IEEE Trans. Multim., 2018

Toward Intelligent Product Retrieval for TV-to-Online (T2O) Application: A Transfer Metric Learning Approach.

IEEE Trans. Multim., 2018

Query Adaptive Multiview Object Instance Search and Localization Using Sketches.

IEEE Trans. Multim., 2018

Group-Sensitive Triplet Embedding for Vehicle Reidentification.

IEEE Trans. Multim., 2018

Region-Aware Reflection Removal With Unified Content and Gradient Priors.

IEEE Trans. Image Process., 2018

Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks.

IEEE Trans. Image Process., 2018

Minimizing Reconstruction Bias Hashing via Joint Projection Learning and Quantization.

IEEE Trans. Image Process., 2018

Fast MPEG-CDVS Encoder With GPU-CPU Hybrid Computing.

IEEE Trans. Image Process., 2018

Rate-Distortion Optimized Sparse Coding With Ordered Dictionary for Image Set Compression.

IEEE Trans. Circuits Syst. Video Technol., 2018

Transfer Metric Learning: Algorithms, Applications and Outlooks.

CoRR, 2018

Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing.

CoRR, 2018

Tracklet Siamese Network with Constrained Clustering for Multiple Object Tracking.

Proceedings of the IEEE Visual Communications and Image Processing, 2018

Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Depth Structure Preserving Scene Image Generation.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Multi-Scale Context Attention Network for Image Retrieval.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Unified Generative Adversarial Framework for Image Generation and Person Re-identification.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

ChipGAN: A Generative Adversarial Network for Chinese Ink Wash Painting Style Transfer.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

From Data to Knowledge: Deep Learning Model Compression, Transmission and Communication.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Gated Square-Root Pooling for Image Instance Retrieval.

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

CRRN: Multi-Scale Guided Concurrent Reflection Removal Network.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

SSNet: Scale Selection Network for Online 3D Action Prediction.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

HNIP: Compact Deep Invariant Representations for Video Matching, Localization, and Retrieval.

IEEE Trans. Multim., 2017

Pruning Convolutional Neural Networks for Image Instance Retrieval.

CoRR, 2017

Skeleton Based Human Action Recognition with Global Context-Aware Attention LSTM Networks.

CoRR, 2017

Fast MPEG-CDVS Encoder with GPU-CPU Hybrid Computing.

CoRR, 2017

Incorporating Intra-Class Variance to Fine-Grained Visual Recognition.

CoRR, 2017

From Part to Whole: Who is Behind the Painting?

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Nested Invariance Pooling and RBM Hashing for Image Instance Retrieval.

Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

DeepHash for Image Instance Retrieval: Getting Regularization, Depth and Fine-Tuning Right.

Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Improving object detection with region similarity learning.

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Incorporating intra-class variance to fine-grained visual recognition.

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

GPU Based fast MPEG-CDVS encoder.

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A Multi-Block N-ary trie structure for exact r-neighbour search in hamming space.

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Deep regional feature pooling for video matching.

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Benchmarking Single-Image Reflection Removal Algorithms.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Compact Deep Invariant Descriptors for Video Retrieval.

Proceedings of the 2017 Data Compression Conference, 2017

Compression of Deep Neural Networks for Image Instance Retrieval.

Proceedings of the 2017 Data Compression Conference, 2017

Global Context-Aware Attention LSTM Networks for 3D Action Recognition.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Query-Adaptive Small Object Search Using Object Proposals and Shape-Aware Descriptors.

Sreyasee Das Bhattacharjee

Junsong Yuan

Yap-Peng Tan

IEEE Trans. Multim., 2016

Overview of the MPEG-CDVS Standard.

IEEE Trans. Image Process., 2016

A Compact Binary Aggregated Descriptor via Dual Selection for Visual Search.

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

To Project More or to Quantize More: Minimize Reconstruction Bias for Learning Compact Binary Codes.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Smart query expansion scheme for CDVS based on illumination and key features.

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Two-stage pooling of deep convolutional features for image retrieval.

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Depth-based local feature selection for mobile visual search.

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Selectively Aggregated Fisher Vectors of Query Video for Mobile Visual Search.

Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Adaptive Weighted Matching of Deep Convolutional Features for Painting Retrieval.

Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Affinity Preserving Quantization for Hashing: A Vector Quantization Approach to Learn Compact Binary Codes.

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Weighted Component Hashing of Binary Aggregated Descriptors for Fast Visual Search.

IEEE Trans. Multim., 2015

Depth-Preserving Warping for Stereo Image Retargeting.

IEEE Trans. Image Process., 2015

A Low Complexity Interest Point Detector.

IEEE Signal Process. Lett., 2015

Finding the Secret of Image Saliency in the Frequency Domain.

IEEE Trans. Pattern Anal. Mach. Intell., 2015

Efficient image retrieval based mobile indoor localization.

Proceedings of the 2015 Visual Communications and Image Processing, 2015

Query-Adaptive Logo Search using Shape-Aware Descriptors.

Junsong Yuan

Yap-Peng Tan

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Hamming Compatible Quantization for Hashing.

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Hierarchical multi-VLAD for image retrieval.

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

An efficient coding framework for compact descriptors extracted from video sequence.

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Optimizing Binary Fisher Codes for Visual Search.

Proceedings of the 2015 Data Compression Conference, 2015

Overview of the MPEG CDVS Standard.

Tiejun Huang

Wen Gao

Proceedings of the 2015 Data Compression Conference, 2015

Real-Time Tracking with Selective DoP-RIEF Features for Augmented Reality.

Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

2014

Towards Mobile Document Image Retrieval for Digital Library.

IEEE Trans. Multim., 2014

Spatiotemporal Grid Flow for Video Retargeting.

IEEE Trans. Image Process., 2014

Mining Compact Bag-of-Patterns for Low Bit Rate Mobile Visual Search.

IEEE Trans. Image Process., 2014

Interactive ads recommendation with contextual search on product topic space.

Multim. Tools Appl., 2014

Compact Descriptors for Visual Search.

IEEE Multim., 2014

Component hashing of variable-length binary aggregated descriptors for fast image search.

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Joint optimization of JPEG quantization table and coefficient thresholding for low bitrate mobile visual search.

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Region-based depth-preserving stereoscopic image retargeting.

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013

Learning to Distribute Vocabulary Indexing for Scalable Visual Search.

IEEE Trans. Multim., 2013

Estimating Visual Saliency Through Single Image Optimization.

IEEE Signal Process. Lett., 2013

Learning from mobile contexts to minimize the mobile location search latency.

Signal Process. Image Commun., 2013

Learning Compact Visual Descriptors for Low Bit Rate Mobile Landmark Search.

AI Mag., 2013

A local shape descriptor for mobile linedrawing retrieval.

Yucong Xuan

Tiejun Huang

Proceedings of the 2013 Visual Communications and Image Processing, 2013

An Error Resilient Depth Map Coding Scheme Using Adaptive Wyner-Ziv Frame.

Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Mobile media communication, processing, and analysis: A review of recent advances.

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Compact descriptors for mobile visual search and MPEG CDVS standardization.

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

A novel pair-wise image matching strategy with compact descriptors.

Proceedings of the IEEE International Conference on Image Processing, 2013

Robust fisher codes for large scale image retrieval.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

On the interoperability of local descriptors compression.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

A hybrid pixel-block based view synthesis for multiviewpoint 3D video.

Proceedings of the 3DTV-Conference 2013: The True Vision, 2013

2012

A Generic Approach for Systematic Analysis of Sports Videos.

ACM Trans. Intell. Syst. Technol., 2012

Group-Sensitive Multiple Kernel Learning for Object Recognition.

IEEE Trans. Image Process., 2012

Location Discriminative Vocabulary Coding for Mobile Landmark Search.

Int. J. Comput. Vis., 2012

Optimizing JPEG quantization table for low bit rate mobile visual search.

Proceedings of the 2012 Visual Communications and Image Processing, 2012

Motion Based Perceptual Distortion and Rate Optimization for Video Coding.

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Social Image Tagging by Mining Sparse Tag Patterns from Auxiliary Data.

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Learning sparse tag patterns for social image classification.

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Weakly supervised topic grouping of YouTube search results.

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Multi-stage vector quantization towards low bit rate visual search.

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Allocating images and selecting image collections for distributed visual search.

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

PQ-WGLOH: A bit-rate scalable local feature descriptor.

Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Learning multiple codebooks for low bit rate mobile visual search.

Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Predicting the effectiveness of queries for visual search.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Pruning tree-structured vector quantizer towards low bit rate mobile visual search.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Towards compact topical descriptors.

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Grid-Based Retargeting with Transformation Consistency Smoothing.

Proceedings of the Advances in Multimedia Modeling, 2011

Towards low bit rate mobile visual search with multiple-channel coding.

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning Compact Visual Descriptor for Low Bit Rate Mobile Landmark Search.

Proceedings of the IJCAI 2011, 2011

Fast retargeting with adaptive grid optimization.

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Learning the trip suggestion from landmark photos on the web.

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

PKUBench: A context rich mobile visual search benchmark.

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Generating vocabulary for global feature representation towards commerce image retrieval.

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

When codeword frequency meets geographical location.

Proceedings of the IEEE International Conference on Acoustics, 2011

A low bit rate vocabulary coding scheme for mobile landmark search.

Proceedings of the IEEE International Conference on Acoustics, 2011

Sorting local descriptors for low bit rate mobile visual search.

Proceedings of the IEEE International Conference on Acoustics, 2011

Topic level sampling towards optimized locality sensitive vocabulary coding.

Proceedings of the 8th International Conference on Information, 2011

2010

Sequence Multi-Labeling: A Unified Video Annotation Scheme With Spatial and Temporal Context.

IEEE Trans. Multim., 2010

Per-Sample Multiple Kernel Approach for Visual Concept Learning.

EURASIP J. Image Video Process., 2010

AdVR: Linking Ad Video with Products or Service.

Proceedings of the Advances in Multimedia Modeling, 2010

Saliency detection based on 2D log-gabor wavelets and center bias.

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Video retargeting with multi-scale trajectory optimization.

Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Interactive Web Video Advertising with Context Analysis and Search.

Proceedings of the 20th International Conference on Pattern Recognition, 2010

ESUR: A system for Events detection in SURveillance video.

Proceedings of the International Conference on Image Processing, 2010

Interactive service recommendation based on ad concept hierarchy.

Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Automatic video genre categorization and event detection techniques on large-scale sports data.

Proceedings of the 2010 conference of the Centre for Advanced Studies on Collaborative Research, 2010

2009

A New Multiple Kernel Approach for Visual Concept Learning.

Proceedings of the Advances in Multimedia Modeling, 2009

Sports video retargeting.

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Consumer video retargeting: context assisted spatial-temporal grid optimization.

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Automatic sports genre categorization and view-type classification over large-scale dataset.

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Multiple kernel active learning for image classification.

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Linking video ADS with product or service information by web search.

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A generic approach to classify sports video shots and its application in event detection.

Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Semantic Linking between Video Ads and Web Services with Progressive Search.

Proceedings of the ICDM Workshops 2009, 2009

Group-sensitive multiple kernel learning for object categorization.

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

2008

Audio keywords generation for sports video analysis.

ACM Trans. Multim. Comput. Commun. Appl., 2008

A Multimodal Scheme for Program Segmentation and Representation in Broadcast Video Streams.

IEEE Trans. Multim., 2008

Digesting Commercial Clips from TV Streams.

IEEE Multim., 2008

Hierarchical movie affective content analysis based on arousal and valence features.

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Personalization of media and its attention service applications.

Changsheng Xu

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007

An algorithm to estimate mean vehicle speed from MPEG Skycam video.

Multim. Tools Appl., 2007

Automatic TV Logo Detection, Tracking and Removal in Broadcast Video.

Proceedings of the Advances in Multimedia Modeling, 2007

TV ad video categorization with probabilistic latent concept learning.

Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Robust Commercial Retrieval in Video Streams.

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006

Nonparametric motion characterization for robust classification of camera motion patterns.

IEEE Trans. Multim., 2006

A Semantic Image Category for Structuring TV Broadcast Video Streams.

Proceedings of the Advances in Multimedia Information Processing, 2006

Live sports event detection based on broadcast video and web-casting text.

Proceedings of the 14th ACM International Conference on Multimedia, 2006

Segmentation, categorization, and identification of commercial clips from TV streams using multimodal analysis.

Proceedings of the 14th ACM International Conference on Multimedia, 2006

Local Motion Analysis and Its Application in Video based Swimming Style Recognition.

Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

TV Commercial Classification by using Multi-Modal Textual Information.

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Robust Method for TV Logo Tracking in Video Streams.

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Mid-Level Scene Change Representation Via Audiovisual Alignment.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

A unified framework for semantic shot classification in sports video.

IEEE Trans. Multim., 2005

Shot-Level Camera Motion Estimation Based on a Parametric Model.

Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Automatic generation of personalized music sports video.

Proceedings of the 13th ACM International Conference on Multimedia, 2005

A unified framework for semantic shot representation of sports video.

Proceedings of the 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

Periodicity Detection of Local Motion.

Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

A Mid-level Visual Concept Generation Framework for Sports Analysis.

Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Replay Scene Classification in Soccer Video Using Web Broadcast Text.

Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004

Fast and Robust Short Video Clip Search for Copy Detection.

Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

HMM-Based Audio Keyword Generation.

Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Audio keyword generation for sports video analysis.

Proceedings of the 12th ACM International Conference on Multimedia, 2004

Fast and robust video clip search using index structure.

Proceedings of the 12th ACM International Conference on Multimedia, 2004

Nonparametric motion model.

Proceedings of the 12th ACM International Conference on Multimedia, 2004

Nonparametric motion model with applications to camera motion pattern classification.

Proceedings of the 12th ACM International Conference on Multimedia, 2004

Fast and robust short video clip search using an index structure.

Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

Mean shift based nonparametric motion characterization.

Proceedings of the 2004 International Conference on Image Processing, 2004

Mean shift based video segment representation and applications to replay detection.

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Semantic Shot Classification in Sports Video.

Min Xu

Proceedings of the Storage and Retrieval for Media Databases 2003, 2003

Nonparametric color characterization using mean shift.

Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

A mid-level representation framework for semantic sports video analysis.

Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Robust moving video object segmentation in the MPEG compressed domain.

Xiao-Dong Yu

Proceedings of the 2003 International Conference on Image Processing, 2003

A fusion scheme of visual and auditory modalities for event detection in sports video.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Shot Classification of Sports Video Based on Features in Motion Vector Field.

Xiao-Dong Yu

Proceedings of the Advances in Multimedia Information Processing, 2002

Foreground Segmentation Using Motion Vectors in Sports Video.

Proceedings of the Advances in Multimedia Information Processing, 2002

A unified framework for semantic shot classification in sports videos.

Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Clear face analysis from MPEG compressed video.
