Weiming Hu

Orcid: 0000-0001-9237-8825

Affiliations:
  • ShanghaiTech University, Shanghai, China


According to our database, Weiming Hu authored at least 377 papers between 2001 and 2026.

Bibliography

2026
MSFI: Multi-timescale spatio-temporal features integration in spiking neural networks.
Neural Networks, 2026

2025
FiGVCL: Fine-Grained Benchmark and Method for Video Copy Localization.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

Mitigating Hallucinations in Large Vision-Language Models by Self-Injecting Hallucinations.
CoRR, September, 2025

Task-Aware Attentional Dynamic Alignment for Few-Shot Compressed Video Classification.
IEEE Trans. Circuits Syst. Video Technol., August, 2025

LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training.
CoRR, August, 2025

An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-training.
Int. J. Comput. Vis., July, 2025

CST Anti-UAV: A Thermal Infrared Benchmark for Tiny UAV Tracking in Complex Scenes.
CoRR, July, 2025

iESTA: Instance-Enhanced Spatial-Temporal Alignment for Video Copy Localization.
IEEE Trans. Circuits Syst. Video Technol., May, 2025

DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval.
CoRR, May, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment.
CoRR, May, 2025

PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments.
CoRR, February, 2025

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC.
CoRR, February, 2025

Content-Decoupled Contrastive Learning-Based Implicit Degradation Modeling for Blind Image Super-Resolution.
IEEE Trans. Image Process., 2025

Self-Supervised Monocular Depth Estimation With Dual-Path Encoders and Offset Field Interpolation.
IEEE Trans. Image Process., 2025

HiFusion: An Unsupervised Infrared and Visible Image Fusion Framework With a Hierarchical Loss Function.
IEEE Trans. Instrum. Meas., 2025

DABF-Net: A Dual-Branch Attention-Guided and Bi-Directional Feature Enhancement Network for Infrared Small-Target Detection With Air-to-Ground Benchmark.
IEEE Trans. Geosci. Remote. Sens., 2025

FRANet: A Feature Refinement Attention Network for SAR Image Denoising.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2025

Two-stream transformer tracking with messengers.
Image Vis. Comput., 2025

Reversing Flow for Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Visual-Instructed Degradation Diffusion for All-in-One Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024
One-Stage Anchor-Free Online Multiple Target Tracking With Deformable Local Attention and Task-Aware Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

ACR-Net: Learning High-Accuracy Optical Flow via Adaptive-Aware Correlation Recurrent Network.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Cross-Architecture Knowledge Distillation.
Int. J. Comput. Vis., August, 2024

Joint Learning of Audio-Visual Saliency Prediction and Sound Source Localization on Multi-face Videos.
Int. J. Comput. Vis., June, 2024

DCFNet: Discriminant Correlation Filters Network for Visual Tracking.
J. Comput. Sci. Technol., May, 2024

DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation.
IEEE Trans. Circuits Syst. Video Technol., January, 2024

Chinese Title Generation for Short Videos: Dataset, Metric and Algorithm.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

mR²AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA.
CoRR, 2024

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning.
CoRR, 2024

HSTrack: Bootstrap End-to-End Multi-Camera 3D Multi-object Tracking with Hybrid Supervision.
CoRR, 2024

Token Caching for Diffusion Transformer Acceleration.
CoRR, 2024

Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking.
CoRR, 2024

Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training.
CoRR, 2024

BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues.
CoRR, 2024

NFT1000: A Visual Text Dataset For Non-Fungible Token Retrieval.
CoRR, 2024

GMC-IQA: Exploiting Global-correlation and Mean-opinion Consistency for No-reference Image Quality Assessment.
CoRR, 2024

I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

NFT1000: A Cross-Modal Dataset For Non-Fungible Token Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

BEV²PR: BEV-Enhanced Visual Place Recognition with Structural Cues.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Learn from Noise: Detecting Deepfakes via Regional Noise Consistency.
Proceedings of the International Joint Conference on Neural Networks, 2024

Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Real-Time Monocular Depth Estimation on Embedded Systems.
Proceedings of the IEEE International Conference on Image Processing, 2024

MIBench: Evaluating Multimodal Large Language Models over Multiple Images.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians.
Proceedings of the Computer Vision - ECCV 2024, 2024

EA-VTR: Event-Aware Video-Text Retrieval.
Proceedings of the Computer Vision - ECCV 2024, 2024

MobileIQA: Exploiting Mobile-Level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts.
Proceedings of the Computer Vision - ECCV 2024, 2024

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Self-Prior Guided Pixel Adversarial Networks for Blind Image Inpainting.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Ranking-Based Color Constancy With Limited Training Samples.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Jointing Recurrent Across-Channel and Spatial Attention for Multi-Object Tracking With Block-Erasing Data Augmentation.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Dynamic adjustment of hyperparameters for anchor-based detection of objects with large image size differences.
Pattern Recognit. Lett., March, 2023

Learning to Explore Distillability and Sparsability: A Joint Framework for Model Compression.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

SiamMask: A Framework for Fast Online Object Tracking and Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Multi-scale self-attention-based feature enhancement for detection of targets with small image sizes.
Pattern Recognit. Lett., February, 2023

Learning Video-Text Aligned Representations for Video Captioning.
ACM Trans. Multim. Comput. Commun. Appl., 2023

A Robust Infrared Small Target Detection Method Jointing Multiple Information and Noise Prediction: Algorithm and Benchmark.
IEEE Trans. Geosci. Remote. Sens., 2023

Circle-Net: An Unsupervised Lightweight-Attention Cyclic Network for Hyperspectral and Multispectral Image Fusion.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

Hierarchical Curriculum Learning for No-Reference Image Quality Assessment.
Int. J. Comput. Vis., 2023

I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models.
CoRR, 2023

RT-MonoDepth: Real-time Monocular Depth Estimation on Embedded Systems.
CoRR, 2023

Exploiting Contextual Objects and Relations for 3D Visual Grounding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

A Closer Look at Self-Supervised Lightweight Vision Transformers.
Proceedings of the International Conference on Machine Learning, 2023

Order-Prompted Tag Sequence Generation for Video Tagging.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning from the Raw Domain: Cross Modality Distillation for Compressed Video Action Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Video Tiny-Object Detection Guided by the Spatial-Temporal Motion Information.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Exploit the Sequence-Specific Prior Knowledge for Image Processing Pipelines Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ViLEM: Visual-Language Error Modeling for Image-Text Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AUNet: Learning Relations Between Action Units for Face Forgery Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Self-Attention-Based Multiscale Feature Learning Optical Flow With Occlusion Feature Map Prediction.
IEEE Trans. Multim., 2022

3DCANN: A Spatio-Temporal Convolution Attention Neural Network for EEG Emotion Recognition.
IEEE J. Biomed. Health Informatics, 2022

PDNet: Toward Better One-Stage Object Detection With Prediction Decoupling.
IEEE Trans. Image Process., 2022

Narrowing the Gap: Improved Detector Training With Noisy Location Annotations.
IEEE Trans. Image Process., 2022

Rethinking the Competition Between Detection and ReID in Multiobject Tracking.
IEEE Trans. Image Process., 2022

SSAU-Net: A Spectral-Spatial Attention-Based U-Net for Hyperspectral Image Fusion.
IEEE Trans. Geosci. Remote. Sens., 2022

MRDDANet: A Multiscale Residual Dense Dual Attention Network for SAR Image Denoising.
IEEE Trans. Geosci. Remote. Sens., 2022

A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2022

SDTP: Semantic-Aware Decoupled Transformer Pyramid for Dense Image Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2022

Parallel multiscale context-based edge-preserving optical flow estimation with occlusion detection.
Signal Process. Image Commun., 2022

Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

PIC 4th Challenge: Semantic-Assisted Multi-Feature Encoding and Multi-Head Decoding for Dense Video Captioning.
CoRR, 2022

PolarFormer: Multi-camera 3D Object Detection with Polar Transformers.
CoRR, 2022

CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation.
CoRR, 2022

Long-Short Term Cross-Transformer in Compressed Domain for Few-Shot Video Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Learning Target-aware Representation for Visual Tracking via Informative Interactions.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Inter-Intra Cross-Modality Self-Supervised Video Representation Learning by Contrastive Clustering.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Learnable Pixel Clustering Via Structure and Semantic Dual Constraints for Unsupervised Image Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines.
Proceedings of the Computer Vision - ECCV 2022, 2022

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Exploring Motion Information for Distractor Suppression in Visual Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Cross-Architecture Knowledge Distillation.
Proceedings of the Computer Vision - ACCV 2022, 2022

Teacher-Guided Learning for Blind Image Quality Assessment.
Proceedings of the Computer Vision - ACCV 2022, 2022

One More Check: Making "Fake Background" Be Tracked Again.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression.
IEEE Trans. Neural Networks Learn. Syst., 2021

Toward Accurate Pixelwise Object Tracking via Attention Retrieval.
IEEE Trans. Image Process., 2021

Multi-Scale Low-Discriminative Feature Reactivation for Weakly Supervised Object Localization.
IEEE Trans. Image Process., 2021

Robust Texture-Aware Computer-Generated Image Forensic: Benchmark and Algorithm.
IEEE Trans. Image Process., 2021

Web Objectionable Video Recognition Based on Deep Multi-Instance Learning With Representative Prototypes Selection.
IEEE Trans. Circuits Syst. Video Technol., 2021

UMAG-Net: A New Unsupervised Multiattention-Guided Network for Hyperspectral and Multispectral Image Fusion.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2021

Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos.
CoRR, 2021

SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction.
CoRR, 2021

PDNet: Towards Better One-stage Object Detection with Prediction Decoupling.
CoRR, 2021

One More Check: Making "Fake Background" Be Tracked Again.
CoRR, 2021

Adaptive Coarse-to-Fine Interactor for Multi-Scale Object Detection.
Proceedings of the International Joint Conference on Neural Networks, 2021

DSIC: Dynamic Sample-Individualized Connector for Multi-Scale Object Detection.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Learn to Match: Automatic Matching Network Design for Visual Tracking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Practical Face Swapping Detection Based on Identity Spatial Constraints.
Proceedings of the International IEEE Joint Conference on Biometrics, 2021

Open-Book Video Captioning With Retrieve-Copy-Generate Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DPFPS: Dynamic and Progressive Filter Pruning for Compressing Convolutional Neural Networks from Scratch.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Anomaly Detection Using Local Kernel Density Estimation and Context-Based Regression.
IEEE Trans. Knowl. Data Eng., 2020

STA-CNN: Convolutional Spatial-Temporal Attention Learning for Action Recognition.
IEEE Trans. Image Process., 2020

Tangent Fisher Vector on Matrix Manifolds for Action Recognition.
IEEE Trans. Image Process., 2020

Anisotropic Convolution for Image Classification.
IEEE Trans. Image Process., 2020

Multi-Cue Semi-Supervised Color Constancy With Limited Training Samples.
IEEE Trans. Image Process., 2020

Manipulating Template Pixels for Model Adaptation of Siamese Visual Tracking.
IEEE Signal Process. Lett., 2020

Distractor-aware discrimination learning for online multiple object tracking.
Pattern Recognit., 2020

Graph convolutional network with structure pooling and joint-wise channel attention for action recognition.
Pattern Recognit., 2020

Tracking-by-Fusion via Gaussian Process Regression Extended to Transfer Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Dual L₁-Normalized Context Aware Tensor Power Iteration and Its Applications to Multi-object Tracking and Multi-graph Matching.
Int. J. Comput. Vis., 2020

DSIC: Dynamic Sample-Individualized Connector for Multi-Scale Object Detection.
CoRR, 2020

Anchor-Free One-Stage Online Multi-object Tracking.
Proceedings of the Pattern Recognition and Computer Vision - Third Chinese Conference, 2020

Globally Spatial-Temporal Perception: a Long-Term Tracking System.
Proceedings of the IEEE International Conference on Image Processing, 2020

End-to-End Temporal Feature Aggregation for Siamese Trackers.
Proceedings of the IEEE International Conference on Image Processing, 2020

Ocean: Object-Aware Anchor-Free Tracking.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model.
Proceedings of the Computer Vision - ECCV 2020, 2020

The Eighth Visual Object Tracking VOT2020 Challenge Results.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Object Relational Graph With Teacher-Recommended Learning for Video Captioning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

RDSNet: A New Deep Architecture for Reciprocal Object Detection and Instance Segmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Asymmetric 3D Convolutional Neural Networks for action recognition.
Pattern Recognit., 2019

Rank-1 Tensor Approximation for High-Order Association in Multi-target Tracking.
Int. J. Comput. Vis., 2019

RDSNet: A New Deep Architecture for Reciprocal Object Detection and Instance Segmentation.
CoRR, 2019

VATEX Captioning Challenge 2019: Multi-modal Information Fusion and Multi-stage Training Strategy for Video Captioning.
CoRR, 2019

Multimodal Semantic Attention Network for Video Captioning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Anchor Diffusion for Unsupervised Video Object Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Fast Online Object Tracking and Segmentation: A Unifying Approach.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Knowledge Distillation via Instance Relationship Graph.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Context-Dependent Random Walk Graph Kernels and Tree Pattern Graph Matching Kernels With Applications to Action Recognition.
IEEE Trans. Image Process., 2018

Deep Constrained Siamese Hash Coding Network and Load-Balanced Locality-Sensitive Hashing for Near Duplicate Image Detection.
IEEE Trans. Image Process., 2018

FatRegion: A Fast Adaptive Tree-Structured Region Extraction Approach.
IEEE Trans. Circuits Syst. Video Technol., 2018

Towards Robust and Accurate Multi-View and Partially-Occluded Face Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Dual Sticky Hierarchical Dirichlet Process Hidden Markov Model and Its Application to Natural Language Description of Motions.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Do not Lose the Details: Reinforced Representation Learning for High Performance Visual Tracking.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Online Multi-Target Tracking with Tensor-Based High-Order Graph Matching.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

SPCNet: Scale Position Correlation Network for End-to-End Visual Tracking.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Distractor-Aware Siamese Networks for Visual Object Tracking.
Proceedings of the Computer Vision - ECCV 2018, 2018

Visual Tracking via Spatially Aligned Correlation Filters Network.
Proceedings of the Computer Vision - ECCV 2018, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep Cost-Sensitive and Order-Preserving Feature Learning for Cross-Population Age Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Hierarchical Nonlinear Orthogonal Adaptive-Subspace Self-Organizing Map Based Feature Extraction for Human Action Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Diagnosing deep learning models for high accuracy age estimation from a single image.
Pattern Recognit., 2017

D2C: Deep cumulatively and comparatively learning for human age estimation.
Pattern Recognit., 2017

Salient Object Detection via Structured Matrix Decomposition.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Multi-View Multi-Instance Learning Based on Joint Sparse Representation and Multi-View Dictionary Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Semi-Supervised Tensor-Based Graph Embedding Learning and Its Application to Visual Discriminant Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Towards human-like and transhuman perception in AI 2.0: a review.
Frontiers Inf. Technol. Electron. Eng., 2017

Human activity prediction using temporally-weighted generalized time warping.
Neurocomputing, 2017

DCFNet: Discriminant Correlation Filters Network for Visual Tracking.
CoRR, 2017

Diversity encouraging ensemble of convolutional networks for high performance action recognition.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

SCNN: Sequential convolutional neural network for human action recognition in videos.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

The Visual Object Tracking VOT2017 Challenge Results.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection from Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Multimodal Web Aesthetics Assessment Based on Structural SVM and Multitask Fusion Learning.
IEEE Trans. Multim., 2016

Multi-Perspective Cost-Sensitive Context-Aware Multi-Instance Sparse Coding and Its Application to Sensitive Video Recognition.
IEEE Trans. Multim., 2016

Multi-Instance Multi-Label Learning Combining Hierarchical Context and its Application to Image Annotation.
IEEE Trans. Multim., 2016

Learning A Superpixel-Driven Speed Function for Level Set Tracking.
IEEE Trans. Cybern., 2016

Fusing ℝ Features and Local Features with Context-Aware Kernels for Action Recognition.
Int. J. Comput. Vis., 2016

Multi-Cue Illumination Estimation via a Tree-Structured Group Joint Sparse Representation.
Int. J. Comput. Vis., 2016

Hierarchical Bayesian Multiple Kernel Learning Based Feature Fusion for Action Recognition.
Proceedings of the Multimodal Pattern Recognition of Social Signals in Human-Computer-Interaction, 2016

Metadata-Based Clustered Multi-task Learning for Thread Mining in Web Communities.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2016

A Probabilistic Matrix Factorization Method for Link Sign Prediction in Social Networks.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2016

Bootstrapping deep feature hierarchy for pornographic image recognition.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Graph Based Skeleton Motion Representation and Similarity Measurement for Action Recognition.
Proceedings of the Computer Vision - ECCV 2016, 2016

The Visual Object Tracking VOT2016 Challenge Results.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Tensor Power Iteration for Multi-graph Matching.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Image Copy Detection Based on Convolutional Neural Networks.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

2015
Horror Image Recognition Based on Context-Aware Multi-Instance Learning.
IEEE Trans. Image Process., 2015

Single and Multiple Object Tracking Using a Multi-Feature Joint Sparse Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Erratum to: A Robust Tracking System for Low Frame Rate Video.
Int. J. Comput. Vis., 2015

A Robust Tracking System for Low Frame Rate Video.
Int. J. Comput. Vis., 2015

Predicting Image Memorability by Multi-view Adaptive Regression.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Robust visual tracking using joint scale-spatial correlation filters.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Load-balanced locality-sensitive hashing: A new method for efficient near duplicate image detection.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Joint Scale-Spatial Correlation Tracking with Adaptive Rotation Estimation.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Local Subspace Collaborative Tracking.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Human Pose Estimation and Tracking via Parsing a Tree Structure Based Human Model.
IEEE Trans. Syst. Man Cybern. Syst., 2014

Context-Aware Hypergraph Construction for Robust Spectral Clustering.
IEEE Trans. Knowl. Data Eng., 2014

Modeling Geometric-Temporal Context With Directional Pyramid Co-Occurrence for Action Recognition.
IEEE Trans. Image Process., 2014

Action Recognition Using Nonnegative Action Component Representation and Sparse Basis Selection.
IEEE Trans. Image Process., 2014

Evaluating Combinational Illumination Estimation Methods on Real-World Images.
IEEE Trans. Image Process., 2014

Image Classification Using Multiscale Information Fusion Based on Saliency Driven Nonlinear Diffusion Filtering.
IEEE Trans. Image Process., 2014

Graph-Embedding-Based Learning for Robust Object Tracking.
IEEE Trans. Ind. Electron., 2014

Online Adaboost-Based Parameterized Methods for Dynamic Distributed Network Intrusion Detection.
IEEE Trans. Cybern., 2014

Learning Human Actions by Combining Global Dynamics and Local Appearance.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Bin Ratio-Based Histogram Distances and Their Application to Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Learning from Multi-User Multi-Attribute Annotations.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

RGBD Salient Object Detection: A Benchmark and Algorithms.
Proceedings of the Computer Vision - ECCV 2014, 2014

Transfer Learning Based Visual Tracking with Gaussian Processes Regression.
Proceedings of the Computer Vision - ECCV 2014, 2014

Towards Multi-view and Partially-Occluded Face Alignment.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Multi-target Tracking with Motion Context in Tensor Power Iteration.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
A survey of appearance models in visual object tracking.
ACM Trans. Intell. Syst. Technol., 2013

Active Contour-Based Visual Tracking by Integrating Colors, Shapes, and Motions.
IEEE Trans. Image Process., 2013

Robust Head Tracking Based on Multiple Cues Fusion in the Kernel-Bayesian Framework.
IEEE Trans. Circuits Syst. Video Technol., 2013

Block covariance based l₁ tracker with a subtle template dictionary.
Pattern Recognit., 2013

Action recognition using linear dynamic systems.
Pattern Recognit., 2013

An Incremental DPMM-Based Method for Trajectory Clustering, Modeling, and Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

An Improved Hierarchical Dirichlet Process-Hidden Markov Model and Its Application to Trajectory Modeling and Retrieval.
Int. J. Comput. Vis., 2013

Non-negative Sparse Coding Using Independent Multi-Codebooks for Near-Duplicate Image Detection.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

Horror Text Recognition Based on Generalized Expectation Criteria.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

Spatio-temporal Features for Efficient Video Copy Detection.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

Mining activities using sticky multimodal dual hierarchical Dirichlet process hidden Markov model.
Proceedings of the IEEE International Conference on Image Processing, 2013

Learning silhouette dynamics for human action recognition.
Proceedings of the IEEE International Conference on Image Processing, 2013


Graph Embedding Based Semi-supervised Discriminative Tracker.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Robust Object Tracking with Online Multi-lifespan Dictionary Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Discriminant Tracking Using Tensor Representation with Semi-supervised Improvement.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Combining sparse appearance features and dense motion features via random forest for action detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

Adaptive cooperative tracking based on multi-graph embedding and Markov Random Field.
Proceedings of the IEEE International Conference on Acoustics, 2013

Distance Map of Various Weights: A new feature for adaptive object tracking.
Proceedings of the IEEE International Conference on Acoustics, 2013

3D R Transform on Spatio-temporal Interest Points for Action Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Multi-task Sparse Learning with Beta Process Prior for Action Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Multi-target Tracking by Rank-1 Tensor Approximation.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Illumination Estimation Based on Bilayer Sparse Coding.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

An Efficient Approach to Web Near-Duplicate Image Detection.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

Saliency Driven Nonlinear Diffusion Filtering for Object Recognition.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

Multi-object Tracking under Occlusion Using Dual-Mode Graph Embedding.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

Label Ranking by Directly Optimizing Performance Measures.
Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

Salient Object Detection via Low-Rank and Structured Sparse Matrix Decomposition.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Efficient Clustering Aggregation Based on Data Fragments.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Supervised class-specific dictionary learning for sparse modeling in action recognition.
Pattern Recognit., 2012

Single and Multiple Object Tracking Using Log-Euclidean Riemannian Subspace and Block-Division Appearance Model.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Unsupervised Ensemble Learning for Mining Top-n Outliers.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2012

Context-aware affective images classification based on bilayer sparse representation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Scaring or pleasing: exploit emotional impact of an image.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

An Efficient Video Copy Detection Method Combining Vocabulary Tree and Inverted File.
Proceedings of the Intelligent Science and Intelligent Data Engineering, 2012

Context-aware horror video scene recognition via cost-sensitive sparse coding.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Multiple sample group pairs' graph embedding for tracking.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Horror Video Scene Recognition Based on Multi-view Multi-instance Learning.
Proceedings of the Computer Vision - ACCV 2012, 2012

Visual Saliency Map from Tensor Analysis.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
A Survey on Visual Content-Based Video Indexing and Retrieval.
IEEE Trans. Syst. Man Cybern. Part C, 2011

Recognition of adult images, videos, and web page bags.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Adaptive learning codebook for action recognition.
Pattern Recognit. Lett., 2011

Visual tracking via dynamic tensor analysis with mean update.
Neurocomputing, 2011

Incremental Tensor Subspace Learning and Its Applications to Foreground Segmentation and Tracking.
Int. J. Comput. Vis., 2011

Evaluating the visual quality of web pages using a computational aesthetic approach.
Proceedings of the Fourth International Conference on Web Search and Web Data Mining, 2011

RKOF: Robust Kernel-Based Local Outlier Detection.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2011

Robust visual tracking via transfer learning.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Web Horror Image Recognition Based on Context-Aware Multi-instance Learning.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Context-Aware Multi-instance Learning Based on Hierarchical Sparse Representation.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Robust object tracking with boosted discriminative model via graph embedding.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Horror video scene recognition via Multiple-Instance learning.
Proceedings of the IEEE International Conference on Acoustics, 2011

Multi-cue based multi-target tracking using online random forests.
Proceedings of the IEEE International Conference on Acoustics, 2011

Efficient block-division model for robust multiple object tracking.
Proceedings of the IEEE International Conference on Acoustics, 2011

Evaluating combinational color constancy methods on real-world images.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Space-time neighborhood based hierarchical descriptor for action recognition.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

Ranking social emotions by learning listwise preference.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010
Multiple Object Tracking Via Species-Based Particle Swarm Optimization.
IEEE Trans. Circuits Syst. Video Technol., 2010

Heat Kernel Based Local Binary Pattern for Face Representation.
IEEE Signal Process. Lett., 2010

Robust object tracking using a spatial pyramid heat kernel structural information representation.
Neurocomputing, 2010

Linear discriminant analysis using rotational invariant L₁ norm.
Neurocomputing, 2010

Learning to evaluate the visual quality of web pages.
Proceedings of the 19th International Conference on World Wide Web, 2010

Identifying Multi-instance Outliers.
Proceedings of the SIAM International Conference on Data Mining, 2010

Video Scene Segmentation Using Time Constraint Dominant-Set Clustering.
Proceedings of the Advances in Multimedia Modeling, 2010

Prototype Learning Using Metric Learning Based Behavior Recognition.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Semi-supervised Trajectory Learning Using a Multi-Scale Key Point Based Trajectory Representation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Discriminative Level Set for Contour Tracking.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Event Recognition Based on Top-Down Motion Attention.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Local Outlier Detection Based on Kernel Regression.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Compact visual codebook for action recognition.
Proceedings of the International Conference on Image Processing, 2010

Horror movie scene recognition based on emotional perception.
Proceedings of the International Conference on Image Processing, 2010

Spatio-Temporal Proximity Distribution Kernels for Action Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

A swarm intelligence based searching strategy for articulated 3D human body tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Use bin-ratio information for category and scene classification.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Group ranking with application to image retrieval.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Probabilistic Index Histogram for Robust Object Tracking.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Top-Down Cues for Event Recognition.
Proceedings of the Computer Vision - ACCV 2010, 2010

Occlusion Handling with ℓ₁-Regularized Sparse Reconstruction.
Proceedings of the Computer Vision - ACCV 2010, 2010

Horror Image Recognition Based on Emotional Attention.
Proceedings of the Computer Vision - ACCV 2010, 2010

Action Recognition.
Proceedings of the Machine Learning for Human Motion Analysis - Theory and Practice., 2010

2009
Unsupervised Active Learning Based on Hierarchical Graph-Theoretic Clustering.
IEEE Trans. Syst. Man Cybern. Part B, 2009

Occlusion Reasoning for Tracking Multiple People.
IEEE Trans. Circuits Syst. Video Technol., 2009

Detecting image spam using local invariant features and pyramid match kernel.
Proceedings of the 18th International Conference on World Wide Web, 2009

Rank Aggregation Based Text Feature Selection.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Web Intelligence, 2009

Adaptive Distributed Intrusion Detection Using Parametric Model.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Web Intelligence, 2009

A Boosted Semi-supervised Learning Framework for Web Page Filtering.
Proceedings of the IEEE International Conference on Systems, 2009

Normalized Cut Based Coherence Measure Construction for Scene Segmentation.
Proceedings of the Advances in Multimedia Information Processing, 2009

Soccer Video Shot Classification Based on Color Characterization Using Dominant Sets Clustering.
Proceedings of the Advances in Multimedia Information Processing, 2009

Human Activity Recognition Based on ℜ Transform and Fourier Mellin Transform.
Proceedings of the Advances in Visual Computing, 5th International Symposium, 2009

Group Action Recognition Using Space-Time Interest Points.
Proceedings of the Advances in Visual Computing, 5th International Symposium, 2009

Recognition of Semantic Basketball Events Based on Optical Flow Patterns.
Proceedings of the Advances in Visual Computing, 5th International Symposium, 2009

Contour tracking with abrupt motion.
Proceedings of the International Conference on Image Processing, 2009

Video shot segmentation using graph-based dominant-set clustering.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Multi-object tracking via species based particle swarm optimization.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

Efficient human pose estimation via parsing a tree structure based human model.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Image spam filtering using Fourier-Mellin invariant features.
Proceedings of the IEEE International Conference on Acoustics, 2009

Segment Model Based Vehicle Motion Analysis.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

A Smarter Particle Filter.
Proceedings of the Computer Vision, 2009

Human Action Recognition Using Pyramid Vocabulary Tree.
Proceedings of the Computer Vision, 2009

Human Action Recognition under Log-Euclidean Riemannian Metric.
Proceedings of the Computer Vision, 2009

Spectral Graph Partitioning Based on a Random Walk Diffusion Similarity Measure.
Proceedings of the Computer Vision, 2009

Learning Group Activity in Soccer Videos from Local Motion.
Proceedings of the Computer Vision, 2009

2008
AdaBoost-Based Algorithm for Network Intrusion Detection.
IEEE Trans. Syst. Man Cybern. Part B, 2008

User oriented link function classification.
Proceedings of the 17th International Conference on World Wide Web, 2008

Distributed detection of network intrusions based on a parametric model.
Proceedings of the IEEE International Conference on Systems, 2008

Robust foreground segmentation based on two effective background models.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

SVD based Kalman particle filter for robust visual tracking.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Boosted cannabis image recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Multiclass spectral clustering based on discriminant analysis.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Group action recognition in soccer videos.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Key-frame extraction using dominant-set clustering.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Level set tracking with dynamical shape priors.
Proceedings of the International Conference on Image Processing, 2008

Online Boosting Based Intrusion Detection in Changing Environments.
Proceedings of the 2008 International Conference on Artificial Intelligence, 2008

Robust Visual Tracking Based on an Effective Appearance Model.
Proceedings of the Computer Vision, 2008

Sequential particle swarm optimization for visual tracking.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Visual tracking via incremental Log-Euclidean Riemannian subspace learning.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Trajectory-Based Video Retrieval Using Dirichlet Process Mixture Models.
Proceedings of the British Machine Vision Conference 2008, Leeds, UK, September 2008, 2008

2007
Semantic-Based Surveillance Video Retrieval.
IEEE Trans. Image Process., 2007

Recognition of Pornographic Web Pages by Classifying Texts and Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Supervised tensor learning.
Knowl. Inf. Syst., 2007

Customizable Instance-Driven Webpage Filtering Based on Semi-Supervised Learning.
Proceedings of the 2007 IEEE / WIC / ACM International Conference on Web Intelligence, 2007

Dominant Sets-Based Action Recognition using Image Sequence Matching.
Proceedings of the International Conference on Image Processing, 2007

Corner Detection of Contour Images using Spectral Clustering.
Proceedings of the International Conference on Image Processing, 2007

Graph Based Discriminative Learning for Robust and Efficient Object Tracking.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Robust Visual Tracking Based on Incremental Tensor Subspace Learning.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Markov Random Field Modeled Level Sets Method for Object Tracking with Moving Cameras.
Proceedings of the Computer Vision, 2007

Kernel-Bayesian Framework for Object Tracking.
Proceedings of the Computer Vision, 2007

2006
A System for Learning Statistical Motion Patterns.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Principal Axis-Based Correspondence between Multiple Cameras for People Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

A Coarse-to-Fine Strategy for Vehicle Motion Trajectory Clustering.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Indexing and Matching of Video Shots Based on Motion and Color Analysis.
Proceedings of the Ninth International Conference on Control, 2006

2005
3-D Model-Based Vehicle Tracking.
IEEE Trans. Image Process., 2005

Stable Third-Order Tensor Representation for Color Image Classification.
Proceedings of the 2005 IEEE / WIC / ACM International Conference on Web Intelligence (WI 2005), 2005

Kernel Principle Component Analysis in Pixels Clustering.
Proceedings of the 2005 IEEE / WIC / ACM International Conference on Web Intelligence (WI 2005), 2005

Similarity based vehicle trajectory clustering and anomaly detection.
Proceedings of the 2005 International Conference on Image Processing, 2005

2004
Traffic accident prediction using 3-D model-based vehicle tracking.
IEEE Trans. Veh. Technol., 2004

Learning activity patterns using fuzzy self-organizing neural network.
IEEE Trans. Syst. Man Cybern. Part B, 2004

A survey on visual surveillance of object motion and behaviors.
IEEE Trans. Syst. Man Cybern. Part C, 2004

A hierarchical self-organizing approach for learning the patterns of motion trajectories.
IEEE Trans. Neural Networks, 2004

Fusion of static and dynamic body biometrics for gait recognition.
IEEE Trans. Circuits Syst. Video Technol., 2004

People tracking based on motion model and motion constraints with automatic initialization.
Pattern Recognit., 2004

Kinematics-based tracking of human walking in monocular video sequences.
Image Vis. Comput., 2004

A Novel Approach to Detecting Adult Images.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Skin Color Detection Using Multiple Cues.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

A Multi-Object Tracking System for Surveillance Video Analysis.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Tracking People through Occlusions.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Mixture Clustering Using Multidimensional Histograms for Skin Detection.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Adaptive skin detection using multiple cues.
Proceedings of the 2004 International Conference on Image Processing, 2004

Semantic-based traffic video retrieval using activity pattern analysis.
Proceedings of the 2004 International Conference on Image Processing, 2004

Multi-camera correspondence based on principal axis of human body.
Proceedings of the 2004 International Conference on Image Processing, 2004

Gait analysis for human identification in frequency domain.
Proceedings of the Third International Conference on Image and Graphics, 2004

Face Contour Construction with Multiple Information.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

2003
Automatic gait recognition based on statistical shape analysis.
IEEE Trans. Image Process., 2003

Recent developments in human motion analysis.
Pattern Recognit., 2003

Silhouette Analysis-Based Gait Recognition for Human Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Pose Evaluation Based on Bayesian classification Error.
Proceedings of the British Machine Vision Conference, 2003

2002
A New Attempt to Gait-based Human Identification.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Semantic Interpretation of Object Activities in a Surveillance System.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Articulated Model Based People Tracking Using Motion Models.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Gait recognition based on Procrustes shape analysis.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
Efficient and robust vehicle localization.
Proceedings of the 2001 International Conference on Image Processing, 2001

