Feng Zheng

Orcid: 0000-0002-1701-9141

Affiliations:
  • Southern University of Science and Technology, Shenzhen, Guangdong, China


According to our database1, Feng Zheng authored at least 191 papers between 2010 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MCoCa: Towards fine-grained multimodal control in image captioning.
Pattern Recognit., 2026

2025
UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

Towards Balanced Representation Learning with Semantic Anchor Regularization.
Int. J. Comput. Vis., October, 2025

Leveraging Geometric Priors for Unaligned Scene Change Detection.
CoRR, September, 2025

Generalization Bounds of Deep Neural Networks With τ-Mixing Samples.
IEEE Trans. Neural Networks Learn. Syst., August, 2025

Few-Shot Referring Video Single- and Multi-Object Segmentation Via Cross-Modal Affinity with Instance Sequence Matching.
Int. J. Comput. Vis., August, 2025

Learning to Generalize Heterogeneous Representation for Cross-Modality Image Synthesis via Multiple Domain Interventions.
Int. J. Comput. Vis., July, 2025

Refine Any Object in Any Scene.
CoRR, June, 2025

LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization.
CoRR, June, 2025

Bidirectional Error-Aware Fusion Network for Video Inpainting.
IEEE Trans. Circuits Syst. Video Technol., January, 2025

Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs.
CoRR, January, 2025

SoftPatch+: Fully unsupervised anomaly classification and segmentation.
Pattern Recognit., 2025

Low-visibility Crop Detection in Agricultural Scenes via Point Cloud Guidance.
Proceedings of the International Joint Conference on Neural Networks, 2025

MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Multi-Constraint Transferable Generative Adversarial Networks for Cross-Modal Brain Image Synthesis.
Int. J. Comput. Vis., November, 2024

Overcoming Data Deficiency for Multi-Person Pose Estimation.
IEEE Trans. Neural Networks Learn. Syst., August, 2024

Gradient Learning With the Mode-Induced Loss: Consistency Analysis and Applications.
IEEE Trans. Neural Networks Learn. Syst., July, 2024

IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing.
IEEE Trans. Cybern., May, 2024

Cross-modality Neuroimage Synthesis: A Survey.
ACM Comput. Surv., March, 2024

Deep Industrial Image Anomaly Detection: A Survey.
Mach. Intell. Res., February, 2024

A Bayesian Federated Learning Framework With Online Laplace Approximation.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

Cross-Modal Alternating Learning With Task-Aware Representations for Continual Learning.
IEEE Trans. Multim., 2024

Weakly-Supervised RGBD Video Object Segmentation.
IEEE Trans. Image Process., 2024

MCD-Net: Toward RGB-D Video Inpainting in Real-World Scenes.
IEEE Trans. Image Process., 2024

Unveiling the Power of Visible-Thermal Video Object Segmentation.
IEEE Trans. Circuits Syst. Video Technol., 2024

Allowing Supervision in Unsupervised Deformable- Instances Image-to-Image Translation.
IEEE Trans. Circuits Syst. Video Technol., 2024

Error Density-dependent Empirical Risk Minimization.
Expert Syst. Appl., 2024

Agri-LLaVA: Knowledge-Infused Large Multimodal Assistant on Agricultural Pests and Diseases.
CoRR, 2024

PlantCamo: Plant Camouflage Detection.
CoRR, 2024

MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection.
CoRR, 2024

CAR: Controllable Autoregressive Modeling for Visual Generation.
CoRR, 2024

1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation.
CoRR, 2024

LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model.
CoRR, 2024

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization.
CoRR, 2024

SoftPatch: Unsupervised Anomaly Detection with Noisy Data.
CoRR, 2024

Toward Multi-class Anomaly Detection: Exploring Class-aware Unified Model against Inter-class Interference.
CoRR, 2024

Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Fine-grained Analysis of Stability and Generalization for Stochastic Bilevel Optimization.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Place Anything into Any Video.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Negative Label Guided OOD Detection with Pretrained Vision-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-Driven Diffusion.
Proceedings of the Computer Vision - ECCV 2024, 2024

Tuning-Free Image Customization with Image and Text Guidance.
Proceedings of the Computer Vision - ECCV 2024, 2024

The Second Visual Object Tracking Segmentation VOTS2024 Challenge Results.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024


MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Block Image Compressive Sensing with Local and Global Information Interaction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Pose-Aided Video-Based Person Re-Identification via Recurrent Graph Convolutional Network.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Verbal-Person Nets: Pose-Guided Multi-Granularity Language-to-Person Generation.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Continuous cross-modal hashing.
Pattern Recognit., October, 2023

Decoupling Multimodal Transformers for Referring Video Object Segmentation.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Label-Guided Generative Adversarial Network for Realistic Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Show, Tell and Rephrase: Diverse Video Captioning via Two-Stage Progressive Training.
IEEE Trans. Multim., 2023

Does Thermal Really Always Matter for RGB-T Salient Object Detection?
IEEE Trans. Multim., 2023

Video Object Segmentation using Point-based Memory Network.
Pattern Recognit., 2023

FedMed-GAN: Federated domain translation on unsupervised cross-modality brain image synthesis.
Neurocomputing, 2023

Video Understanding with Large Language Models: A Survey.
CoRR, 2023

LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad.
CoRR, 2023

K-Space-Aware Cross-Modality Score for Synthesized Neuroimage Quality Assessment.
CoRR, 2023

LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning.
CoRR, 2023

CageViT: Convolutional Activation Guided Efficient Vision Transformer.
CoRR, 2023

Track Anything: Segment Anything Meets Videos.
CoRR, 2023

Pushing the Limits of Fewshot Anomaly Detection in Industry Vision: Graphcore.
CoRR, 2023

Deep Industrial Image Anomaly Detection: A Survey.
CoRR, 2023

Deep learning for video object segmentation: a review.
Artif. Intell. Rev., 2023

Real3D-AD: A Dataset of Point Cloud Anomaly Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Depth-aided Camouflaged Object Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Point-aware Interaction and CNN-induced Refinement Network for RGB-D Salient Object Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

EasyNet: An Easy Network for 3D Industrial Anomaly Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Detecting Out-of-distribution Data through In-distribution Class Prior.
Proceedings of the International Conference on Machine Learning, 2023

Pushing the Limits of Fewshot Anomaly Detection in Industry Vision: Graphcore.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

What Makes a Good Data Augmentation for Few-Shot Unsupervised Image Anomaly Detection?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Resource-Efficient RGBD Aerial Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Accelerating Vision-Language Pretraining with Free Language Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

On the Stability and Generalization of Triplet Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Conditional Feature Embedding by Visual Clue Correspondence Graph for Person Re-Identification.
IEEE Trans. Image Process., 2022

Spatial-Temporal Pyramid Graph Reasoning for Action Recognition.
IEEE Trans. Image Process., 2022

Conditional Feature Learning Based Transformer for Text-Based Person Search.
IEEE Trans. Image Process., 2022

Deep Adaptively-Enhanced Hashing With Discriminative Similarity Guidance for Unsupervised Cross-Modal Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2022

Rethinking Camouflaged Object Detection: Models and Datasets.
IEEE Trans. Circuits Syst. Video Technol., 2022

PSIDP: Unsupervised deep hashing with pretrained semantic information distillation and preservation.
Neurocomputing, 2022

Deep Manifold Hashing: A Divide-and-Conquer Approach for Semi-Paired Unsupervised Cross-Modal Retrieval.
CoRR, 2022

T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up.
CoRR, 2022

Subband-based Generative Adversarial Network for Non-parallel Many-to-many Voice Conversion.
CoRR, 2022

Exploiting Context Information for Generic Event Boundary Captioning.
CoRR, 2022

Semantic-Aware Pretraining for Dense Video Captioning.
CoRR, 2022

RGBD Object Tracking: An In-depth Review.
CoRR, 2022

Audio-Visual MLP for Scoring Sport.
CoRR, 2022

A Survey of Visual Sensory Anomaly Detection.
CoRR, 2022

A Survey of Cross-Modality Brain Image Synthesis.
CoRR, 2022

FedMed-ATL: Misaligned Unpaired Brain Image Synthesis via Affine Transform Loss.
CoRR, 2022

FedMed-GAN: Federated Multi-Modal Unsupervised Brain Image Synthesis.
CoRR, 2022

Hightlight Video Detection in Figure Skating.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

SoftPatch: Unsupervised Anomaly Detection with Noisy Data.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Prompting for Multi-Modal Tracking.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

FedMed-ATL: Misaligned Unpaired Cross-Modality Neuroimage Synthesis via Affine Transform Loss.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Continual Adaptation in Industrial Anomaly Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix.
Proceedings of the International Conference on Machine Learning, 2022

Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline.
Proceedings of the Computer Vision - ECCV 2022, 2022

S<sup>2</sup>Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-supervised Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Dual-Fused Modality-Aware Representations for RGBD Tracking.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Class-Aware Contrastive Semi-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Meta Distribution Alignment for Generalizable Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward.
Proceedings of the Computer Vision - ACCV 2022, 2022

Error-Based Knockoffs Inference for Controlled Feature Selection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

GuidedMix-Net: Semi-supervised Semantic Segmentation by Using Labeled Images as Reference.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

VITA: A Multi-Source Vicinal Transfer Augmentation Method for Out-of-Distribution Generalization.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Sparse Modal Additive Model.
IEEE Trans. Neural Networks Learn. Syst., 2021

Learning Efficient Hash Codes for Fast Graph-Based Data Similarity Retrieval.
IEEE Trans. Image Process., 2021

Deep 3D human pose estimation: A review.
Comput. Vis. Image Underst., 2021

GuidedMix-Net: Learning to Improve Pseudo Masks Using Labeled Images as Reference.
CoRR, 2021

Brain Image Synthesis with Unsupervised Multivariate Canonical CSC𝓁<sub>4</sub>Net.
CoRR, 2021

Tiny Adversarial Mulit-Objective Oneshot Neural Architecture Search.
CoRR, 2021

Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search.
CoRR, 2021

Camera-Agnostic Person Re-Identification via Adversarial Disentangling Learning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Norm-guided Adaptive Visual Embedding for Zero-Shot Sketch-Based Image Retrieval.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Saliency-Associated Object Tracking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DepthTrack: Unveiling the Power of RGBD Tracking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

End-to-End Dense Video Captioning with Parallel Decoding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

FREE: Feature Refinement for Generalized Zero-Shot Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

HID 2021: Competition on Human Identification at a Distance 2021.
Proceedings of the International IEEE Joint Conference on Biometrics, 2021

Brain Image Synthesis With Unsupervised Multivariate Canonical CSCl4Net.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

One for More: Selecting Generalizable Samples for Generalizable ReID Model.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Constructing a Fair Classifier with Generated Fair Data.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Distributed Ranking with Communications: Approximation Analysis and Applications.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

A Unified Multi-Scenario Attacking Network for Visual Object Tracking.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dual Distribution Alignment Network for Generalizable Person Re-Identification.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
MCMT-GAN: Multi-Task Coherent Modality Transferable GAN for 3D Brain Image Synthesis.
IEEE Trans. Image Process., 2020

A parallel down-up fusion network for salient object detection in optical remote sensing images.
Neurocomputing, 2020

Video scene parsing: An overview of deep learning methods and datasets.
Comput. Vis. Image Underst., 2020

Dual Distribution Alignment Network for Generalizable Person Re-Identification.
CoRR, 2020

Multi-task Additive Models for Robust Estimation and Automatic Structure Discovery.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Lab2Pix: Label-Adaptive Generative Adversarial Network for Unsupervised Image Synthesis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

LSOTB-TIR: A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Super-Resolution and Inpainting with Degraded and Upgraded Generative Adversarial Networks.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Hijacking Tracker: A Powerful Adversarial Attack on Visual Tracking.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Enabling Deep Residual Networks for Weakly Supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Noise-Aware Fully Webly Supervised Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

One-Shot Adversarial Attacks on Visual Tracking With Dual Attention.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Salience-Guided Cascaded Suppression Network for Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Viewpoint-Aware Loss with Angular Regularization for Person Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Supervised Online Hashing via Similarity Distribution Learning.
CoRR, 2019

Retrieval by Classification: Discriminative Binary Embedding for Sketch-Based Image Retrieval.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Binarized Neural Networks for Resource-Efficient Hashing with Minimizing Quantization Loss.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Automatic Grassland Degradation Estimation Using Deep Learning.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Equally-Guided Discriminative Hashing for Cross-modal Retrieval.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Part Power Set Model for Scale-Free Person Retrieval.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Particle Swarm Loss for Lightweight Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Deep Spectral Clustering Using Dual Autoencoder Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Deep Asymmetric Metric Learning via Rich Relationship Mining.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Multitarget Sparse Latent Regression.
IEEE Trans. Neural Networks Learn. Syst., 2018

Robust and Long-Term Object Tracking With an Application to Vehicles.
IEEE Trans. Intell. Transp. Syst., 2018

A Winner-Take-All Strategy for Improved Object Tracking.
IEEE Trans. Image Process., 2018

Dense Invariant Feature-Based Support Vector Ranking for Cross-Camera Person Reidentification.
IEEE Trans. Circuits Syst. Video Technol., 2018

Hetero-Manifold Regularisation for Cross-Modal Hashing.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Learning a Multiple Kernel Similarity Metric for kinship verification.
Inf. Sci., 2018

A Coarse-to-fine Pyramidal Model for Person Re-identification via Multi-Loss Dynamic Training.
CoRR, 2018

Fast Vehicle Identification in Surveillance via Ranked Semantic Sampling Based Embedding.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Unsupervised Deep Generative Adversarial Hashing Network.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Direct Hashing Without Pseudo-Labels.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Dual-Reference Face Retrieval.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Supervised Local Descriptor Learning for Human Action Recognition.
IEEE Trans. Multim., 2017

Dual-reference Face Retrieval: What Does He/She Look Like at Age 'X'?
CoRR, 2017

2016
Learning Cross-View Binary Identities for Fast Person Re-Identification.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Dense invariant feature based support vector ranking for person re-identification.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

2014
Realistic action recognition via sparsely-constructed Gaussian processes.
Pattern Recognit., 2014

Learn++ for Robust Object Tracking.
Proceedings of the British Machine Vision Conference, 2014

Discriminative Embedding via Image-to-Class Distances.
Proceedings of the British Machine Vision Conference, 2014

2013
A semi-supervised approach for dimensionality reduction with distributional similarity.
Neurocomputing, 2013

2011
Recent advances and trends in visual tracking: A review.
Neurocomputing, 2011

Action recognition using graph embedding and the co-occurrence matrices descriptor.
Int. J. Comput. Math., 2011

2010
Hand Detection and Gesture Recognition Exploit Motion Times Image in Complicate Scenarios.
Proceedings of the Advances in Visual Computing - 6th International Symposium, 2010

A set of co-occurrence matrices on the intrinsic manifold of human silhouettes for action recognition.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Eigen-space learning using semi-supervised diffusion maps for human action recognition.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010


  Loading...