Tianzhu Zhang

Orcid: 0000-0003-1856-9564

Affiliations:
  • University of Science and Technology of China, Hefei, China


According to our database1, Tianzhu Zhang authored at least 246 papers between 2009 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
GaitC<sup>3</sup>I: Robust Cross-Covariate Gait Recognition via Causal Intervention.
IEEE Trans. Circuits Syst. Video Technol., August, 2025

Learning Adaptive Conceptual Prototypes for 3D Single Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., July, 2025

ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting.
CoRR, July, 2025

CUBE360: Learning Cubic Field Representation for Monocular Panoramic Depth Estimation.
IEEE Robotics Autom. Lett., June, 2025

Exploring Semantic Masked Autoencoder for Self-supervised Point Cloud Understanding.
CoRR, June, 2025

StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning.
CoRR, June, 2025

CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection.
CoRR, June, 2025

Quantity-Quality Enhanced Self-Training Network for Weakly Supervised Point Cloud Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

Learning Discriminative Features for Visual Tracking via Scenario Decoupling.
Int. J. Comput. Vis., May, 2025

Exploring the Better Correlation for Few-Shot Video Object Segmentation.
IEEE Trans. Circuits Syst. Video Technol., March, 2025

SAS: Segment Any 3D Scene with Integrated 2D Priors.
CoRR, March, 2025

Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation.
IEEE Trans. Circuits Syst. Video Technol., February, 2025

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2025

Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation.
CoRR, February, 2025

Purify Then Guide: A Bi-Directional Bridge Network for Open-Vocabulary Semantic Segmentation.
IEEE Trans. Circuits Syst. Video Technol., January, 2025

DepthMaster: Taming Diffusion Models for Monocular Depth Estimation.
CoRR, January, 2025

Rethinking Masked Representation Learning for 3D Point Cloud Understanding.
IEEE Trans. Image Process., 2025

Adaptive Prototype Learning for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Image Process., 2025

Learning Cubic Field Representation from A Single Panorama for Virtual Reality.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2025

State Space Model Meets Transformer: A New Paradigm for 3D Object Detection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards Unbiased Learning in Semi-Supervised Semantic Segmentation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Learning Shape-Independent Transformation via Spherical Representations for Category-Level Object Pose Estimation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Rethinking Correspondence-based Category-Level Object Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Implicit Correspondence Learning for Image-to-Point Cloud Registration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Dual-Agent Optimization framework for Cross-Domain Few-Shot Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Structure-Aware Correspondence Learning for Relative Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Exploring the Better Multimodal Synergy Strategy for Vision-Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Beyond Pixel and Object: Part Feature as Reference for Few-Shot Video Object Segmentation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Pamba: Enhancing Global Interaction in Point Clouds via State Space Model.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Alleviate and Mining: Rethinking Unsupervised Domain Adaptation for Mitochondria Segmentation from Pseudo-Label Perspective.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Attention-Driven Memory Network for Online Visual Tracking.
IEEE Trans. Neural Networks Learn. Syst., December, 2024

Hierarchy-Aware Interactive Prompt Learning for Few-Shot Classification.
IEEE Trans. Circuits Syst. Video Technol., December, 2024

FD-GAN: Generalizable and Robust Forgery Detection via Generative Adversarial Networks.
Int. J. Comput. Vis., December, 2024

HybridPrompt: Domain-Aware Prompting for Cross-Domain Few-Shot Learning.
Int. J. Comput. Vis., December, 2024

Multi-Modal Attribute Prompting for Vision-Language Models.
IEEE Trans. Circuits Syst. Video Technol., November, 2024

Reliable Phrase Feature Mining for Hierarchical Video-Text Retrieval.
IEEE Trans. Circuits Syst. Video Technol., November, 2024

Learning Hierarchical Visual Transformation for Domain Generalizable Visual Matching and Recognition.
Int. J. Comput. Vis., November, 2024

Reference-Aware Adaptive Network for Image-Text Matching.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

HA-Bins: Hierarchical Adaptive Bins for Robust Monocular Depth Estimation Across Multiple Datasets.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Learning Dynamic Compact Memory Embedding for Deformable Visual Object Tracking.
IEEE Trans. Neural Networks Learn. Syst., April, 2024

Feature Disentanglement Network: Multi-Object Tracking Needs More Differentiated Features.
ACM Trans. Multim. Comput. Commun. Appl., March, 2024

One-Stream Vision-Language Memory Network for Object Tracking.
IEEE Trans. Multim., 2024

Prototype-Augmented Self-Supervised Generative Network for Generalized Zero-Shot Learning.
IEEE Trans. Image Process., 2024

Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching.
IEEE Trans. Image Process., 2024

Efficient Dynamic Correspondence Network.
IEEE Trans. Image Process., 2024

EI-MVSNet: Epipolar-Guided Multi-View Stereo Network With Interval-Aware Label.
IEEE Trans. Image Process., 2024

A Unified Optimization Framework for Feature-Based Transferable Attacks.
IEEE Trans. Inf. Forensics Secur., 2024

Robust and Generalized Physical Adversarial Attacks via Meta-GAN.
IEEE Trans. Inf. Forensics Secur., 2024

A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation.
CoRR, 2024

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting.
CoRR, 2024

CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality.
CoRR, 2024

Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection.
CoRR, 2024

ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation.
CoRR, 2024

Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model.
CoRR, 2024

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy.
CoRR, 2024

Multi-modal Attribute Prompting for Vision-Language Models.
CoRR, 2024

Frequency Domain Modality-invariant Feature Learning for Visible-infrared Person Re-Identification.
CoRR, 2024

DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Rethinking the Implicit Optimization Paradigm with Dual Alignments for Referring Remote Sensing Image Segmentation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SAM-Glomeruli: Enhanced Segment Anything Model for Precise Glomeruli Segmentation.
Proceedings of the Medical Optical Imaging and Virtual Microscopy Image Analysis, 2024

Aggregation and Purification: Dual Enhancement Network for Point Cloud Few-shot Segmentation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Free Lunch for Gait Recognition: A Novel Relation Descriptor.
Proceedings of the Computer Vision - ECCV 2024, 2024

Exploring Reliable Matching with Phase Enhancement for Night-Time Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Localization and Expansion: A Decoupled Framework for Point Cloud Few-Shot Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Diff3DETR: Agent-Based Diffusion Model for Semi-supervised 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RankMatch: Exploring the Better Consistency Regularization for Semi-Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

BSNet: Box-Supervised Simulation-Assisted Mean Teacher for 3D Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SD2Event: Self-Supervised Learning of Dynamic Detectors and Contextual Descriptors for Event Cameras.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Unsupervised Template-assisted Point Cloud Shape Correspondence Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Task-Adaptive Prompted Transformer for Cross-Domain Few-Shot Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Pay Attention to Target: Relation-Aware Temporal Consistency for Domain Adaptive Video Semantic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Unifying Visual and Vision-Language Tracking via Contrastive Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Electron Microscopy Images as Set of Fragments for Mitochondrial Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Long-Short Range Adaptive Transformer With Dynamic Sampling for 3D Object Detection.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Dynamic Keypoint Detection Network for Image Matching.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Adaptive Part Mining for Robust Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Task-Aware Weakly Supervised Object Localization With Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Uncertainty Guided Collaborative Training for Weakly Supervised and Unsupervised Temporal Action Localization.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Hierarchical Shape-Consistent Transformer for Unsupervised Point Cloud Shape Correspondence.
IEEE Trans. Image Process., 2023

TIFace: Improving Facial Reconstruction through Tensorial Radiance Fields and Implicit Surfaces.
CoRR, 2023

EC-Depth: Exploring the consistency of self-supervised monocular depth estimation under challenging scenes.
CoRR, 2023

The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation.
CoRR, 2023

Adaptive Spot-Guided Transformer for Consistent Local Feature Matching.
CoRR, 2023

Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DAW: Exploring the Better Weighting Function for Semi-supervised Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Structure-Decoupled Adaptive Part Alignment Network for Domain Adaptive Mitochondria Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Enhancing Cell Detection in Histopathology Images: A ViT-Based U-Net Approach.
Proceedings of the Graphs in Biomedical Image Analysis, and Overlapped Cell on Tissue Dataset for Histopathology, 2023

Appearance Prompt Vision Transformer for Connectome Reconstruction.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

The First Visual Object Tracking Segmentation VOTS2023 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023


Foreground-Background Distribution Modeling Transformer for Visual Object Tracking.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multimodal High-order Relation Transformer for Scene Boundary Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Adaptive Template Transformer for Mitochondria Segmentation in Electron Microscopy Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Query Refinement Transformer for 3D Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Unified Transformer-based Tracker for Anti-UAV Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Spot-Guided Transformer for Consistent Local Feature Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Rethinking the Correlation in Few-Shot Segmentation: A Buoys View.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DualRel: Semi-Supervised Mitochondria Segmentation from A Prototype Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Camouflaged Instance Segmentation via Explicit De-Camouflaging.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

D<sup>2</sup>Former: Jointly Learning Hierarchical Detectors and Contextual Descriptors via Agent-Based Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Dynamic Generative Targeted Attacks with Pattern Injection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SE-ORNet: Self-Ensembling Orientation-Aware Network for Unsupervised Point Cloud Shape Correspondence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Structured Epipolar Matcher for Local Feature Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Domain Generalized Stereo Matching via Hierarchical Visual Transformation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Entropy-guided Open-set Fine-grained Fungi Recognition.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

2022
Focus Your Attention: A Focal Attention for Multimodal Learning.
IEEE Trans. Multim., 2022

Adversarial Transformers for Weakly Supervised Object Localization.
IEEE Trans. Image Process., 2022

Diverse Complementary Part Mining for Weakly Supervised Object Localization.
IEEE Trans. Image Process., 2022

Visible-Infrared Person Re-Identification With Modality-Specific Memory Network.
IEEE Trans. Image Process., 2022

Joint Attention-Guided Feature Fusion Network for Saliency Detection of Surface Defects.
IEEE Trans. Instrum. Meas., 2022

Robust Collaborative Learning of Patch-Level and Image-Level Annotations for Diabetic Retinopathy Grading From Fundus Image.
IEEE Trans. Cybern., 2022

Object Tracking via Spatial-Temporal Memory Network.
IEEE Trans. Circuits Syst. Video Technol., 2022

Target-Distractor Aware Deep Tracking With Discriminative Enhancement Learning Loss.
IEEE Trans. Circuits Syst. Video Technol., 2022

Bayesian Correlation Filter Learning With Gaussian Scale Mixture Model for Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2022

Correlation filters based on spatial-temporal Gaussion scale mixture modelling for visual tracking.
Neurocomputing, 2022

MAUNet: Modality-Aware Anti-Ambiguity U-Net for Multi-Modality Cell Segmentation.
Proceedings of The Cell Segmentation Challenge in Multi-modality High-Resolution Microscopy Images, 2022

Electron Microscopy Image Registration with Transformers.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Adaptive Agent Transformer for Few-Shot Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Cross-Modality Transformer for Visible-Infrared Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

Motion-modulated Temporal Fragment Alignment Network For Few-Shot Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A Keypoint-based Global Association Network for Lane Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Implementation of Parallel Acceleration for Real-time Extraction of Visual Features.
Proceedings of the 4th International Conference on Artificial Intelligence and Advanced Manufacturing, 2022

2021
Part-based Structured Representation Learning for Person Re-identification.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Density-Aware Multi-Task Learning for Crowd Counting.
IEEE Trans. Multim., 2021

Local Correspondence Network for Weakly Supervised Temporal Sentence Grounding.
IEEE Trans. Image Process., 2021

Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection.
IEEE Trans. Image Process., 2021

Consistency Graph Modeling for Semantic Correspondence.
IEEE Trans. Image Process., 2021

Learning to Model Relationships for Zero-Shot Video Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Task-aware Part Mining Network for Few-Shot Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Foreground Activation Maps for Weakly Supervised Object Localization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Geometry Uncertainty Projection Network for Monocular 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Meta-Attack: Class-agnostic and Model-agnostic Physical Adversarial Attack.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Lesion-Aware Transformers for Diabetic Retinopathy Grading.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Action Unit Memory Network for Weakly Supervised Temporal Action Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Diverse Part Discovery: Occluded Person Re-Identification With Part-Aware Transformer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval.
IEEE Trans. Multim., 2020

A Unified Deep Model for Joint Facial Expression Recognition, Face Synthesis, and Face Alignment.
IEEE Trans. Image Process., 2020

Geometry Guided Pose-Invariant Facial Expression Recognition.
IEEE Trans. Image Process., 2020

Online Multi-Expert Learning for Visual Tracking.
IEEE Trans. Image Process., 2020

Self-Supervised Agent Learning for Unsupervised Cross-Domain Person Re-Identification.
IEEE Trans. Image Process., 2020

Guest Editorial Introduction to the Special Section on Intelligent Visual Content Analysis and Understanding.
IEEE Trans. Circuits Syst. Video Technol., 2020

Cross-modality paired-images generation and augmentation for RGB-infrared person re-identification.
Neural Networks, 2020

Discriminative multimodal embedding for event classification.
Neurocomputing, 2020

A Structured Graph Attention Network for Vehicle Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Self-Supervised Domain-Aware Generative Network for Generalized Zero-Shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Modality Cross Attention Network for Image and Sentence Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cross-Modality Person Re-Identification With Shared-Specific Feature Transfer.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Graph Structured Network for Image-Text Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Attention Scaling for Crowd Counting.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cross-Modality Paired-Images Generation for RGB-Infrared Person Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation.
IEEE Trans. Multim., 2019

SMART: Joint Sampling and Regression for Visual Tracking.
IEEE Trans. Image Process., 2019

Robust Structural Sparse Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Learning Multi-Task Correlation Particle Filters for Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Multi-modal max-margin supervised topic model for social event analysis.
Multim. Tools Appl., 2019

Video Highlight Detection via Region-Based Deep Ranking Model.
Int. J. Pattern Recognit. Artif. Intell., 2019

Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Boundary Perception Guidance: A Scribble-Supervised Semantic Segmentation Approach.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Exploring Feature Representation and Training Strategies in Temporal Action Localization.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

GCAN: Graph Convolutional Adversarial Network for Unsupervised Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Graph Convolutional Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

I Know the Relationships: Zero-Shot Action Recognition via Two-Stream Graph Convolutional Networks and Knowledge Graphs.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Text2Video: An End-to-end Learning Framework for Expressing Text With Videos.
IEEE Trans. Multim., 2018

Deep-Structured Event Modeling for User-Generated Photos.
IEEE Trans. Multim., 2018

Online Multimodal Multiexpert Learning for Social Event Tracking.
IEEE Trans. Multim., 2018

Cross-Domain Collaborative Learning via Discriminative Nonparametric Bayesian Model.
IEEE Trans. Multim., 2018

Three-Dimensional Attention-Based Deep Ranking Model for Video Highlight Detection.
IEEE Trans. Multim., 2018

Correlation Particle Filter for Visual Tracking.
IEEE Trans. Image Process., 2018

P2T: Part-to-Target Tracking via Deep Regression Learning.
IEEE Trans. Image Process., 2018

Robust Target Tracking by Online Random Forests and Superpixels.
IEEE Trans. Circuits Syst. Video Technol., 2018

Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Unified Generative Adversarial Framework for Image Generation and Person Re-identification.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Learning semantic topics for domain-adapted textual knowledge transfer.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Joint Pose and Expression Modeling for Facial Expression Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Deep Relative Tracking.
IEEE Trans. Image Process., 2017

Temporal Restricted Visual Tracking Via Reverse-Low-Rank Sparse Learning.
IEEE Trans. Cybern., 2017

Discriminative Reverse Sparse Tracking via Weighted Multitask Learning.
IEEE Trans. Circuits Syst. Video Technol., 2017

Video Highlight Detection via Deep Ranking Modeling.
Proceedings of the Image and Video Technology - 8th Pacific-Rim Symposium, 2017

A Unified Personalized Video Recommendation via Dynamic Recurrent Neural Networks.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

A Generic Framework for Social Event Analysis.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

The Visual Object Tracking VOT2017 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Multi-task Correlation Particle Filter for Robust Object Tracking.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Semantic Feature Mining for Video Event Understanding.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Deep Relative Attributes.
IEEE Trans. Multim., 2016

Multi-Modal Event Topic Model for Social Event Analysis.
IEEE Trans. Multim., 2016

Robust Visual Tracking via Exclusive Context Modeling.
IEEE Trans. Cybern., 2016

Special Issue on Visual Tracking.
Comput. Vis. Image Underst., 2016

Abnormal Event Discovery in User Generated Photos.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Multi-modal Multi-view Topic-opinion Mining for Social Event Analysis.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

The Visual Object Tracking VOT2016 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

In Defense of Sparse Tracking: Circulant Sparse Tracker.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Structural Correlation Filter for Robust Visual Tracking.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

3D Part-Based Sparse Tracker with Automatic Synchronization and Registration.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Boosted Multifeature Learning for Cross-Domain Transfer.
ACM Trans. Multim. Comput. Commun. Appl., 2015

Automatic Visual Concept Learning for Social Event Understanding.
IEEE Trans. Multim., 2015

Cross-Domain Feature Learning in Multimedia.
IEEE Trans. Multim., 2015

Latent Support Vector Machine Modeling for Sign Language Recognition with Kinect.
ACM Trans. Intell. Syst. Technol., 2015

Multi-object tracking via MHT with multiple information fusion in surveillance video.
Multim. Syst., 2015

A new discriminative coding method for image classification.
Multim. Syst., 2015

Robust Visual Tracking Via Consistent Low-Rank Sparse Learning.
Int. J. Comput. Vis., 2015

Cross-Domain Collaborative Learning in Social Multimedia.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Structural Sparse Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Cross-Domain Multi-Event Tracking via CO-PMHT.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Social Event Classification via Boosted Multimodal Supervised Latent Dirichlet Allocation.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Boosted Multi-modal Supervised Latent Dirichlet Allocation for Social Event Classification.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Multi-Modal Supervised Latent Dirichlet Allocation for Event Classification in Social Media.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Partial Occlusion Handling for Visual Tracking via Robust Part Matching.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Mining Semantic Context Information for Intelligent Video Surveillance of Traffic Scenes.
IEEE Trans. Ind. Informatics, 2013

Discriminative Exemplar Coding for Sign Language Recognition With Kinect.
IEEE Trans. Cybern., 2013

M<sup>4</sup>L: Maximum margin Multi-instance Multi-cluster Learning for scene modeling.
Pattern Recognit., 2013

Robust Visual Tracking via Structured Multi-Task Sparse Learning.
Int. J. Comput. Vis., 2013

Multi-cue Based Multi-target Tracking with Boosted MHT.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Graph-Guided Fusion Penalty Based Sparse Coding for Image Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Latent support vector machine for sign language recognition with Kinect.
Proceedings of the IEEE International Conference on Image Processing, 2013

Locality discriminative coding for image classification.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Low-Rank Sparse Coding for Image Classification.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Object Tracking by Occlusion Detection via Structured Sparse Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
A Generic Framework for Video Annotation via Semi-Supervised Learning.
IEEE Trans. Multim., 2012

Weakly Supervised Graph Propagation Towards Collective Image Parsing.
IEEE Trans. Multim., 2012

Hi, magic closet, tell me what to wear!
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Context-aware learning for automatic sports highlight recognition.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Robust multi-object tracking via cross-domain contextual information for sports video analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Low-Rank Sparse Learning for Robust Visual Tracking.
Proceedings of the Computer Vision - ECCV 2012, 2012

Robust visual tracking via multi-task sparse learning.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Hierarchical Object Representations for Visual Recognition via Weakly Supervised Learning.
Proceedings of the Computer Vision - ACCV 2012, 2012

2011
Boosted Exemplar Learning for Action Recognition and Annotation.
IEEE Trans. Circuits Syst. Video Technol., 2011

Boosted multi-class semi-supervised learning for human action recognition.
Pattern Recognit., 2011

2010
Human Action Recognition in Videos Using Hybrid Motion Features.
Proceedings of the Advances in Multimedia Modeling, 2010

A generic framework for event detection in various video domains.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Human action recognition via multi-view learning.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

2009
Human action recognition in videos using motion impression image.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Boosted Exemplar Learning for human action recognition.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

Learning semantic scene models by object classification and trajectory clustering.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009


  Loading...