Hehe Fan

Orcid: 0000-0001-9572-2345

According to our database1, Hehe Fan authored at least 50 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition.
IEEE Trans. Multim., 2024

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering.
IEEE Trans. Multim., 2024

EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing.
CoRR, 2024

ProtChatGPT: Towards Understanding Proteins with Large Language Models.
CoRR, 2024

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.
CoRR, 2024

Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DocMSU: A Comprehensive Benchmark for Document-Level Multimodal Sarcasm Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Point Spatio-Temporal Transformer Networks for Point Cloud Video Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation.
CoRR, 2023

A Reliable Representation with Bidirectional Transition Model for Visual Reinforcement Learning Generalization.
CoRR, 2023

FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax.
CoRR, 2023

Prior-Free Continual Learning with Unlabeled Data in the Wild.
CoRR, 2023

DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation.
CoRR, 2023

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering.
CoRR, 2023

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023.
CoRR, 2023

STPrivacy: Spatio-Temporal Tubelet Sparsification and Anonymization for Privacy-preserving Action Recognition.
CoRR, 2023

Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PointListNet: Deep Learning on 3D Point Lists.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Text to Point Cloud Localization with Relation-Enhanced Transformer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

SEFormer: Structure Embedding Transformer for 3D Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Temporal Cross-Layer Correlation Mining for Action Recognition.
IEEE Trans. Multim., 2022

Unsupervised Visual Representation Learning via Dual-Level Progressive Similar Instance Selection.
IEEE Trans. Cybern., 2022

Understanding Atomic Hand-Object Interaction With Human Intention.
IEEE Trans. Circuits Syst. Video Technol., 2022

Entropy guided attention network for weakly-supervised action localization.
Pattern Recognit., 2022

Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?
CoRR, 2022

Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction.
Proceedings of the Computer Vision - ECCV 2022, 2022

Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Few-Shot Common-Object Reasoning Using Common-Centric Localization Network.
IEEE Trans. Image Process., 2021

Motion = Video - Content: Towards Unsupervised Learning of Motion Representation from Videos.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences.
Proceedings of the 9th International Conference on Learning Representations, 2021

Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
From Video Classification to Video Prediction: Deep Learning Approaches to Video Modelling
PhD thesis, 2020

Recurrent Attention Network with Reinforced Generator for Visual Dialog.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Adaptive Exploration for Unsupervised Person Re-identification.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Cascaded Revision Network for Novel Object Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Person Tube Retrieval via Language Description.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
PointRNN: Point Recurrent Neural Network for Moving Point Cloud Processing.
CoRR, 2019

Cascaded Revision Network for Novel Object Captioning.
CoRR, 2019

Attract or Distract: Exploit the Margin of Open Set.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Cubic LSTMs for Video Prediction.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Unsupervised Person Re-identification: Clustering and Fine-tuning.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
Unsupervised Person Re-identification: Clustering and Fine-tuning.
CoRR, 2017

Complex Event Detection by Identifying Reliable Shots from Untrimmed Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Multiple kernel visual-auditory representation learning for retrieval.
Multim. Tools Appl., 2016

Informedia @ TRECVID 2016.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016


  Loading...