Zhan Tong

According to our database, Zhan Tong authored at least 19 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Contextual AD Narration with Interleaved Multimodal Sequence.
CoRR, 2024

2023
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification.
CoRR, 2023

Bootstrapping SparseFormers from Vision Foundation Models.
CoRR, 2023

Advancing Vision Transformers with Group-Mix Attention.
CoRR, 2023

Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training.
CoRR, 2023

TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale.
CoRR, 2023

SparseFormer: Sparse Visual Recognition via Limited Latent Tokens.
CoRR, 2023

CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection.
CoRR, 2023

Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Efficient Video Action Detection with Token Dropout and Context Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Data-consistent Unsupervised Diffusion Model for Metal Artifact Reduction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

2022
Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations.
CoRR, 2022

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EViT: Expediting Vision Transformers via Token Reorganizations.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
MGSampler: An Explainable Sampling Strategy for Video Action Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TDN: Temporal Difference Networks for Efficient Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2019
TransmiR v2.0: an updated transcription factor-microRNA regulation database.
Nucleic Acids Res., 2019
