Zhuofan Xia

Orcid: 0009-0001-7965-364X

According to our database1, Zhuofan Xia authored at least 20 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
PhyPrompt: RL-based Prompt Refinement for Physically Plausible Text-to-Video Generation.
CoRR, March, 2026

Towards Sparse Video Understanding and Reasoning.
CoRR, February, 2026

2025
Step by Step Network.
CoRR, November, 2025

Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception.
CoRR, September, 2025

From ReLU to GeMU: Activation functions in the lens of cone projection.
Neural Networks, 2025

2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data.
CoRR, 2024

Training an Open-Vocabulary Monocular 3D Detection Model without 3D Data.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Demystify Mamba in Vision: A Linear Attention Perspective.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Bridging the Divide: Reconsidering Softmax and Linear Attention.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Efficient Diffusion Transformer with Step-Wise Dynamic Attention Mediators.
Proceedings of the Computer Vision - ECCV 2024, 2024

Agent Attention: On the Integration of Softmax and Linear Attention.
Proceedings of the Computer Vision - ECCV 2024, 2024

GSVA: Generalized Segmentation via Multimodal Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Agent Attention: On the Integration of Softmax and Linear Attention.
CoRR, 2023

Generalized Activation via Multivariate Projection.
CoRR, 2023

DAT++: Spatially Dynamic Vision Transformer with Deformable Attention.
CoRR, 2023

Budgeted Training for Vision Transformer.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Adaptive Rotated Convolution for Rotated Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Vision Transformer with Deformable Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
3D Object Detection With Pointformer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021


  Loading...