Wenxuan Huang
Orcid: 0009-0001-9656-813XAffiliations:
- East China Normal University, School of Computer Science and Technology, Shanghai, China
According to our database1,
Wenxuan Huang
authored at least 14 papers
between 2023 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on github.com
On csauthors.net:
Bibliography
2025
CoRR, June, 2025
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning.
CoRR, June, 2025
MT<sup>3</sup>: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning.
CoRR, May, 2025
ReactDance: Progressive-Granular Representation for Long-Term Coherent Reactive Dance Generation.
CoRR, May, 2025
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation.
CoRR, April, 2025
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning.
CoRR, April, 2025
LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?
CoRR, March, 2025
CoRR, March, 2025
An Intelligent First-Arrival Picking Method of Microseismic Signals Based on the Small Sample Expansion.
IEEE Trans. Geosci. Remote. Sens., 2025
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
CoRR, 2023