Wenxuan Huang
Orcid: 0009-0001-9656-813XAffiliations:
- East China Normal University, School of Computer Science and Technology, Shanghai, China
According to our database1,
Wenxuan Huang authored at least 30 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on github.com
On csauthors.net:
Bibliography
2026
CoRR, April, 2026
CoRR, April, 2026
SCALE:Scalable Conditional Atlas-Level Endpoint transport for virtual cell perturbation prediction.
CoRR, March, 2026
HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts.
CoRR, March, 2026
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant.
CoRR, March, 2026
VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph.
CoRR, February, 2026
Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression.
CoRR, February, 2026
CoRR, February, 2026
Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models.
CoRR, February, 2026
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models.
CoRR, January, 2026
2025
CoRR, November, 2025
CoRR, October, 2025
CoRR, October, 2025
MASA: Rethinking the Representational Bottleneck in LoRA with Multi-A Shared Adaptation.
CoRR, October, 2025
Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models.
CoRR, October, 2025
CoRR, June, 2025
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning.
CoRR, June, 2025
MT<sup>3</sup>: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning.
CoRR, May, 2025
ReactDance: Progressive-Granular Representation for Long-Term Coherent Reactive Dance Generation.
CoRR, May, 2025
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning.
CoRR, April, 2025
LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?
CoRR, March, 2025
CoRR, March, 2025
An Intelligent First-Arrival Picking Method of Microseismic Signals Based on the Small Sample Expansion.
IEEE Trans. Geosci. Remote. Sens., 2025
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
CoRR, 2023