Hai Huang

Affiliations:
  • Zhejiang University, Hangzhou, China


According to our database, Hai Huang authored at least 18 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
TAP: Parameter-efficient Task-Aware Prompting for Adverse Weather Removal.
CoRR, August, 2025

Open-set Cross Modal Generalization via Multimodal Unified Representation.
CoRR, July, 2025

Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations.
CoRR, July, 2025

IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models.
CoRR, May, 2025

Continual Cross-Modal Generalization.
CoRR, April, 2025

Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning.
CoRR, March, 2025

Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Overcoming both Domain Shift and Label Shift for Referring Video Segmentation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Semantic Residual for Multimodal Unified Discrete Representation.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Towards Transformer-Based Aligned Generation with Self-Coherence Guidance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ControlSpeech: Towards Simultaneous and Independent Zero-shot Speaker Cloning and Zero-shot Language Style Control.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Language-Codec: Bridging Discrete Codec Representations and Speech Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Enhancing Multimodal Unified Representations for Cross Modal Generalization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

CART: A Generative Cross-Modal Retrieval Framework With Coarse-To-Fine Semantic Modeling.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling.
CoRR, 2024

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec.
CoRR, 2024

Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment.
CoRR, 2024

2023
Achieving Cross Modal Generalization with Multimodal Unified Representation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

