Hai Huang

Affiliations:

Zhejiang University, Hangzhou, China

According to our database¹, Hai Huang authored at least 19 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation.

[BibT_eX]

[DOI]

CoRR, September, 2025

TAP: Parameter-efficient Task-Aware Prompting for Adverse Weather Removal.

[BibT_eX]

[DOI]

CoRR, August, 2025

Open-set Cross Modal Generalization via Multimodal Unified Representation.

[BibT_eX]

[DOI]

CoRR, July, 2025

Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations.

[BibT_eX]

[DOI]

CoRR, July, 2025

Continual Cross-Modal Generalization.

[BibT_eX]

[DOI]

CoRR, April, 2025

Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Overcoming both Domain Shift and Label Shift for Referring Video Segmentation.

[BibT_eX]

[DOI]

Hai Huang

Sashuai Zhou

Yan Xia

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning.

[BibT_eX]

[DOI]

Sashuai Zhou

Yan Xia

Hai Huang

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Semantic Residual for Multimodal Unified Discrete Representation.

[BibT_eX]

[DOI]

Hai Huang

Shulei Wang

Yan Xia

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Towards Transformer-Based Aligned Generation with Self-Coherence Guidance.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ControlSpeech: Towards Simultaneous and Independent Zero-shot Speaker Cloning and Zero-shot Language Style Control.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Language-Codec: Bridging Discrete Codec Representations and Speech Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Enhancing Multimodal Unified Representations for Cross Modal Generalization.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

CART: A Generative Cross-Modal Retrieval Framework With Coarse-To-Fine Semantic Modeling.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling.

[BibT_eX]

[DOI]

CoRR, 2024

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec.

[BibT_eX]

[DOI]

CoRR, 2024

Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Achieving Cross Modal Generalization with Multimodal Unified Representation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hai Huang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...