Zhaoye Fei

According to our database¹, Zhaoye Fei authored at least 30 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

World Action Models: The Next Frontier in Embodied AI.

[BibT_eX]

[DOI]

CoRR, May, 2026

MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions.

[BibT_eX]

[DOI]

CoRR, March, 2026

MOSS-TTSD: Text to Spoken Dialogue Generation.

[BibT_eX]

[DOI]

CoRR, March, 2026

MOSS-TTS Technical Report.

[BibT_eX]

[DOI]

CoRR, March, 2026

MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models.

[BibT_eX]

[DOI]

CoRR, February, 2026

MOVA: Towards Scalable and Synchronized Video-Audio Generation.

[BibT_eX]

[DOI]

CoRR, February, 2026

WESR: Scaling and Evaluating Word-level Event-Speech Recognition.

[BibT_eX]

[DOI]

CoRR, January, 2026

MOSS Transcribe Diarize Technical Report.

[BibT_eX]

[DOI]

CoRR, January, 2026

XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

RoboOmni: Proactive Robot Manipulation in Omni-modal Context.

[BibT_eX]

[DOI]

CoRR, October, 2025

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance.

[BibT_eX]

[DOI]

CoRR, October, 2025

CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation.

[BibT_eX]

[DOI]

CoRR, August, 2025

XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs.

[BibT_eX]

[DOI]

CoRR, June, 2025

Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, June, 2025

World-aware Planning Narratives Enhance Large Vision-Language Model Planner.

[BibT_eX]

[DOI]

CoRR, June, 2025

InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems.

[BibT_eX]

[DOI]

CoRR, June, 2025

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

How to Mitigate Overfitting in Weak-to-strong Generalization?

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

InternLM2 Technical Report.

[BibT_eX]

[DOI]

et al.

CoRR, 2024

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset.

[BibT_eX]

[DOI]

CoRR, 2024

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.

[BibT_eX]

[DOI]

CoRR, 2024

Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora.

[BibT_eX]

[DOI]

CoRR, 2024

Turn Waste into Worth: Rectifying Top-k Router of MoE.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Balanced Data Sampling for Language Model Training with Clustering.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2022

Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?

[BibT_eX]

[DOI]

CoRR, 2022

Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021

Towards More Effective and Economic Sparsely-Activated Model.

[BibT_eX]

[DOI]

CoRR, 2021

Zhaoye Fei

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...