Fei Yuan

Affiliations:
  • Shanghai Artificial Intelligence Laboratory (Shanghai AI Laboratory), China


According to our database1, Fei Yuan authored at least 24 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback.
CoRR, July, 2025

A Controllable Examination for Long-Context Language Models.
CoRR, June, 2025

Could Thinking Multilingually Empower LLM Reasoning?
CoRR, April, 2025

Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning.
CoRR, February, 2025

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models.
CoRR, February, 2025

WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages.
CoRR, January, 2025

KS-Lottery: Finding Certified Lottery Tickets for Multilingual Transfer in Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

LegoMT2: Selective Asynchronous Sharded Data Parallel Training for Massive Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
A Controlled Study on Long Context Extension and Generalization in LLMs.
CoRR, 2024

MindMerger: Efficient Boosting LLM Reasoning in non-English Languages.
CoRR, 2024

The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights.
CoRR, 2024

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond.
CoRR, 2024

KS-Lottery: Finding Certified Lottery Tickets for Multilingual Language Models.
CoRR, 2024

MindMerger: Efficiently Boosting LLM Reasoning in non-English Languages.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Question Translation Training for Better Multilingual Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
How Multilingual is Multilingual LLM?
CoRR, 2023

Extrapolating Large Language Models to Non-English by Aligning Languages.
CoRR, 2023

Scaling TransNormer to 175 Billion Parameters.
CoRR, 2023

Extrapolating Multilingual Understanding Models as Multilingual Generators.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Lego-MT: Learning Detachable Models for Massively Multilingual Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Lego-MT: Towards Detachable Models in Massively Multilingual Machine Translation.
CoRR, 2022


  Loading...