Shaolei Zhang

Orcid: 0000-0002-7254-9380

Affiliations:
  • Renmin University of China, School of Information, Beijing, China
  • Chinese Academy of Sciences, Institute of Computing Technology, Key Laboratory of Intelligent Information Processing, Beijing, China (PhD 2025)
  • Beijing University of Posts and Telecommunications, School of Computer Science, Beijing, China (until 2020)


According to our database1, Shaolei Zhang authored at least 38 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment.
CoRR, September, 2025

PSO-Merging: Merging Models Based on Particle Swarm Optimization.
CoRR, August, 2025

FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing.
CoRR, July, 2025

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model.
CoRR, June, 2025

TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes.
CoRR, May, 2025

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis.
CoRR, May, 2025

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaMA-Omni: Seamless Speech Interaction with Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaMA-Omni 2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models.
CoRR, 2024

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment.
CoRR, 2024

Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models.
CoRR, 2024

SiLLM: Large Language Models for Simultaneous Machine Translation.
CoRR, 2024

Glancing Future for Simultaneous Machine Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024

TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Truth-Aware Context Selection: Mitigating Hallucinations of Large Language Models Being Misled by Untruthful Contexts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Decoder-only Streaming Transformer for Simultaneous Translation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models.
CoRR, 2023

Unified Segment-to-Segment Framework for Simultaneous Sequence Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hidden Markov Transformer for Simultaneous Machine Translation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Non-autoregressive Streaming Transformer for Simultaneous Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Simultaneous Machine Translation with Tailored Reference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

End-to-End Simultaneous Speech Translation with Differentiable Segmentation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Learning Optimal Policy for Simultaneous Machine Translation via Binary Search.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Information-Transport-based Policy for Simultaneous Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Turning Fixed to Adaptive: Integrating Post-Evaluation into Simultaneous Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Gaussian Multi-head Attention for Simultaneous Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Reducing Position Bias in Simultaneous Machine Translation with Length-Aware Framework.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Modeling Dual Read/Write Paths for Simultaneous Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Modeling Concentrated Cross-Attention for Neural Machine Translation with Gaussian Mixture Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Future-Guided Incremental Transformer for Simultaneous Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2019
Opinion Knowledge Injection Network for Aspect Extraction.
Proceedings of the Neural Information Processing - 26th International Conference, 2019


  Loading...