Qingkai Fang

Orcid: 0000-0001-8575-591X

According to our database, Qingkai Fang authored at least 22 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Efficient Training for Cross-lingual Speech Language Models.
CoRR, April, 2026

2025
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model.
CoRR, June, 2025

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis.
CoRR, May, 2025

FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaMA-Omni: Seamless Speech Interaction with Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaMA-Omni 2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment.
CoRR, 2024

StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CTC-based Non-autoregressive Textless Speech-to-Speech Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models.
CoRR, 2023

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Understanding and Bridging the Modality Gap for Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Back Translation for Speech-to-text Translation Without Transcripts.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Low-resource Neural Machine Translation with Cross-modal Alignment.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Neural Machine Translation with Phrase-Level Universal Visual Representations.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Geometric Object 3D Reconstruction from Single Line Drawing Image Based on a Network for Classification and Sketch Extraction.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021
