Qingkai Fang

Orcid: 0000-0001-8575-591X

According to our database¹, Qingkai Fang authored at least 22 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Efficient Training for Cross-lingual Speech Language Models.

[BibT_eX]

[DOI]

CoRR, April, 2026

2025

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model.

[BibT_eX]

[DOI]

CoRR, June, 2025

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, May, 2025

FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaMA-Omni: Seamless Speech Interaction with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaMA-Omni 2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CTC-based Non-autoregressive Textless Speech-to-Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation.

[BibT_eX]

[DOI]

Qingkai Fang

Yan Zhou

Yang Feng

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation.

[BibT_eX]

[DOI]

Yan Zhou

Qingkai Fang

Yang Feng

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Understanding and Bridging the Modality Gap for Speech Translation.

[BibT_eX]

[DOI]

Qingkai Fang

Yang Feng

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Back Translation for Speech-to-text Translation Without Transcripts.

[BibT_eX]

[DOI]

Qingkai Fang

Yang Feng

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Low-resource Neural Machine Translation with Cross-modal Alignment.

[BibT_eX]

[DOI]

Zhe Yang

Qingkai Fang

Yang Feng

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Neural Machine Translation with Phrase-Level Universal Visual Representations.

[BibT_eX]

[DOI]

Qingkai Fang

Yang Feng

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Geometric Object 3D Reconstruction from Single Line Drawing Image Based on a Network for Classification and Sketch Extraction.

[BibT_eX]

[DOI]

Zhuoying Wang

Qingkai Fang

Yongtao Wang

Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Qingkai Fang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...