Luyao Cheng

Orcid: 0009-0006-1311-8448

According to our database, Luyao Cheng authored at least 20 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models.
CoRR, August, 2025

OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment.
CoRR, June, 2025

3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Self-Distillation Prototypes Network: Learning Robust Speaker Representations without Supervision.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization on Multi-party Conversation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Clustering-NN-Based CFO Estimation Using Random Access Preambles for 5G Non-Terrestrial Networks.
IEEE Wirel. Commun. Lett., March, 2024

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation.
CoRR, 2024

Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization.
CoRR, 2024

Hyperspectral Image Change Detection via Cross-Sample Slot Attention and Dual Gated Feed-Forward Network.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

ERes2NetV2: Boosting Short-Duration Speaker Verification Performance with Computational Efficiency.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Joint Activity Detection and Channel Estimation for 6G GFRA: A Memory-Enhanced DL Network Framework.
Proceedings of the IEEE Globecom Workshops 2024, 2024

2023
Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation.
CoRR, 2023

3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement.
CoRR, 2023

CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Pushing the Limits of Self-Supervised Speaker Verification using Regularized Distillation Framework.
Proceedings of the IEEE International Conference on Acoustics, 2023

Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
TEA-PSE: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System for ICASSP 2022 DNS Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

