Luyao Cheng
Orcid: 0009-0006-1311-8448
According to our database1,
Luyao Cheng
authored at least 20 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models.
CoRR, August, 2025
OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment.
CoRR, June, 2025
3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Self-Distillation Prototypes Network: Learning Robust Speaker Representations without Supervision.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization on Multi-party Conversation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
Clustering-NN-Based CFO Estimation Using Random Access Preambles for 5G Non-Terrestrial Networks.
IEEE Wirel. Commun. Lett., March, 2024
Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization.
CoRR, 2024
Hyperspectral Image Change Detection via Cross-Sample Slot Attention and Dual Gated Feed-Forward Network.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024
ERes2NetV2: Boosting Short-Duration Speaker Verification Performance with Computational Efficiency.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Joint Activity Detection and Channel Estimation for 6G GFRA: A Memory-Enhanced DL Network Framework.
Proceedings of the IEEE Globecom Workshops 2024, 2024
2023
Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation.
CoRR, 2023
3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement.
CoRR, 2023
CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Pushing the Limits of Self-Supervised Speaker Verification using Regularized Distillation Framework.
Proceedings of the IEEE International Conference on Acoustics, 2023
Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
TEA-PSE: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System for ICASSP 2022 DNS Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021