Ke-Han Lu
Orcid: 0000-0002-5331-0534
According to our database1,
Ke-Han Lu
authored at least 21 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment.
CoRR, July, 2025
Reducing Object Hallucination in Large Audio-Language Models via Audio-Aware Decoding.
CoRR, June, 2025
Speech-IFEval: Evaluating Instruction-Following and Quantifying Catastrophic Forgetting in Speech-Aware Language Models.
CoRR, May, 2025
Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models.
CoRR, May, 2025
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
SpeechCaps: Advancing Instruction-Based Universal Speech Models with Multi-Talker Speaking Style Captioning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
2024
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
CoRR, 2024
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Listen and Speak Fairly: a Study on Semantic Gender Bias in Speech Integrated Large Language Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Speech-Copilot: Leveraging Large Language Models for Speech Processing Via Task Decomposition, Modularization, and Program Generation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR And Speech-to-Text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2024
Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
CoRR, 2023
2022
Non-Autoregressive ASR Modeling Using Pre-Trained Language Models for Chinese Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
2021
A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021.
CoRR, 2021
ntust-nlp-2 at ROCLING-2021 Shared Task: BERT-based semantic analyzer with word-level information.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021