Ke-Han Lu

Orcid: 0000-0002-5331-0534

According to our database, Ke-Han Lu authored at least 22 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

TAU: A Benchmark for Cultural Sound Understanding Beyond Semantics.

Yueh-Hsuan Huang

Hsiu-Hsuan Wang

CoRR, September, 2025

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment.

CoRR, July, 2025

Reducing Object Hallucination in Large Audio-Language Models via Audio-Aware Decoding.

Cheng-Han Chiang

CoRR, June, 2025

Speech-IFEval: Evaluating Instruction-Following and Quantifying Catastrophic Forgetting in Speech-Aware Language Models.

CoRR, May, 2025

Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models.

CoRR, May, 2025

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.

Wei-Cheng Tseng

Fabian Alejandro Ritter Gutierrez

et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data.

Chao-Han Huck Yang

Jagadeesh Balam

Yu-Chiang Frank Wang

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

SpeechCaps: Advancing Instruction-Based Universal Speech Models with Multi-Talker Speaking Style Captioning.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

2024

Building a Taiwanese Mandarin Spoken Language Model: A First Attempt.

CoRR, 2024

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.

Wei-Cheng Tseng

Fabian Ritter Gutierrez

Chung-Ming Chien

Cheng-Hsiu Hsieh

Heitor R. Guimarães

Kuan-Yu Fang Chiang

Shao-Syuan Huang

Shih-Yun Shan Kuan

Chao-Han Huck Yang

Shao-Xiang Yuan

Shinji Watanabe

CoRR, 2024

Codec-Superb @ SLT 2024: A Lightweight Benchmark For Neural Audio Codec Models.

Alexander H. Liu

Shinji Watanabe

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Listen and Speak Fairly: a Study on Semantic Gender Bias in Speech Integrated Large Language Models.

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Speech-Copilot: Leveraging Large Language Models for Speech Processing Via Task Decomposition, Modularization, and Program Generation.

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

HypR: A comprehensive study for ASR hypothesis revising with a reference corpus.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment.

Yu-Chiang Frank Wang

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR And Speech-to-Text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech.

Roshan S. Sharma

Shinji Watanabe

Bhiksha Ramakrishnan

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

HypR: A comprehensive study for ASR hypothesis revising with a reference corpus.

CoRR, 2023

2022

Non-Autoregressive ASR Modeling Using Pre-Trained Language Models for Chinese Speech Recognition.

IEEE ACM Trans. Audio Speech Lang. Process., 2022

A Context-Aware Knowledge Transferring Strategy for CTC-Based ASR.

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

2021

A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021.

CoRR, 2021

ntust-nlp-2 at ROCLING-2021 Shared Task: BERT-based semantic analyzer with word-level information.

Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021
