Bingshen Mu

Orcid: 0009-0009-7797-865X

According to our database, Bingshen Mu authored at least 19 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Semantic-Aware Interruption Detection in Spoken Dialogue Systems: Benchmark, Metric, and Model.

CoRR, March, 2026

Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning.

CoRR, March, 2026

LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech.

CoRR, January, 2026

dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition.

CoRR, January, 2026

WenetSpeech-Wu: Datasets, Benchmarks, and Models for a Unified Chinese Wu Dialect Speech Processing Ecosystem.

CoRR, January, 2026

Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages.

CoRR, September, 2025

Summary on The Multilingual Conversational Speech Language Model Challenge: Datasets, Tasks, Baselines, and Methods.

CoRR, September, 2025

Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech Recognition.

CoRR, July, 2025

Weakly Supervised Data Refinement and Flexible Sequence Compression for Efficient Thai LLM-based ASR.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Efficient Scaling for LLM-based ASR.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024

MMGER: Multi-Modal and Multi-Granularity Generative Error Correction With LLM for Joint Accent and Speech Recognition.

IEEE Signal Process. Lett., 2024

MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition.

CoRR, 2024

E-Chat: Emotion-Sensitive Spoken Dialogue System with Large Language Models.

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets.

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Automatic Channel Selection and Spatial Feature Integration for Multi-Channel Speech Recognition Across Various Array Topologies.

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge.

Proceedings of the IEEE International Conference on Acoustics, 2023
