Bingshen Mu

ORCID: 0009-0009-7797-865X

According to our database, Bingshen Mu authored at least 19 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Semantic-Aware Interruption Detection in Spoken Dialogue Systems: Benchmark, Metric, and Model.
CoRR, March, 2026

Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning.
CoRR, March, 2026

LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech.
CoRR, January, 2026

dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition.
CoRR, January, 2026

WenetSpeech-Wu: Datasets, Benchmarks, and Models for a Unified Chinese Wu Dialect Speech Processing Ecosystem.
CoRR, January, 2026

Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages.
CoRR, September, 2025

Summary on The Multilingual Conversational Speech Language Model Challenge: Datasets, Tasks, Baselines, and Methods.
CoRR, September, 2025

Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech Recognition.
CoRR, July, 2025

Weakly Supervised Data Refinement and Flexible Sequence Compression for Efficient Thai LLM-based ASR.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Efficient Scaling for LLM-based ASR.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024
MMGER: Multi-Modal and Multi-Granularity Generative Error Correction With LLM for Joint Accent and Speech Recognition.
IEEE Signal Process. Lett., 2024

MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition.
CoRR, 2024

E-Chat: Emotion-Sensitive Spoken Dialogue System with Large Language Models.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Automatic Channel Selection and Spatial Feature Integration for Multi-Channel Speech Recognition Across Various Array Topologies.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023
Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

