Ming Sun

Affiliations:
  • Meta Platforms, Inc., USA


According to our database, Ming Sun authored at least 17 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Equipping LLM with Directional Multi-Talker Speech Understanding Capabilities.
CoRR, February, 2026

2025
SLM-TTA: A Framework for Test-Time Adaptation of Generative Spoken Language Models.
CoRR, December, 2025

Multi-Channel Differential ASR for Robust Wearer Speech Recognition on Smart Glasses.
CoRR, September, 2025

Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

MASV: Speaker Verification with Global and Local Context Mamba.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Directional Speech Recognition with Full-Duplex Capability.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Effective Integration of KAN for Keyword Spotting.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Directional Source Separation for Robust Speech Recognition on Smart Glasses.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MMW: Side Talk Rejection Multi-Microphone Whisper On Smart Glasses.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

Long-Form Fuzzy Speech-to-Text Alignment for 1000+ Languages.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024
MASV: Speaker Verification with Global and Local Context Mamba.
CoRR, 2024

FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation.
CoRR, 2024

Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Disentangled Training with Adversarial Examples for Robust Small-Footprint Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting.
CoRR, 2022

