Kohei Matsuura

According to our database1, Kohei Matsuura authored at least 23 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2026
Microphone array geometry-independent multi-talker distant ASR: NTT system for DASR task of the CHiME-8 challenge.
Comput. Speech Lang., 2026

2025
Alignment-Free Training for Transducer-based Multi-Talker ASR.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Advancing Streaming ASR with Chunk-wise Attention and Trans-chunk Selective State Spaces.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Bridging Speech and Text Foundation Models with ReShape Attention.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over.
CoRR, 2024

Investigation of Speaker Representation for Target-Speaker Speech Processing.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

What Do Self-Supervised Speech and Speaker Models Learn? New Findings from a Cross Model Layer-Wise Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Leveraging Language Embeddings for Cross-Lingual Self-Supervised Speech Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Scheduled Sampling for Neural Transducer-Based ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Leveraging Large Text Corpora For End-To-End Speech Summarization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speech Summarization of Long Spoken Document: Improving Memory Efficiency of Speech/Text Encoders.
Proceedings of the IEEE International Conference on Acoustics, 2023

Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Summarize While Translating: Universal Model With Parallel Decoding for Summarization and Translation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Hybrid RNN-T/Attention-Based Streaming ASR with Triggered Chunkwise Attention and Dual Internal Language Model Integration.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020


  Loading...