Atsunori Ogawa

Orcid: 0000-0002-2888-101X

According to our database1, Atsunori Ogawa authored at least 113 papers between 1998 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization.
CoRR, 2023

Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization.
CoRR, 2023

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization.
CoRR, 2023

Iterative Shallow Fusion of Backward Language Model for End-To-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Leveraging Large Text Corpora For End-To-End Speech Summarization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speech Summarization of Long Spoken Document: Improving Memory Efficiency of Speech/Text Encoders.
Proceedings of the IEEE International Conference on Acoustics, 2023

Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Summarize While Translating: Universal Model With Parallel Decoding for Summarization and Translation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Streaming End-to-End ASR Using CTC Decoder and DRA for Linguistic Information Substitution.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Coarse-Age Loss: A New Training Method Using Coarse-Age Labeled Data for Speaker Age Estimation.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Language modeling for spontaneous speech recognition based on disfluency labeling and generation of disfluent text.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Combining multiple end-to-end speech recognition models based on density ratio approach.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Multi-Source Domain Generalization Using Domain Attributes for Recurrent Neural Network Language Models.
IEICE Trans. Inf. Syst., 2022

Subjective intelligibility of speech sounds enhanced by ideal ratio mask via crowdsourced remote experiments with effective data screening.
CoRR, 2022

End-to-End Spontaneous Speech Recognition Using Disfluency Labeling.
Proceedings of the Interspeech 2022, 2022

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

Integrating Multiple ASR Systems into NLP Backend with Attention Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Comparison of Remote Experiments Using Crowdsourcing and Laboratory Experiments on Speech Intelligibility.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2021

BLSTM-Based Confidence Estimation for End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Robust Speech-Age Estimation Using Local Maximum Mean Discrepancy Under Mismatched Recording Conditions.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Attention-Based Multi-Hypothesis Fusion for Speech Summarization.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Advanced language model fusion method for encoder-decoder model in Japanese speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

End-to-End Spontaneous Speech Recognition Using Hesitation Labeling.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Language Model Data Augmentation Based on Text Domain Transfer.
Proceedings of the Interspeech 2020, 2020

Predicting Intelligibility of Enhanced Speech Using Posteriors Derived from DNN-Based ASR System.
Proceedings of the Interspeech 2020, 2020

Frame-Level Phoneme-Invariant Speaker Embedding for Text-Independent Speaker Recognition on Extremely Short Utterances.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving Speaker-Attribute Estimation by Voting Based on Speaker Cluster Information.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Noise-robust Attention Learning for End-to-End Speech Recognition.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers.
IEICE Trans. Inf. Syst., 2019

Improved Deep Duel Model for Rescoring N-Best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders.
Proceedings of the Interspeech 2019, 2019

Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues.
Proceedings of the Interspeech 2019, 2019

Improving Transformer-Based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration.
Proceedings of the Interspeech 2019, 2019

End-to-End SpeakerBeam for Single Channel Target Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Predicting Speech Intelligibility of Enhanced Speech Using Phone Accuracy of DNN-Based ASR System.
Proceedings of the Interspeech 2019, 2019

ILP-based Compressive Speech Summarization with Content Word Coverage Maximization and Its Oracle Performance Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Unified Framework for Neural Speech Separation and Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2019

Semi-supervised End-to-end Speech Recognition Using Text-to-speech and Autoencoders.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Unified Framework for Feature-based Domain Adaptation of Neural Network Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Context Adaptive Neural Network Based Acoustic Models for Rapid Adaptation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Semi-Supervised End-to-End Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Auxiliary Feature Based Adaptation of End-to-end ASR Systems.
Proceedings of the Interspeech 2018, 2018

Rescoring N-Best Speech Recognition List Based on One-on-One Hypothesis Comparison Using Encoder-Classifier Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Language Model Domain Adaptation Via Recurrent Neural Networks with Domain-Shared and Domain-Specific Representations.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sequence Training of Encoder-Decoder Model Using Policy Gradient for End-to-End Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Single Channel Target Speaker Extraction and Recognition with Speaker Beam.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Feature-Based Learning Hidden Unit Contributions for Domain Adaptation of RNN-LMs.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Factorised Hidden Layer Based Domain Adaptation for Recurrent Neural Network Language Models.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Error detection and accuracy estimation in automatic speech recognition using deep bidirectional recurrent neural networks.
Speech Commun., 2017

Speaker-Aware Neural Network Based Beamformer for Speaker Extraction in Speech Mixtures.
Proceedings of the Interspeech 2017, 2017

Uncertainty Decoding with Adaptive Sampling for Noise Robust DNN-Based Acoustic Modeling.
Proceedings of the Interspeech 2017, 2017

Unfolded Deep Recurrent Convolutional Neural Network with Jump Ahead Connections for Acoustic Modeling.
Proceedings of the Interspeech 2017, 2017

Improved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search.
Proceedings of the Interspeech 2017, 2017

Forward-Backward Convolutional LSTM for Acoustic Modeling.
Proceedings of the Interspeech 2017, 2017

Feedback connection for deep neural network-based acoustic modeling.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Cumulative moving averaged bottleneck speaker vectors for online speaker adaptation of CNN-based acoustic models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Deep mixture density network for statistical model-based feature enhancement.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Online environmental adaptation of CNN-based acoustic models using spatial diffuseness features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

Learning speaker representation for neural network based multichannel speaker extraction.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Exploiting imbalanced textual and acoustic data for training prosodically-enhanced RNNLMs.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Multichannel Speech Enhancement Approaches to DNN-Based Far-Field Speech Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Estimating Speech Recognition Accuracy Based on Error Type Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Differenced maximum mutual information criterion for robust unsupervised acoustic model adaptation.
Comput. Speech Lang., 2016

Factorized Linear Input Network for Acoustic Model Adaptation in Noisy Conditions.
Proceedings of the Interspeech 2016, 2016

Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement.
Proceedings of the Interspeech 2016, 2016

Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models.
Proceedings of the Interspeech 2016, 2016

Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Spatial correlation model based observation vector clustering and MVDR beamforming for meeting recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Strategies for distant speech recognitionin reverberant environments.
EURASIP J. Adv. Signal Process., 2015

Robust i-vector extraction for neural network adaptation in noisy environment.
Proceedings of the INTERSPEECH 2015, 2015

Text-informed speech enhancement with deep neural networks.
Proceedings of the INTERSPEECH 2015, 2015

ASR error detection and recognition rate estimation using deep bidirectional recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Double-layer neighborhood graph based similarity search for fast query-by-example spoken term detection.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Automatic Vocabulary Adaptation Based on Semantic and Acoustic Similarities.
IEICE Trans. Inf. Syst., 2014

Fast segment search for corpus-based speech enhancement based on speech recognition technology.
Proceedings of the IEEE International Conference on Acoustics, 2014

Zero-resource spoken term detection using hierarchical graph-based similarity search.
Proceedings of the IEEE International Conference on Acoustics, 2014

Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013
Prior-shared feature and model space speaker adaptation by consistently employing map estimation.
Speech Commun., 2013

Fast unsupervised adaptation based on efficient statistics accumulation using frame independent confidence within monophone states.
Comput. Speech Lang., 2013

Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds.
Comput. Speech Lang., 2013

Unsupervised discriminative language modeling using error rate estimator.
Proceedings of the INTERSPEECH 2013, 2013

Discriminative recognition rate estimation for N-best list and its application to N-best rescoring.
Proceedings of the IEEE International Conference on Acoustics, 2013

Coupling beamforming with spatial and spectral feature based spectral enhancement and its application to meeting recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Feature space variational Bayesian linear regression and its combination with model space VBLR.
Proceedings of the IEEE International Conference on Acoustics, 2013

Unsupervised discriminative adaptation using differenced maximum mutual information based linear regression.
Proceedings of the IEEE International Conference on Acoustics, 2013

Graph index based query-by-example search on a large speech data set.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera.
IEEE Trans. Speech Audio Process., 2012

Joint estimation of confidence and error causes in speech recognition.
Speech Commun., 2012

Recognition rate estimation based on word alignment network and discriminative error type classification.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Dynamic variance adaptation using differenced maximum mutual information.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Automatic Vocabulary Adaptation Based on Semantic Similarity and Speech Recognition Confidence Measure.
Proceedings of the INTERSPEECH 2012, 2012

Speaker Adaptation Using Variational Bayesian Linear Regression in Normalized Feature Space.
Proceedings of the INTERSPEECH 2012, 2012

Error type classification and word accuracy estimation using alignment features from word confusion network.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Discriminative feature transforms using differenced maximum mutual information.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Efficient Combination of Likelihood Recycling and Batch Calculation for Fast Acoustic Likelihood Calculation.
IEICE Trans. Inf. Syst., 2011

Machine and acoustical condition dependency analyses for fast acoustic likelihood calculation techniques.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Real-time meeting recognition and understanding using distant microphones and omni-directional camera.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

A novel confidence measure based on marginalization of jointly estimated error cause probabilities.
Proceedings of the INTERSPEECH 2010, 2010

Discriminative confidence and error cause estimation for extended speech recognition function.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Simultaneous estimation of confidence and error cause in speech recognition using discriminative model.
Proceedings of the INTERSPEECH 2009, 2009

Rapid unsupervised adaptation using frame independent output probabilities of gender and context independent phoneme models.
Proceedings of the INTERSPEECH 2009, 2009

Efficient combination of likelihood recycling and batch calculation based on conditional fast processing and acoustic back-off.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model.
Proceedings of the IEEE International Conference on Acoustics, 2008

2005
Children's speech recognition using elementary-school-student speech database.
Syst. Comput. Jpn., 2005

Rapid response and robust speech recognition by preliminary model adaptation for additive and convolutional noise.
Proceedings of the INTERSPEECH 2005, 2005

2003
Speaker adaptation for non-native speakers using bilingual English lexicon and acoustic models.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Non-native English speech recognition using bilingual English lexicon and acoustic models.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2000
Novel two-pass search strategy using time-asynchronous shortest-first second-pass beam search.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1998
Estimating entropy of a language from optimal word insertion penalty.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Balancing acoustic and linguistic probabilities.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998


  Loading...