Naohiro Tawara

According to our database1, Naohiro Tawara authored at least 32 papers between 2011 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Discriminative Training of VBx Diarization.
CoRR, 2023

NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization.
CoRR, 2023

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization.
CoRR, 2023

Iterative Shallow Fusion of Backward Language Model for End-To-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Voice or Content? - Exploring Impact of Speech Content on Age Estimation from Voice.
Proceedings of the 31st European Signal Processing Conference, 2023

Coarse-Age Loss: A New Training Method Using Coarse-Age Labeled Data for Speaker Age Estimation.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Multi-Source Domain Generalization Using Domain Attributes for Recurrent Neural Network Language Models.
IEICE Trans. Inf. Syst., 2022

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2021

BLSTM-Based Confidence Estimation for End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Integrating End-to-End Neural and Clustering-Based Diarization: Getting the Best of Both Worlds.
Proceedings of the IEEE International Conference on Acoustics, 2021

Robust Speech-Age Estimation Using Local Maximum Mean Discrepancy Under Mismatched Recording Conditions.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Language Model Data Augmentation Based on Text Domain Transfer.
Proceedings of the Interspeech 2020, 2020

Frame-Level Phoneme-Invariant Speaker Embedding for Text-Independent Speaker Recognition on Extremely Short Utterances.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving Speaker-Attribute Estimation by Voting Based on Speaker Cluster Information.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Noise-robust Attention Learning for End-to-End Speech Recognition.
Proceedings of the 28th European Signal Processing Conference, 2020

Speaker Age Estimation Using Age-Dependent Insensitive Loss.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder.
Proceedings of the Interspeech 2019, 2019

Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages.
Proceedings of the Interspeech 2019, 2019

Postfiltering Using an Adversarial Denoising Autoencoder with Noise-aware Training.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Sequential Fish Catch Forecasting Using Bayesian State Space Models.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Speaker Invariant Feature Extraction for Zero-Resource Languages with Adversarial Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Language Model Domain Adaptation Via Recurrent Neural Networks with Domain-Shared and Domain-Specific Representations.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Adversarial autoencoder for reducing nonlinear distortion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2015
A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2013
Blocked Gibbs sampling based multi-scale mixture model for speaker clustering on noisy data.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

2012
Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model.
Proceedings of the INTERSPEECH 2012, 2012

Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model.
Proceedings of the INTERSPEECH 2011, 2011


  Loading...