Michiel Bacchiani

Orcid: 0000-0003-4527-0197

According to our database1, Michiel Bacchiani authored at least 74 papers between 1994 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus.
CoRR, 2023

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

2022
Wavefit: an Iterative and Non-Autoregressive Neural Vocoder Based on Fixed-Point Iteration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping.
Proceedings of the Interspeech 2022, 2022

SNRi Target Training for Joint Speech Enhancement and Recognition.
Proceedings of the Interspeech 2022, 2022

Knowledge Transfer from Large-Scale Pretrained Language Models to End-To-End Speech Recognizers.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Joint Phoneme-Grapheme Model for End-To-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Speech Processing for Digital Home Assistants: Combining signal processing with deep-learning techniques.
IEEE Signal Process. Mag., 2019

Introduction to the Issue on Far-Field Speech Processing in the Era of Deep Learning: Speech Enhancement, Separation, and Recognition.
IEEE J. Sel. Top. Signal Process., 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

2018
An Overview of the IEEE SPS Speech and Language Technical Committee [In the Spotlight].
IEEE Signal Process. Mag., 2018

Toward Domain-Invariant Speech Recognition via Large Scale Training.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

From Audio to Semantics: Approaches to End-to-End Spoken Language Understanding.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Sampled Connectionist Temporal Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Spectral Distortion Model for Training Phase-Sensitive Deep-Neural Networks for Far-Field Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sound Source Separation Using Phase Difference and Reliable Mask Selection Selection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Performance of Mask Based Statistical Beamforming in a Smart Home Scenario.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model.
CoRR, 2017

End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow.
Proceedings of the Interspeech 2017, 2017


Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home.
Proceedings of the Interspeech 2017, 2017

Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Raw Multichannel Processing Using Deep Neural Networks.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Speech Research at Google to Enable Universal Speech Interfaces.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling.
Proceedings of the Interspeech 2016, 2016

Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction.
Proceedings of the Interspeech 2016, 2016

Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Factored spatial and spectral multichannel raw waveform CLDNNs.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Large vocabulary automatic speech recognition for children.
Proceedings of the INTERSPEECH 2015, 2015

Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Asynchronous stochastic optimization for sequence training of deep neural networks: towards big data.
Proceedings of the INTERSPEECH 2014, 2014

Robust speech recognition using temporal masking and thresholding algorithm.
Proceedings of the INTERSPEECH 2014, 2014

Asynchronous, online, GMM-free training of a context dependent acoustic model for speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

GMM-free DNN acoustic model training.
Proceedings of the IEEE International Conference on Acoustics, 2014

Asynchronous stochastic optimization for sequence training of deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Context dependent state tying for speech recognition using deep neural network acoustic models.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
ivector-based acoustic data selection.
Proceedings of the INTERSPEECH 2013, 2013

Rapid adaptation for mobile speech applications.
Proceedings of the IEEE International Conference on Acoustics, 2013

2011
TechWare: Mobile Media Search Resources [Best of the Web].
IEEE Signal Process. Mag., 2011

Discriminative Features for Language Identification.
Proceedings of the INTERSPEECH 2011, 2011

2010
Decision tree state clustering with word and syllable features.
Proceedings of the INTERSPEECH 2010, 2010

2009
Restoring punctuation and capitalization in transcribed speech.
Proceedings of the IEEE International Conference on Acoustics, 2009

An audio indexing system for election video material.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Confidence scores for acoustic model adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Deploying GOOG-411: Early lessons in data, measurement, and testing.
Proceedings of the IEEE International Conference on Acoustics, 2008

2006
MAP adaptation of stochastic grammars.
Comput. Speech Lang., 2006

2005
Fast vocabulary-independent audio search using path-based graph indexing.
Proceedings of the INTERSPEECH 2005, 2005

2004
Language Model Adaptation with MAP Estimation and the Perceptron Algorithm.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Improved name recognition with meta-data dependent name networks.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Meta-data conditional language modeling.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Supervised and unsupervised PCFG adaptation to novel domains.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Unsupervised language model adaptation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Combining maximum likelihood and maximum a posteriori estimation for detailed acoustic modeling of context dependency.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

SCANMail: a voicemail interface that makes speech browsable, readable and searchable.
Proceedings of the CHI 2002 Conference on Human Factors in Computing Systems: Changing our World, 2002

2001
Audio Browsing and Search in the Voicemail Domain.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

SCANMail: Audio Navigation in the Voicemail Domain.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Caller identification for the SCANMail voicemail browser.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

SCANMail: browsing and searching speech data by content.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Automatic transcription of voicemail at AT&T.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Using maximum likelihood linear regression for segment clustering and speaker identification.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Joint lexicon, acoustic unit inventory and model design.
Speech Commun., 1999

AT&T at TREC-8.
Proceedings of The Eighth Text REtrieval Conference, 1999

1998
Using automatically-derived acoustic sub-word units in large vocabulary speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1996
Speech recognition based on acoustically derived segment units.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Design of a speech recognition system based on acoustically derived segmental units.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Minimum classification error training algorithm for feature extractor and pattern classifier in speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
Optimization of time-frequency masking filters using the minimum classification error criterion.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994


  Loading...