Maurizio Omologo

Orcid: 0000-0003-0879-0548

According to our database1, Maurizio Omologo authored at least 124 papers between 1989 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
Audio-Visual Tracking of Concurrent Speakers.
IEEE Trans. Multim., 2022

Overlapped Speech Detection and speaker counting using distant microphone arrays.
Comput. Speech Lang., 2022

A neural prosody encoder for end-ro-end dialogue act classification.
CoRR, 2022

A Neural Prosody Encoder for End-to-End Dialogue Act Classification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Caching Networks: Capitalizing on Common Speech for ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Phonetically Induced Subwords for End-to-End Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Multi-Channel Transformer Transducer for Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Context-Aware Transformer Transducer for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Detecting and Counting Overlapping Speakers in Distant Speech Scenarios.
Proceedings of the Interspeech 2020, 2020

Sample drop detection for asynchronous devices distributed in space.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Multi-Speaker Tracking From an Audio-Visual Sensing Device.
IEEE Trans. Multim., 2019

Sample Drop Detection for Distant-speech Recognition with Asynchronous Devices Distributed in Space.
CoRR, 2019

LOCATA challenge: speaker localization with a planar array.
CoRR, 2019

Accurate Target Annotation in 3D from Multimodal Streams.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Light Gated Recurrent Units for Speech Recognition.
IEEE Trans. Emerg. Top. Comput. Intell., 2018

Automatic context window composition for distant speech recognition.
Speech Commun., 2018

Cepstral distance based channel selection for distant speech recognition.
Comput. Speech Lang., 2018

3D Mouth Tracking from a Compact Microphone Array Co-Located with a camera.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Audio Source Separation in Reverberant Environments Using β-Divergence-Based Nonnegative Factorization.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments.
CoRR, 2017

Improving Speech Recognition by Revising Gated Recurrent Units.
Proceedings of the Interspeech 2017, 2017

A reassigned based singing voice pitch contour extraction method.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A network of deep neural networks for Distant Speech Recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

3D audio-visual speaker tracking with an adaptive particle filter.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A reassigned front-end for speech recognition.
Proceedings of the 25th European Signal Processing Conference, 2017

2016
Batch-normalized joint training for DNN-based distant speech recognition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Realistic Multi-Microphone Data Simulation for Distant Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Channel Selection for Distant Speech Recognition Exploiting Cepstral Distance.
Proceedings of the Interspeech 2016, 2016

Estimation of the spatial information in Gaussian model based audio source separation using weighted spectral bases.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Contaminated speech training methods for robust DNN-HMM distant speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

A multi-channel corpus for distant-speech interaction in presence of known interferences.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Audio source separation using a redundant library of source spectral bases for non-negative tensor factorization.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The DIRHA-ENGLISH corpus and related tasks for distant-speech recognition in domestic environments.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Boosted acoustic model learning and hypotheses rescoring on the CHiME-3 task.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
The DIRHA simulated corpus.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Reverberant audio source separation using partially pre-trained nonnegative matrix factorization.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

On the selection of the impulse responses for distant-speech recognition based on contaminated speech training.
Proceedings of the INTERSPEECH 2014, 2014

Word boundary agreementto combine multi-microphone hypotheses in distant speech recognition.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

A speech event detection and localization task for multiroom environments.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Time-frequency reassigned cepstral coefficients for phone-level speech segmentation.
Proceedings of the 22nd European Signal Processing Conference, 2014

Exploiting inter-microphone agreement for hypothesis combination in distant speech recognition.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
An environment aware ML estimation of acoustic radiation pattern with distributed microphone pairs.
Signal Process., 2013

Reassigned spectrum-based feature extraction for GMM-based automatic chord recognition.
EURASIP J. Audio Speech Music. Process., 2013

Large-Scale Cover Song Identification Using Chord Profiles.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Embedding speech recognition to control lights.
Proceedings of the INTERSPEECH 2013, 2013

Geometric contamination for GMM/UBM speaker verification in reverberant environments.
Proceedings of the INTERSPEECH 2013, 2013

2012
Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources.
IEEE Trans. Speech Audio Process., 2012

Maximum a Posteriori Trajectory Estimation for Acoustic Source Tracking.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

Semi-Blind Model Adaptation using Piece-wise Energy Decay Curve for Large Reverberant Environments.
Proceedings of the INTERSPEECH 2012, 2012

Enhanced multidimensional spatial functions for unambiguous localization of multiple sparse acoustic sources.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A probabilistic approach to simultaneous extraction of beats and downbeats.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Convolutive Underdetermined Source Separation through Weighted Interleaved ICA and Spatio-temporal Source Correlation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Environment aware estimation of the orientation of acoustic sources using a line array.
Proceedings of the 20th European Signal Processing Conference, 2012

Impulse response estimation for robust speech recognition in a reverberant environment.
Proceedings of the 20th European Signal Processing Conference, 2012

Acoustic model adaptation using piece-wise energy decay curve for reverberant environments.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
Convolutive BSS of Short Mixtures by ICA Recursively Regularized Across Frequencies.
IEEE Trans. Speech Audio Process., 2011


Approximated kernel density estimation for multiple TDOA detection.
Proceedings of the IEEE International Conference on Acoustics, 2011

Time-frequency reassigned features for automatic chord recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Inference of acoustic source directivity using environment awareness.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
WOZ acoustic data collection for interactive TV.
Lang. Resour. Evaluation, 2010

Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments.
IEEE J. Sel. Top. Signal Process., 2010

Multiple Source Localization Based on Acoustic Map De-Emphasis.
EURASIP J. Audio Speech Music. Process., 2010

Experiments on distant-talking speaker verification in TV scenario.
Proceedings of the IEEE International Conference on Acoustics, 2010

Cooperative Wiener-ICA for source localization and Separation by distributed microphone arrays.
Proceedings of the IEEE International Conference on Acoustics, 2010


2009
Acoustic Event Detection and Classification.
Proceedings of the Computers in the Human Interaction Loop, 2009

Generalized State Coherence Transform for multidimensional localization of multiple sources.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Use of Hidden Markov Models and Factored Language Models for Automatic Chord Recognition.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Cumulative State Coherence Transform for a Robust Two-Channel Multiple Source Localization.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

Robust two-channel TDOA estimation for multiple speaker localization by using recursive ICA and a state coherence transform.
Proceedings of the IEEE International Conference on Acoustics, 2009

A sequential Monte Carlo approach for tracking of overlapping acoustic sources.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Combination of clean and contaminated GMM/SVM for far-field text-independent speaker verification.
Proceedings of the INTERSPEECH 2008, 2008

Acoustic event classification using a distributed microphone network with a GMM/SVM combined algorithm.
Proceedings of the INTERSPEECH 2008, 2008

Localization of multiple speakers based on a two step acoustic map analysis.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Adaptive weighting of microphone arrays for distant-talking F0 and voiced/unvoiced estimation.
Proceedings of the INTERSPEECH 2007, 2007

Classification of Acoustic Maps to Determine Speaker Position and Orientation from a Distributed Microphone Network.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Speech Recognition in Reverberant Environments Using Remote Microphones.
Proceedings of the Eigth IEEE International Symposium on Multimedia (ISM 2006), 2006

Multi-microphone periodicity function for robust F0 estimation in real noisy and reverberant environments.
Proceedings of the INTERSPEECH 2006, 2006

Speaker localization based on oriented global coherence field.
Proceedings of the INTERSPEECH 2006, 2006

Robust F0 estimation based on a multi-microphone periodicity function for distant-talking speech.
Proceedings of the 14th European Signal Processing Conference, 2006

N-best parallel maximum likelihood beamformers for robust speech recognition.
Proceedings of the 14th European Signal Processing Conference, 2006

CLEAR Evaluation of Acoustic Event Detection and Classification Systems.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006

A Generative Approach to Audio-Visual Person Tracking.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006

2005
Speaker Localization in CHIL Lectures: Evaluation Criteria and Results.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

Oriented global coherence field for the estimation of the head orientation in smart rooms equipped with distributed microphone arrays.
Proceedings of the INTERSPEECH 2005, 2005

Automatic Speech Activity Detection, Source Localization, and Speech Recognition on the Chil Seminar Corpus.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004
On the use of a weighted autocorrelation based fundamental frequency estimation for a multidimensional speech input.
Proceedings of the INTERSPEECH 2004, 2004

Weighted autocorrelation-based F0 estimation for distant-talking interaction with a distributed microphone network.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Use of a CSP-based voice activity detector for distant-talking ASR.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Use of parallel recognizers for robust in-car speech interaction.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Hidden Markov model training with contaminated speech material for distant-talking speech recognition.
Comput. Speech Lang., 2002

On the joint use of noise reduction and MLLR adaptation for in-car hands-free speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Annotation in the SpeechDat Projects.
Int. J. Speech Technol., 2001

Use of real and contaminated speech for training of a hands-free in-car speech recognizer.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speech Recognition with Microphone Arrays.
Proceedings of the Microphone Arrays - Signal Processing Techniques and Applications, 2001

2000
Annotation of a Multichannel Noisy Speech Corpus.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Training of HMM with filtered speech material for hands-free recognition.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Environmental conditions and acoustic transduction in hands-free speech recognition.
Speech Commun., 1998

Experiments of HMM adaptation for hands-free connected digit recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Use of the crosspower-spectrum phase in acoustic event location.
IEEE Trans. Speech Audio Process., 1997

Use of different microphone array configurations for hands-free speech recognition in noisy and reverberant environment.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Automatic diphone extraction for an Italian text-to-speech synthesis system.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Acoustic source location in a three-dimensional space using crosspower spectrum phase.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Microphone array based speech recognition with different talker-array positions.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Experiments of speech recognition in a noisy and reverberant environment using a microphone array and HMM.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Acoustic source location in noisy and reverberant environment using CSP analysis.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Robust continuous speech recognition using a microphone array.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Hands free continuous speech recognition in noisy environment using a four microphone array.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Talker localization and speech recognition using a microphone array and a cross-powerspectrum phase analysis.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Speaker independent continuous speech recognition using an acoustic-phonetic Italian corpus.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Acoustic event localization using a crosspower-spectrum phase based technique.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Automatic segmentation and labeling of speech based on Hidden Markov Models.
Speech Commun., 1993

Talker localization and speech enhancement in a noisy environment using a microphone array based acquisition system.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A baseline of a speaker independent continuous speech recognizer of Italian.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Automatic segmentation and labeling of English and Italian speech databases.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Improved connected digit recognition using spectral variation functions.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

A HMM-based system for automatic segmentation and labeling of speech.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

A family of parallel hidden Markov models.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
A preliminary statistical evaluation of manual and automatic segmentation discrepancies.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

A parallel HMM approach to speech recognition.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1989
The computation and some spectral considerations on line spectrum pairs (LSP).
Proceedings of the First European Conference on Speech Communication and Technology, 1989


  Loading...