We stand with Ukraine

We stand with Ukraine

Haihua Xu

According to our database¹, Haihua Xu authored at least 76 papers between 2008 and 2023.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2023

Speech-Text Based Multi-Modal Training with Bidirectional Attention for Improved Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Internal Language Model Estimation Based Adaptive Language Model Fusion for Domain Adaptation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Reducing Language Confusion for Code-Switching Speech Recognition with Token-Level Language Diarization.

[BibT_eX]

[DOI]

,

,

Leibny Paola García

,

Andy W. H. Khong

,

,

Sanjeev Khudanpur

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Improving short-video speech recognition using random utterance concatenation.

[BibT_eX]

[DOI]

,

,

Yerbolat Khassanov

,

,

,

,

,

CoRR, 2022

Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2022

Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2022

Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Interspeech 2022, 2022

Minimum Word Error Training For Non-Autoregressive Transformer-Based Code-Switching ASR.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning.

[BibT_eX]

[DOI]

,

,

,

Yerbolat Khassanov

,

,

,

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems.

[BibT_eX]

[DOI]

,

Yerbolat Khassanov

,

,

,

,

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Enriching Under-Represented Named Entities for Improved Speech Recognition.

[BibT_eX]

[DOI]

,

Yerbolat Khassanov

,

,

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Multitask-based joint learning approach to robust ASR for radio communication speech.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance.

[BibT_eX]

[DOI]

,

Yerbolat Khassanov

,

,

,

,

,

CoRR, 2020

The NTU-AISG Text-to-speech System for Blizzard Challenge 2020.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2020

Spatial-Scale Aligned Network for Fine-Grained Recognition.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2020

Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Interspeech 2020, 2020

Independent Language Modeling Architecture for End-To-End ASR.

[BibT_eX]

[DOI]

,

,

Yerbolat Khassanov

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Electromagnetic Transient Modeling and Simulation of Power Converters Based on a Piecewise Generalized State Space Averaging Method.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Access, 2019

On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition.

[BibT_eX]

[DOI]

,

Yerbolat Khassanov

,

,

,

,

Proceedings of the Interspeech 2019, 2019

Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation.

[BibT_eX]

[DOI]

Yerbolat Khassanov

,

,

,

,

Proceedings of the Interspeech 2019, 2019

Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data.

[BibT_eX]

[DOI]

Yerbolat Khassanov

,

,

,

,

,

,

Proceedings of the Interspeech 2019, 2019

Cross-lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling.

[BibT_eX]

[DOI]

,

,

,

Rohan Kumar Das

,

Proceedings of the IEEE International Conference on Acoustics, 2019

Audio Codec Simulation based Data Augmentation for Telephony Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Improving code-switching speech recognition with data augmentation and system combination.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Re-ranking spoken term detection with acoustic exemplars of keywords.

[BibT_eX]

[DOI]

,

,

,

,

,

Speech Commun., 2018

Average Modeling Approach to Voice Conversion with Non-Parallel Data.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Mandarin-English Code-switching Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Interspeech 2018, 2018

Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Interspeech 2018, 2018

2017

Mandarin tone modeling using recurrent neural networks.

[BibT_eX]

[DOI]

,

,

CoRR, 2017

Pruning Strategies for Partial Search in Spoken Term Detection.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Eighth International Symposium on Information and Communication Technology, 2017

The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016.

[BibT_eX]

[DOI]

,

Ville Hautamäki

,

,

Anthony Larcher

,

,

Andreas Nautsch

,

Themos Stafylakis

,

,

Mickaël Rouvier

,

,

Federico Alegre

,

,

,

Achintya Kumar Sarkar

,

Héctor Delgado

,

,

Hagai Aronowitz

,

Aleksandr Sizov

,

,

Trung Hieu Nguyen

,

,

,

,

,

,

Anssi Kanervisto

,

,

Fahimeh Bahmaninezhad

,

Sergey Isadskiy

,

Christian Rathgeb

,

Christoph Busch

,

Georgios Tzimiropoulos

,

,

,

,

,

,

,

,

,

,

Pierre-Michel Bousquet

,

,

Waad Ben Kheder

,

,

,

,

,

,

,

Benoit G. B. Fauve

,

Kaavya Sriskandaraja

,

Vidhyasaharan Sethu

,

,

Dennis Alexander Lehmann Thomsen

,

,

Massimiliano Todisco

,

Nicholas W. D. Evans

,

,

John H. L. Hansen

,

Jean-François Bonastre

,

Eliathamby Ambikairajah

Proceedings of the Interspeech 2017, 2017

Improving N-gram language modeling for code-switching speech recognition.

[BibT_eX]

[DOI]

,

,

Tze Yuang Chong

,

,

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Mark Hasegawa-Johnson

,

,

,

,

,

,

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Fantastic 4 system for NIST 2015 Language Recognition Evaluation.

[BibT_eX]

[DOI]

,

Ville Hautamäki

,

Anthony Larcher

,

,

,

Trung Hieu Nguyen

,

,

Aleksandr Sizov

,

,

Amir Hossein Poorjam

,

Trung Ngo Trong

,

,

,

,

,

,

Sylvain Meignier

CoRR, 2016

The NNI Vietnamese Speech Recognition System for MediaEval 2016.

[BibT_eX]

[DOI]

,

,

Cheung-Chi Leung

,

,

,

,

,

,

,

,

Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

Neural networks based channel compensation for i-vector speaker verification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Interspeech 2016, 2016

Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Interspeech 2016, 2016

Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis.

[BibT_eX]

[DOI]

Cheung-Chi Leung

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Interspeech 2016, 2016

The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2016, 2016

Approximate search of audio queries by using DTW with phone time boundary and data augmentation.

[BibT_eX]

[DOI]

,

,

,

,

Cheung-Chi Leung

,

,

,

,

,

,

,

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Keyword search using query expansion for graph-based rescoring of hypothesized detections.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exemplar-inspired strategies for low-resource spoken keyword search in Swahili.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

I-vector based deep neural network acoustic model adaptation using multilingual language resource.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

Maximum F1-Score Discriminative Training Criterion for Automatic Mispronunciation Detection.

[BibT_eX]

[DOI]

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2015

NLP based congestive heart failure case finding: A prospective analysis on statewide electronic medical records.

[BibT_eX]

[DOI]

Int. J. Medical Informatics, 2015

The NNI Query-by-Example System for MediaEval 2015.

[BibT_eX]

[DOI]

,

,

Cheung-Chi Leung

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

A comparative study of BNF and DNN multilingual training on cross-lingual low-resource speech recognition.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the INTERSPEECH 2015, 2015

Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the INTERSPEECH 2015, 2015

Multi-softmax deep neural network for semi-supervised training.

[BibT_eX]

[DOI]

,

Proceedings of the INTERSPEECH 2015, 2015

Language independent query-by-example spoken term detection using N-best phone sequences and partial matching.

[BibT_eX]

[DOI]

,

,

,

,

Cheung-Chi Leung

,

,

,

,

,

,

,

,

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Low-resource keyword search strategies for tamil.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Cheung-Chi Leung

,

,

,

,

,

,

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

On statistical machine translation method for lexicon refinement in speech recognition.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Detecting synthetic speech using long term magnitude and phase information.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Risk prediction of stroke: A prospective statewide study on patients in Maine.

[BibT_eX]

[DOI]

,

,

,

Karl G. Sylvester

,

Xuefeng Bruce Ling

,

Andrew Young Shin

,

,

,

,

,

,

,

,

Devore S. Culver

,

Shaun T. Alfreds

,

Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

On the study of very low-resource language keyword search.

[BibT_eX]

[DOI]

,

,

,

Tze Yuang Chong

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

System and keyword dependent fusion for spoken term detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The NNI Query-by-Example System for MediaEval 2014.

[BibT_eX]

[DOI]

,

,

,

,

Cheung-Chi Leung

,

,

,

,

,

,

,

,

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Semi-supervised training for bottle-neck feature based DNN-HMM hybrid systems.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the INTERSPEECH 2014, 2014

Discriminative score normalization for keyword search decision.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Strategies for Vietnamese keyword search.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Towards better keyword search performance on Malay broadcast news data.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

The development and analysis of a Malay broadcasr news corpus.

[BibT_eX]

[DOI]

Tze Yuang Chong

,

,

,

,

,

,

,

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

2011

Aniterative approach to Bayes risk decoding and system combination.

[BibT_eX]

[DOI]

,

J. Zhejiang Univ. Sci. C, 2011

Minimum Bayes Risk decoding and system combination based on a recursion for edit distance.

[BibT_eX]

[DOI]

,

,

,

Comput. Speech Lang., 2011

2010

An improved consensus-like method for Minimum Bayes Risk decoding and lattice combination.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Minimum tag error for discriminative training of conditional random fields.

[BibT_eX]

[DOI]

,

,

,

Inf. Sci., 2009

Minimum hypothesis phone error as a decoding method for speech recognition.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the INTERSPEECH 2009, 2009

An efficient multistage Rover method for Automatic Speech recognition.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Minimum phone error based stream weight training for mandarin audio-visual Speech recognition.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A hybrid visual feature extraction method for audio-visual speech recognition.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Image Processing, 2009

2008

Towards more efficient and accurate methods for Mandarin LVCSR discriminative training.

[BibT_eX]

[DOI]

,

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Loading...