Ming Tu

This is a disambiguation page; it contains multiple papers by people with the same or a similar name.

Bibliography

2023
Language-universal phonetic encoder for low-resource speech recognition.
CoRR, 2023

Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition.
CoRR, 2023

Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition.
CoRR, 2023

Efficient Neural Music Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance.
CoRR, 2022

Cloning One's Voice Using Very Limited Data in the Wild.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
Graph Sequential Network for Reasoning over Sequences.
CoRR, 2020

Linear-Quadratic Tracking Control of a Commercial Vehicle Air Brake System.
IEEE Access, 2020

Speaker-Invariant Affective Representation Learning via Adversarial Training.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Select, Answer and Explain: Interpretable Multi-Hop Reading Comprehension over Multiple Documents.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Articulation constrained learning with application to speech emotion recognition.
EURASIP J. Audio Speech Music. Process., 2019

Multiple instance learning with graph neural networks.
CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

Towards adversarial learning of speaker-invariant representation for speech emotion recognition.
CoRR, 2019

Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
A Discriminative Acoustic-Prosodic Approach for Measuring Local Entrainment.
Proceedings of the Interspeech 2018, 2018

Investigating the Role of L1 in Automatic Pronunciation Evaluation of L2 Speech.
Proceedings of the Interspeech 2018, 2018

Simulating Dysarthric Speech for Training Data Augmentation in Clinical Speech Applications.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Improving efficiency in sparse learning with the feedforward inhibitory motif.
Neurocomputing, 2017

Interpretable Objective Assessment of Dysarthric Speech Based on Deep Neural Networks.
Proceedings of the Interspeech 2017, 2017

Speech enhancement based on Deep Neural Networks with skip connections.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Objective assessment of pathological speech using distribution regression.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Reducing the Model Order of Deep Neural Networks Using Information Theory.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2016

Accent Identification by Combining Deep Neural Networks and Recurrent Neural Networks Trained on Long and Short Term Features.
Proceedings of the Interspeech 2016, 2016

Ranking the parameters of deep neural networks using the fisher information.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Online speaking rate estimation using recurrent neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Models for objective evaluation of dysarthric speech from data annotated by multiple listeners.
Proceedings of the 50th Asilomar Conference on Signals, Systems and Computers, 2016

2015
Convex Weighting Criteria for Speaking Rate Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Estimating speaking rate in spontaneous discourse.
Proceedings of the 49th Asilomar Conference on Signals, Systems and Computers, 2015

2014
Towards improving statistical model based voice activity detection.
Proceedings of the INTERSPEECH 2014, 2014

Computational Auditory Scene Analysis Based Voice Activity Detection.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Improving voice quality of HMM-based speech synthesis using voice conversion method.
Proceedings of the IEEE International Conference on Acoustics, 2014

2012
OpenCDS ePHR: an Open-Source, Standards-Based Decision Support Platform for Electronic Public Health Reporting.
Proceedings of the AMIA 2012, 2012

