Ming Li
According to our database1,
Ming Li
authored at least 79 papers
between 2000 and 2020.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2020
On-the-Fly Data Loader and Utterance-Level Aggregation for Speaker and Language Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Mutli-task Learning with Alignment Loss for Far-field Small-Footprint Keyword Spotting.
CoRR, 2020
Within-sample variability-invariant loss for robust speaker recognition under noisy environments.
CoRR, 2020
2019
String Stability Analysis for Vehicle Platooning Under Unreliable Communication Links With Event-Triggered Strategy.
IEEE Trans. Veh. Technol., 2019
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features.
CoRR, 2019
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion.
Proceedings of the Interspeech 2019, 2019
Proceedings of the Intelligent Robotics and Applications - 12th International Conference, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Deep Neural Networks with Batch Speaker Normalization for Intoxicated Speech Detection.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Cancellable speech template via random binary orthogonal matrices projection hashing.
Pattern Recognit., 2018
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the Interspeech 2018, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum.
Proceedings of the Interspeech 2017, 2017
Countermeasures for Automatic Speaker Verification Replay Spoofing Attack : On Data Augmentation, Feature Representation, Classification and Fusion.
Proceedings of the Interspeech 2017, 2017
Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Response to name: A dataset and a multimodal machine learning framework towards autism study.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
2016
Generalized I-vector Representation with Phonetic Tokenizations and Tandem Features for both Text Independent and Text Dependent Speaker Verification.
J. Signal Process. Syst., 2016
Speaker verification based on the fusion of speech acoustics and inverted articulatory signals.
Comput. Speech Lang., 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Text-independent voice conversion using deep neural network based phonetic level features.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Comput. Speech Lang., 2015
The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge.
CoRR, 2015
Proceedings of the INTERSPEECH 2015, 2015
Proceedings of the INTERSPEECH 2015, 2015
Duration dependent covariance regularization in PLDA modeling for speaker verification.
Proceedings of the INTERSPEECH 2015, 2015
The SYSU system for the interspeech 2015 automatic speaker verification spoofing and countermeasures challenge.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Efficient autism spectrum disorder prediction with eye movement: A machine learning framework.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification.
Comput. Speech Lang., 2014
Intoxicated speech detection: A fusion framework with speaker-normalized hierarchical functionals and GMM supervectors.
Comput. Speech Lang., 2014
An iterative framework for unsupervised learning in the PLDA based speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Speaker verification and spoken language identification using a generalized i-vector framework with phonetic tokenizations and tandem features.
Proceedings of the INTERSPEECH 2014, 2014
Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Verification based ECG biometrics with cardiac irregular conditions using heartbeat level and segment level information fusion.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion.
Comput. Speech Lang., 2013
Multi-band long-term signal variability features for robust voice activity detection.
Proceedings of the INTERSPEECH 2013, 2013
Proceedings of the INTERSPEECH 2013, 2013
Proceedings of the INTERSPEECH 2013, 2013
Classifying language-related developmental disorders from speech cues: the promise and the potential confounds.
Proceedings of the INTERSPEECH 2013, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the Ninth International Conference on Computational Intelligence and Security, 2013
2012
KNOWME: An Energy-Efficient Multimodal Body Area Network for Physical Activity Monitoring.
ACM Trans. Embed. Comput. Syst., 2012
IEEE Commun. Mag., 2012
Intelligibility classification of pathological speech using fusion of multiple high level descriptors.
Proceedings of the INTERSPEECH 2012, 2012
Speaker Personality Classification Using Systems Based on Acoustic-Lexical Cues and an Optimal Tree-Structured Bayesian Network.
Proceedings of the INTERSPEECH 2012, 2012
Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Speaker verification using Lasso based sparse total variability supervector with PLDA modeling.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
IEEE Trans. Signal Process., 2011
Proceedings of the INTERSPEECH 2011, 2011
Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors.
Proceedings of the INTERSPEECH 2011, 2011
Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors.
Proceedings of the IEEE International Conference on Acoustics, 2011
Modeling high-level descriptions of real-life physical activities using latent topic modeling of multimodal sensor signals.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011
2010
Combining five acoustic level modeling methods for automatic speaker age and gender recognition.
Proceedings of the INTERSPEECH 2010, 2010
Proceedings of the 20th International Conference on Pattern Recognition, 2010
2009
IEICE Trans. Inf. Syst., 2009
Proceedings of the Distributed Computing in Sensor Systems, 2009
Proceedings of the 4th International ICST Conference on Body Area Networks, 2009
2008
IEICE Trans. Inf. Syst., 2008
Automatic Language Identification with Discriminative Language Characterization Based on SVM.
IEICE Trans. Inf. Syst., 2008
EURASIP J. Audio Speech Music. Process., 2008
Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping.
Proceedings of the INTERSPEECH 2008, 2008
An objective singing evaluation approach by relating acoustic measurements to perceptual ratings.
Proceedings of the INTERSPEECH 2008, 2008
2007
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
Spoken language identification using score vector modeling and support vector machine.
Proceedings of the INTERSPEECH 2007, 2007
Authentication and Quality Monitoring based on Audio Watermark for Analog AM Shortwave Broadcasting.
Proceedings of the 3rd International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007), 2007
2006
Proceedings of the Second International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2006), 2006
2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000