Ming Li

According to our database, Ming Li authored at least 79 papers between 2000 and 2020.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2020
On-the-Fly Data Loader and Utterance-Level Aggregation for Speaker and Language Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multi-task Learning with Alignment Loss for Far-field Small-Footprint Keyword Spotting.
CoRR, 2020

Within-sample variability-invariant loss for robust speaker recognition under noisy environments.
CoRR, 2020

The FFSVC 2020 Evaluation Plan.
CoRR, 2020

2019
String Stability Analysis for Vehicle Platooning Under Unreliable Communication Links With Event-Triggered Strategy.
IEEE Trans. Veh. Technol., 2019

Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features.
CoRR, 2019

The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion.
Proceedings of the Interspeech 2019, 2019

Fixation Based Object Recognition in Autism Clinic Setting.
Proceedings of the Intelligent Robotics and Applications - 12th International Conference, 2019

F0 Contour Estimation Using Phonetic Feature in Electrolaryngeal Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019

Utterance-level End-to-end Language Identification Using Attention-based CNN-BLSTM.
Proceedings of the IEEE International Conference on Acoustics, 2019

DKU-Tencent Submission to Oriental Language Recognition AP18-OLR Challenge.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Deep Neural Networks with Batch Speaker Normalization for Intoxicated Speech Detection.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Cancellable speech template via random binary orthogonal matrices projection hashing.
Pattern Recognit., 2018

Insights into End-to-End Learning Scheme for Language Identification.
CoRR, 2018

Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

End-to-end Language Identification using NetFV and NetVLAD.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Analysis of Length Normalization in End-to-End Speaker Verification System.
Proceedings of the Interspeech 2018, 2018

A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Insights into End-to-End Learning Scheme for Language Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum.
Proceedings of the Interspeech 2017, 2017

Countermeasures for Automatic Speaker Verification Replay Spoofing Attack: On Data Augmentation, Feature Representation, Classification and Fusion.
Proceedings of the Interspeech 2017, 2017

Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Response to name: A dataset and a multimodal machine learning framework towards autism study.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

2016
Generalized I-vector Representation with Phonetic Tokenizations and Tandem Features for both Text Independent and Text Dependent Speaker Verification.
J. Signal Process. Syst., 2016

Speaker verification based on the fusion of speech acoustics and inverted articulatory signals.
Comput. Speech Lang., 2016

Speaker diarization system for autism children's real-life audio data.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Text-independent voice conversion using deep neural network based phonetic level features.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Locality sensitive discriminant analysis for speaker verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

On Order-Constrained Transitive Distance Clustering.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Automatic intelligibility classification of sentence-level pathological speech.
Comput. Speech Lang., 2015

The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge.
CoRR, 2015

Speech bandwidth expansion based on deep neural networks.
Proceedings of the INTERSPEECH 2015, 2015

Locality constrained transitive distance clustering on speech data.
Proceedings of the INTERSPEECH 2015, 2015

Duration dependent covariance regularization in PLDA modeling for speaker verification.
Proceedings of the INTERSPEECH 2015, 2015

The SYSU system for the interspeech 2015 automatic speaker verification spoofing and countermeasures challenge.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Efficient autism spectrum disorder prediction with eye movement: A machine learning framework.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification.
Comput. Speech Lang., 2014

Intoxicated speech detection: A fusion framework with speaker-normalized hierarchical functionals and GMM supervectors.
Comput. Speech Lang., 2014

An iterative framework for unsupervised learning in the PLDA based speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Speaker verification and spoken language identification using a generalized i-vector framework with phonetic tokenizations and tandem features.
Proceedings of the INTERSPEECH 2014, 2014

Melody Extraction for Vocal Polyphonic Music Based on Bayesian Framework.
Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

Simplified and supervised i-vector modeling for speaker age regression.
Proceedings of the IEEE International Conference on Acoustics, 2014

Verification based ECG biometrics with cardiac irregular conditions using heartbeat level and segment level information fusion.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion.
Comput. Speech Lang., 2013

Multi-band long-term signal variability features for robust voice activity detection.
Proceedings of the INTERSPEECH 2013, 2013

Speaker verification based on fusion of acoustic and articulatory information.
Proceedings of the INTERSPEECH 2013, 2013

TRAP language identification system for RATS phase II evaluation.
Proceedings of the INTERSPEECH 2013, 2013

Classifying language-related developmental disorders from speech cues: the promise and the potential confounds.
Proceedings of the INTERSPEECH 2013, 2013

Speaker verification using simplified and supervised i-vector modeling.
Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic Vocal Segments Detection in Popular Music.
Proceedings of the Ninth International Conference on Computational Intelligence and Security, 2013

2012
KNOWME: An Energy-Efficient Multimodal Body Area Network for Physical Activity Monitoring.
ACM Trans. Embed. Comput. Syst., 2012

KNOWME: a case study in wireless body area sensor network design.
IEEE Commun. Mag., 2012

Intelligibility classification of pathological speech using fusion of multiple high level descriptors.
Proceedings of the INTERSPEECH 2012, 2012

Speaker Personality Classification Using Systems Based on Acoustic-Lexical Cues and an Optimal Tree-Structured Bayesian Network.
Proceedings of the INTERSPEECH 2012, 2012

Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Speaker verification using Lasso based sparse total variability supervector with PLDA modeling.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Optimal Time-Resource Allocation for Energy-Efficient Physical Activity Detection.
IEEE Trans. Signal Process., 2011

Speaker Verification Using Sparse Representations on Total Variability i-vectors.
Proceedings of the INTERSPEECH 2011, 2011

Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors.
Proceedings of the INTERSPEECH 2011, 2011

Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors.
Proceedings of the IEEE International Conference on Acoustics, 2011

Modeling high-level descriptions of real-life physical activities using latent topic modeling of multimodal sensor signals.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011

2010
Combining five acoustic level modeling methods for automatic speaker age and gender recognition.
Proceedings of the INTERSPEECH 2010, 2010

Robust ECG Biometrics by Fusing Temporal and Cepstral Information.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
Automatic Singing Performance Evaluation for Untrained Singers.
IEICE Trans. Inf. Syst., 2009

Optimal Allocation of Time-Resources for Multihypothesis Activity-Level Detection.
Proceedings of the Distributed Computing in Sensor Systems, 2009

Optimal time-resource allocation for activity-detection via multimodal sensing.
Proceedings of the 4th International ICST Conference on Body Area Networks, 2009

2008
Melody Track Selection Using Discriminative Language Model.
IEICE Trans. Inf. Syst., 2008

Automatic Language Identification with Discriminative Language Characterization Based on SVM.
IEICE Trans. Inf. Syst., 2008

Using SVM as Back-End Classifier for Language Identification.
EURASIP J. Audio Speech Music. Process., 2008

Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping.
Proceedings of the INTERSPEECH 2008, 2008

An objective singing evaluation approach by relating acoustic measurements to perceptual ratings.
Proceedings of the INTERSPEECH 2008, 2008

2007
Singing Melody Extraction in Polyphonic Music by Harmonic Tracking.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Spoken language identification using score vector modeling and support vector machine.
Proceedings of the INTERSPEECH 2007, 2007

Authentication and Quality Monitoring based on Audio Watermark for Analog AM Shortwave Broadcasting.
Proceedings of the 3rd International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007), 2007

2006
A Novel Audio Watermarking in Wavelet Domain.
Proceedings of the Second International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2006), 2006

2000
Multi-group mixture weight HMM.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

