Tan Lee

According to our database1, Tan Lee authored at least 168 papers between 1992 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling.
CoRR, 2019

Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation.
CoRR, 2019

Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling.
CoRR, 2019

Enhancing Sound Texture in CNN-Based Acoustic Scene Classification.
CoRR, 2019

BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Enhancing Sound Texture in CNN-based Acoustic Scene Classification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Combining Phone Posteriorgrams from Strong and Weak Recognizers for Automatic Speech Assessment of People with Aphasia.
Proceedings of the IEEE International Conference on Acoustics, 2019

Adversarial Multi-task Deep Features and Unsupervised Back-end Adaptation for Language Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Revisiting Hidden Markov Models for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Guest Editorial: Advances in Deep Learning for Speech Processing.
Signal Processing Systems, 2018

Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

A Study on Acoustic Modeling for Child Speech Based on Multi-Task Learning.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

An End-to-End Approach to Automatic Speech Assessment for People with Aphasia.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

An Automated Assessment Tool for Child Speech Disorders.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Disordered Speech Assessment Using Kullback-Leibler Divergence Features with Multi-Task Acoustic Modeling.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Prediction of Voice Disorder Severity: Contributions from Sustained Vowels and Continuous Speech.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Automatic Speech Assessment for People with Aphasia Using TDNN-BLSTM with Multi-Task Learning.
Proceedings of the Interspeech 2018, 2018

Cross-cultural (A)symmetries in Audio-visual Attitude Perception.
Proceedings of the Interspeech 2018, 2018

Exploiting Speaker and Phonetic Diversity of Mismatched Language Resources for Unsupervised Subword Modeling.
Proceedings of the Interspeech 2018, 2018

Improving Cross-Lingual Knowledge Transferability Using Multilingual TDNN-BLSTM with Language-Dependent Pre-Final Layer.
Proceedings of the Interspeech 2018, 2018

Reducing Model Complexity for DNN Based Large-Scale Audio Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Automatic Speech Assessment for Aphasic Patients Based on Syllable-Level Embedding and Supra-Segmental Duration Features.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Pattern Discovery from Thematic Speech Archives Based on Multilingual Bottleneck Features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Audio-visual expressions of attitude: How many different attitudes can perceivers decode?
Speech Communication, 2017

Reducing Model Complexity for DNN Based Large-Scale Audio Classification.
CoRR, 2017

RNN-LDA Clustering for Feature Based DNN Adaptation.
Proceedings of the Interspeech 2017, 2017

Acoustic Assessment of Disordered Voice with Continuous Speech Based on Utterance-Level ASR Posterior Features.
Proceedings of the Interspeech 2017, 2017

On the Linguistic Relevance of Speech Units Learned by Unsupervised Acoustic Modeling.
Proceedings of the Interspeech 2017, 2017

Shefce: A Cantonese-English bilingual speech corpus for pronunciation assessment.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Cross-Language Perception of Audio-visual Attitudinal Expressions.
Proceedings of the Auditory-Visual Speech Processing, 2017

2016
Surface Electromyographic Activity of Extrinsic Laryngeal Muscles in Cantonese Tone Production.
Signal Processing Systems, 2016

The Sheffield language recognition system in NIST LRE 2015.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Towards automatic assessment of aphasia speech using automatic speech recognition techniques.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Exploiting language-mismatched phoneme recognizers for unsupervised acoustic modeling.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Predicting Severity of Voice Disorder from DNN-HMM Acoustic Posteriors.
Proceedings of the Interspeech 2016, 2016

Hybrid Accelerated Optimization for Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Automatic speech recognition for acoustical analysis and assessment of cantonese pathological voice and speech.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Supervised Single-Microphone Multi-Talker Speech Separation with Conditional Random Fields.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2015

Acoustic Segment Modeling with Spectral Clustering Methods.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2015

A method of speech periodicity enhancement using transform-domain signal decomposition.
Speech Communication, 2015

Objective measures for quality assessment of noise-suppressed speech.
Speech Communication, 2015

Analysis of intonation patterns in Cantonese aphasia speech.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Multi-pitch estimation based on sparse representation with pre-screened dictionary.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Modeling temporal dependency for robust estimation of LP model parameters in speech enhancement.
Proceedings of the INTERSPEECH 2015, 2015

2014
CUHK System for QUESST Task of MediaEval 2014.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Automatic Key Partition Based on Tonal Organization Information of Classical Music.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Correcting Chord Classification Errors Based on Tonal Organization Information of Classical Music.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Surface electromyographic activity of non-laryngeal neck muscles in Cantonese tone production.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Multipitch tracking based on linear programming relaxation and sparsity-based pitch candidate estimation.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Large-margin conditional random fields for single-microphone speech separation.
Proceedings of the INTERSPEECH 2014, 2014

A graph-based Gaussian component clustering approach to unsupervised acoustic modeling.
Proceedings of the INTERSPEECH 2014, 2014

2013
Spoken Language Recognition With Prosodic Features.
IEEE Trans. Audio, Speech & Language Processing, 2013

Pitch Estimation in Noisy Speech Using Accumulated Peak Spectrum and Sparse Estimation Technique.
IEEE Trans. Audio, Speech & Language Processing, 2013

Shifted-Delta MLP Features for Spoken Language Recognition.
IEEE Signal Process. Lett., 2013

The CUHK Spoken Web Search System for MediaEval 2013.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams.
Proceedings of the INTERSPEECH 2013, 2013

Using dynamic conditional random field on single-microphone speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

Evaluation of pitch estimation algorithms on separated speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

A speech enhancement method for cochlear implant listeners.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

Structured mean field method for single-microphone speech separation with factorial Hidden Markov Model.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Chord classification of multi-instrumental music using exemplar-based sparse representation.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Improving the sound quality of an electronic voice box.
Proceedings of the 6th International Conference on Biomedical Engineering and Informatics, 2013

2012
CUHK System for the Spoken Web Search task at Mediaeval 2012.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Two objective measures for speech distortion and noise reduction evaluation of enhanced speech signals.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Robust Pitch Estimation Using l1-regularized Maximum Likelihood Estimation.
Proceedings of the INTERSPEECH 2012, 2012

Integrating multiple observations for model-based single-microphone speech separation with conditional random fields.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

An acoustic segment modeling approach to query-by-example spoken term detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Transform-domain Wiener filter for speech periodicity enhancement.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Sparsity-based confidence measure for pitch estimation in noisy speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Exploration of Phase and Vocal Excitation Modulation Features for Speaker Recognition.
Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012

Classifying NMF components based on vector similarity for speech and music separation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features.
IEEE Trans. Audio, Speech & Language Processing, 2011

Transform-domain speech periodicity enhancement with adaptive coefficient weighting.
Proceedings of the International Symposium on Intelligent Signal Processing and Communications Systems, 2011

Score fusion and calibration in multiple language detectors with large performance variation.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Detection target dependent score calibration for language recognition.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Similarity Measures for Chinese Pop Music Based on Low-level Audio Signal Attributes.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

SURE-MSE speech enhancement for robust speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Robust speaker verification using phase information of speech.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Spectral trajectory estimation using nonnegative matrix factorization for model-based monaural speech separation.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Perception and analysis of linearly approximated F0 contours in Cantonese speech.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Semantics-based language modeling for Cantonese-English code-mixing speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Exploitation of phase information for speaker recognition.
Proceedings of the INTERSPEECH 2010, 2010

Towards long-range prosodic attribute modeling for language recognition.
Proceedings of the INTERSPEECH 2010, 2010

Perception-based automatic approximation of F0 contours in Cantonese speech.
Proceedings of the INTERSPEECH 2010, 2010

Pitch estimation in noisy speech based on temporal accumulation of spectrum peaks.
Proceedings of the INTERSPEECH 2010, 2010

Cross-lingual speaker adaptation via Gaussian component mapping.
Proceedings of the INTERSPEECH 2010, 2010

Prosodic attribute model for spoken language identification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Improved Cantonese Tone Recognition with Approximated F0 Contour: Implications for Cochlear Implants.
Proceedings of the International Conference on Asian Language Processing, 2010

A method of speech periodicity enhancement based on transform-domain signal decomposition.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Analysis and Selection of Prosodic Features for Asian Language Recognition.
Int. J. of Asian Lang. Proc., 2009

Automatic Recognition of Cantonese-English Code-Mixing Speech.
IJCLCLP, 2009

Analysis and Signal Processing of Oesophageal and Pathological Voices.
EURASIP J. Adv. Sig. Proc., 2009

Exploration of vocal excitation modulation features for speaker recognition.
Proceedings of the INTERSPEECH 2009, 2009

Model-based speech separation: identifying transcription using orthogonality.
Proceedings of the INTERSPEECH 2009, 2009

Effects of language mixing for automatic recognition of Cantonese-English code-mixing utterances.
Proceedings of the INTERSPEECH 2009, 2009

Analysis and Selection of Prosodic Features for Language Identification.
Proceedings of the 2009 International Conference on Asian Language Processing, 2009

2008
Tone-enhanced generalized character posterior probability (GCPP) for Cantonese LVCSR.
Computer Speech & Language, 2008

Deriving MFCC Parameters from the Dynamic Spectrum for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Mandarin Tone Perception with Temporal Envelope and Periodicity Cues from Different Frequency Regions.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Entropy-Based Analysis of the Prosodic Features of Chinese Dialects.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

A Perceptual Study of Approximated Cantonese Tone Contours.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Pitch Tracking for Model-Based Speech Separation.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Prosodic Variation in Cantonese-English Code-Mixed Speech.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech.
Proceedings of the INTERSPEECH 2008, 2008

Language modeling for speech recognition of spoken Cantonese.
Proceedings of the INTERSPEECH 2008, 2008

2007
Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR.
IEEE Trans. Audio, Speech & Language Processing, 2007

Discrimination Power of Vocal Source and Vocal Tract Related Features for Speaker Segmentation.
IEEE Trans. Audio, Speech & Language Processing, 2007

Integration of Complementary Acoustic Features for Speaker Recognition.
IEEE Signal Process. Lett., 2007

Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification.
IJCLCLP, 2007

A power-based adaptive method for eigenanalysis without square-root operations.
Digital Signal Processing, 2007

Quantitative analysis of F0 contours of emotional speech of Mandarin.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Perceptual equivalence of approximated Cantonese tone contours.
Proceedings of the INTERSPEECH 2007, 2007

Modeling tones in hakka on the basis of the command-response model.
Proceedings of the INTERSPEECH 2007, 2007

2006
Speech recognition on DSP: issues on computational efficiency and performance analysis.
Microprocessors and Microsystems, 2006

Using Duration Information in Cantonese Connected-Digit Recognition.
IJCLCLP, 2006

Modeling Cantonese Pronunciation Variations for Large-Vocabulary Continuous Speech Recognition.
IJCLCLP, 2006

Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Integrating Complementary Features with a Confidence Measure for Speaker Identification.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Towards automatic parameter extraction of command-response model for Cantonese.
Proceedings of the INTERSPEECH 2006, 2006

Improved tone modeling for Mandarin broadcast news speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

Automatic speech recognition of Cantonese-English code-mixing utterances.
Proceedings of the INTERSPEECH 2006, 2006

Tone-Enhanced Generalized Character Posterior Probability (GCPP) for Cantonese LVCSR.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Feature Extraction From Talking Mouths for Video-Based Bi-Modal Speaker Verification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Use of Vocal Source Features in Speaker Segmentation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Development of a Cantonese-English code-mixing speech corpus.
Proceedings of the INTERSPEECH 2005, 2005

Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Analysis and modeling of F0 contours for cantonese text-to-speech.
ACM Trans. Asian Lang. Inf. Process., 2004

On noise robustness of dynamic and static features for continuous Cantonese digit recognition.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Cantonese verbal information verification system using GMM-based anti-model.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Detection of language boundary in code-switching utterances by bi-phone probabilities.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Noise-robust automatic speech recognition using mainlobe-resilient time-frequency quantile-based noise estimation.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Explicit duration modeling for Cantonese connected-digit recognition.
Proceedings of the INTERSPEECH 2004, 2004

Time -frequency analysis of vocal source signal for speaker recognition.
Proceedings of the INTERSPEECH 2004, 2004

Tone information as a confidence measure for improving Cantonese LVCSR.
Proceedings of the INTERSPEECH 2004, 2004

2003
An HMM-based speech recognition IC.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

Overlapped di-tone modeling for tone recognition in continuous Cantonese speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Modeling Cantonese pronunciation variation by acoustic model refinement.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Using tone information in Cantonese continuous speech recognition.
ACM Trans. Asian Lang. Inf. Process., 2002

Spoken language resources for Cantonese speech processing.
Speech Communication, 2002

A new approach to generating Pitch Cycle Waveform (PCW) for Waveform Interpolation codec.
Microprocessors and Microsystems, 2002

Modeling tones in continuous Cantonese speech.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Unsupervised n-best based model adaptation using model-level confidence measures.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Design, Compilation and Processing of CUCall: A Set of Cantonese Spoken Language Corpora Collected Over Telephone Networks.
Proceedings of the 14th Conference on Computational Linguistics and Speech Processing, 2001

A Low Missing Rate Audio Search Technique for Cantonese Radio Broadcast Recording.
Proceedings of the Advances in Multimedia Information Processing, 2001

ISIS: a learning system with combined interaction and delegation dialogs.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Cantonese text-to-speech synthesis using sub-syllable units.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
ISIS: A multilingual spoken dialog system developed with CORBA and KQML agents.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Using cross-syllable units for Cantonese speech synthesis.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Incorporating tone information into Cantonese large-vocabulary continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Lexical tree decoding with a class-based language model for Chinese speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Acoustic modeling for Chinese speech recognition: a comparative study of Mandarin and Cantonese.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Cantonese syllable recognition using neural networks.
IEEE Trans. Speech and Audio Processing, 1999

Acoustic modeling and language modeling for cantonese LVCSR.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Micro-prosodic control in cantonese text-to-speech synthesis.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Two-dimensional multi-resolution analysis of speech signals and its application to speech recognition.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Isolated word recognition using modular recurrent neural networks.
Pattern Recognition, 1998

Context-dependent duration modelling for continuous speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
A neural network based speech recognition system for isolated Cantonese syllables.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Development of a large vocabulary speech database for Cantonese.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Automatic recognition of isolated Cantonese syllables using neural networks =: 利用神經網絡識別粤語單音節.
PhD thesis, 1996

On improving discrimination capability of an RNN based recognizer.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
Tone recognition of isolated Cantonese syllables.
IEEE Trans. Speech and Audio Processing, 1995

An RNN based speech recognition system with discriminative training.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Recurrent neural networks for speech modeling and speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995

1992
A Node Pruning Algorithm for Backpropagation Networks.
Int. J. Neural Syst., 1992


  Loading...