Lianhong Cai

Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Acoustics, content and geo-information based sentiment prediction from large-scale networked voice data.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Psychological stress detection from cross-media microblog data using Deep Sparse Neural Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Contrastive auto-encoder for phoneme recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Learning dynamic features with neural networks for phoneme recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Automatic Emotion Variation Detection in continuous speech.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

Affective image adjustment with a single word.

[BibT_eX]

[DOI]

Xiaohui Wang

Vis. Comput., 2013

Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2013

WeCard: a multimodal solution for making personalized electronic greeting cards.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

SNR estimation for clipped audio based on amplitude distribution.

[BibT_eX]

[DOI]

Xiaoqing Liu

Proceedings of the Ninth International Conference on Natural Computation, 2013

Interpretable aesthetic features for affective image classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2013

Investigation of tandem deep belief network approach for phoneme recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

A real-time speech driven talking avatar based on deep neural network.

[BibT_eX]

[DOI]

Kai Zhao

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

TalkingAndroid: An interactive, multimodal and real-time talking avatar application on mobile phones.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Comparing feature dimension reduction algorithms for GMM-SVM based speech emotion recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Affective Image Colorization.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2012

Comparison of adaptation methods for GMM-SVM based speech emotion recognition.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Understanding the emotional impact of images.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Can we understand van gogh's mood?: learning to infer affects from images in social networks.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Adaptive named entity recognition based on conditional random fields with automatic updated dynamic gazetteers.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A real-time tone enhancement method for continuous Mandarin speeches.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Detection and emphatic realization of contrastive word pairs for expressive text-to-speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Perceptual clustering based unit selection optimization for concatenative text-to-speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Analysis on mispronunciations in CAPT based on computational speech perception.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Hierarchical English Emphatic Speech Synthesis Based on HMM with Limited Training Data.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Intention understanding based on multi-source information integration for Chinese Mandarin spoken commands.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Fuzzy Systems and Knowledge Discovery, 2012

Image Colorization with an Affective Word.

[BibT_eX]

[DOI]

Proceedings of the Computational Visual Media - First International Conference, 2012

Modeling the correlation between modality semantics and facial expressions.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Emotional Audio-Visual Speech Synthesis Based on PAD.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Combining Active and Semi-Supervised Learning for Homograph Disambiguation in Mandarin Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Lyrics to Singing Voice Synthesis System with Variable Timbre.

[BibT_eX]

[DOI]

Proceedings of the Applied Informatics and Communication - International Conference, 2011

2010

Modeling prosody patterns for Chinese expressive text-to-speech synthesis.

[BibT_eX]

[DOI]

Helen M. Meng

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Investigation of the relation between acoustic features and articulation - An application to emotional speech analysis.

[BibT_eX]

[DOI]

Yongxin Wang

Jianwu Dang

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

HMM based TTS for mixed language text.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Comparison of Syllable/Phone HMM Based Mandarin TTS.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Emotional talking agent: System and evaluation.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Natural Computation, 2010

Facial expression synthesis based on motion patterns learned from face database.

[BibT_eX]

[DOI]

Shen Zhang

Proceedings of the International Conference on Image Processing, 2010

The Intelligent Music Editor: Towards an Automated Platform for Music Analysis and Editing.

[BibT_eX]

[DOI]

Yuxiang Liu

Roger B. Dannenberg

Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence, 2010

Facial Expression Synthesis Based on Emotion Dimensions for Affective Talking Avatar.

[BibT_eX]

[DOI]

Proceedings of the Modeling Machine Emotions for Realizing Intelligence, 2010

2009

Modeling the Expressivity of Input Text Semantics for Chinese Text-to-Speech Synthesis in a Spoken Dialog System.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Syllable HMM based Mandarin TTS and comparison with concatenative TTS.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Voiced/unvoiced decision algorithm for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Automatic Emphasis Labeling for Emotional Speech by Measuring Prosody Generation Error.

[BibT_eX]

[DOI]

Jun Xu

Proceedings of the Emerging Intelligent Computing Technology and Applications, 2009

Cultural style based music classification of audio signals.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Clustering Music Recordings by Their Keys.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

Analysis and Modeling of Affective Audio Visual Speech Based on PAD Emotion Space.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

A New Prosodic Strength Calculation Method for Prosody Reduction Modeling.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Entering Tone Recognition in a Support Vector Machine Approach.

[BibT_eX]

[DOI]

Xiangcheng Wang

Ying Liu

Proceedings of the Fourth International Conference on Natural Computation, 2008

2007

Fingerprint matching based on weighting method and the SVM.

[BibT_eX]

[DOI]

Neurocomputing, 2007

Hierarchical non-uniform unit selection based on prosodic structure.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Fake Finger Detection Based on Time-Series Fingerprint Image Analysis.

[BibT_eX]

[DOI]

Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues, 2007

A New Approach to Fake Finger Detection Based on Skin Elasticity Analysis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Biometrics, International Conference, 2007

Head Movement Synthesis Based on Semantic and Prosodic Features for a Chinese Expressive Avatar.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Script Design Based on Decision Tree with Context Vector and Acoustic Distance for Mandarin TTS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Facial Expression Synthesis Using PAD Emotional Parameters for a Chinese Expressive Avatar.

[BibT_eX]

[DOI]

Proceedings of the Affective Computing and Intelligent Interaction, 2007

Affect Related Acoustic Features of Speech and Their Modification.

[BibT_eX]

[DOI]

Proceedings of the Affective Computing and Intelligent Interaction, 2007

2006

A flexible framework for key audio effects detection and auditory context inference.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2006

Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model.

[BibT_eX]

[DOI]

Hongwu Yang

Dezhi Huang

IEICE Trans. Inf. Syst., 2006

Modelling the Global acoustic Correlates of Expressivity for Chinese Text-to-speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification.

[BibT_eX]

[DOI]

Xiaonan Zhang

Jun Xu

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Spectral Continuity Measures at Mandarin Syllable Boundaries.

[BibT_eX]

[DOI]

Jun Xu

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Investigation on Pleasure Related Acoustic Features of Affective Speech.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis.

[BibT_eX]

[DOI]

Hongwu Yang

Helen M. Meng

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Acoustic and Physiological Feature Analysis of Affective Speech.

[BibT_eX]

[DOI]

Dandan Cui

Proceedings of the Computational Intelligence, 2006

Multi-level Fusion of Audio and Visual Features for Speaker Identification.

[BibT_eX]

[DOI]

Helen M. Meng

Proceedings of the Advances in Biometrics, International Conference, 2006

2005

A TSVM-Based Minutiae Matching Approach for Fingerprint Verification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Biometric Person Authentication, 2005

Grapheme-to-phoneme conversion based on TBL algorithm in Mandarin TTS system.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Prosody Analysis and Modeling for Emotional Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Unsupervised auditory scene categorization via key audio effects and information-theoretic co-clustering.

[BibT_eX]

[DOI]

Rui Cai

Lie Lu

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Grapheme-to-Phoneme Conversion Based on a Fast TBL Algorithm in Mandarin TTS Systems.

[BibT_eX]

[DOI]

Proceedings of the Fuzzy Systems and Knowledge Discovery, Second International Conference, 2005

2004

Classifying emotion in Chinese speech by decomposing prosodic features.

[BibT_eX]

[DOI]

Dan-Ning Jiang

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Face Pose Estimation and its Application in Video Shot Selection.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Pattern Recognition, 2004

Speech emotion classification with the combination of statistic features and temporal features.

[BibT_eX]

[DOI]

Dan-Ning Jiang

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Improve audio representation by using feature structure patterns.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Approach to the Correlation Discovery of Chinese Linguistic Parameters Based on Bayesian Method.

[BibT_eX]

[DOI]

Wei Wang

J. Comput. Sci. Technol., 2003

An Improved Framework for Online Adaptive Information Filtering.

[BibT_eX]

[DOI]

Liang Ma

Qunxiu Chen

Proceedings of the Advances in Web-Age Information Management, 2003

An adaptive system for online document filtering.

[BibT_eX]

[DOI]

Liang Ma

Qunxiu Chen

Proceedings of the IEEE International Conference on Systems, 2003

Highlight sound effects detection in audio stream.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

2002

Incremental Learning for Profile Training in Adaptive Document Filtering.

[BibT_eX]

[DOI]

Proceedings of The Eleventh Text REtrieval Conference, 2002

Voice quality analysis under the pitch effect.

[BibT_eX]

[DOI]

Dan-Ning Jiang

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Annotation of Chinese prosodic level based on probabilistic model.

[BibT_eX]

[DOI]

Rui Cai

Zhi-Yong Wu

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Automatic stress prediction of Chinese speech synthesis.

[BibT_eX]

[DOI]

Sheng Zhao

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Prosodic phrasing with inductive learning.

[BibT_eX]

[DOI]

Sheng Zhao

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Clustering and feature learning based F0 prediction for Chinese speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Music type classification by spectral contrast feature.

[BibT_eX]

[DOI]

Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Learning Rules for Chinese Prosodic Phrase Prediction.

[BibT_eX]

[DOI]

Sheng Zhao

Proceedings of the First Workshop on Chinese Language Processing, 2002

2000

Research on dynamic characters of Chinese pitch contours.

[BibT_eX]

[DOI]

Tongchun Zhou

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

The design and application of a speech database for Chinese TTS system.

[BibT_eX]

[DOI]

Muhua Lv

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1998

The Statistical Model of Chinese Word Contours Based on Fuzzy.

[BibT_eX]

[DOI]