Jianhua Tao

According to our database1, Jianhua Tao authored at least 176 papers between 2000 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Drawing Order Recovery for Handwriting Chinese Characters.
Proceedings of the IEEE International Conference on Acoustics, 2019

Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Self-attention Based Model for Punctuation Prediction Using Word and Speech Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2019

Discriminative Video Representation with Temporal Order for Micro-expression Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Phoneme Dependent Speaker Embedding and Model Factorization for Multi-speaker Speech Synthesis and Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition.
Signal Processing Systems, 2018

Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning.
Signal Processing Systems, 2018

Deep Learning Based Speech Separation via NMF-Style Reconstructions.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Learning for Continuous Multiple Time Series Annotations.
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, 2018

Multimodal Continuous Emotion Recognition with Data Augmentation Using Recurrent Neural Networks.
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, 2018

A Novel Unified Framework for Speech Enhancement and Bandwidth Extension Based on Jointly Trained Neural Networks.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

CLMAD: A Chinese Language Model Adaptation Dataset.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in a Text-to-Speech Front-End.
Proceedings of the Interspeech 2018, 2018

On the Application and Compression of Deep Time Delay Neural Network for Embedded Statistical Parametric Speech Synthesis.
Proceedings of the Interspeech 2018, 2018

Sparsity-Constrained Weight Mapping for Head-Related Transfer Functions Individualization from Anthropometric Features.
Proceedings of the Interspeech 2018, 2018

Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Speech Emotion Recognition from Variable-Length Inputs with Triplet Loss Function.
Proceedings of the Interspeech 2018, 2018

Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer.
Proceedings of the Interspeech 2018, 2018

Transfer Learning Based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis.
Proceedings of the Interspeech 2018, 2018

Pen Tip Motion Prediction for Handwriting Drawing Order Recovery using Deep Neural Network.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Reducing Tongue Shape Dimensionality from Hundreds of Available Resources Using Autoencoder.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Self-Talk: Responses to Users' Opinions and Challenges in Human Computer Dialog.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Architecture and Parameter Analysis to Convolutional Neural Network for Hand Tracking.
Proceedings of the Cloud Computing and Security - 4th International Conference, 2018

Adversarial Multilingual Training for Low-Resource Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

End-to-End Continuous Emotion Recognition from Video Using 3D Convlstm Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Quantitative intonation modeling of interrogative sentences for Mandarin speech synthesis.
Speech Communication, 2017

CHEAVD: a Chinese natural emotional audio-visual database.
J. Ambient Intelligence and Humanized Computing, 2017

Nonrigid point matching of Chinese characters for robot writing.
Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics, 2017

Research on modeling and machining algorithm of multi-shear and multi-punch CNC transverse shear line.
Proceedings of the 2017 IEEE International Conference on Cybernetics and Intelligent Systems (CIS) and IEEE Conference on Robotics, 2017

Continuous Multimodal Emotion Prediction Based on Long Short Term Memory Recurrent Neural Network.
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017

Investigating Efficient Feature Representation Methods and Training Objective for BLSTM-Based Phone Duration Prediction.
Proceedings of the Interspeech 2017, 2017

Distilling Knowledge from an Ensemble of Models for Punctuation Prediction.
Proceedings of the Interspeech 2017, 2017

A Domain Knowledge-Assisted Nonlinear Model for Head-Related Transfer Functions Based on Bottleneck Deep Neural Network.
Proceedings of the Interspeech 2017, 2017

A novel pitch extraction based on jointly trained deep BLSTM Recurrent Neural Networks with bottleneck features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement.
Signal Processing Systems, 2016

Guest Editorial: Advances in Machine Learning for Speech Processing.
Signal Processing Systems, 2016

Investigating Effect of Rich Syntactic Features on Mandarin Prosodic Boundaries Prediction.
Signal Processing Systems, 2016

Emotional head motion predicting from prosodic and linguistic features.
Multimedia Tools Appl., 2016

Football News Generation from Chinese Live Webcast Script.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Investigating deep neural network adaptation for generating exclamatory and interrogative speech in Mandarin.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Text-based sentential stress prediction using continuous lexical embedding for Mandarin speech synthesis.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

CTC regularized model adaptation for improving LSTM RNN based multi-accent Mandarin speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Learning auxiliary categorical information for speech synthesis based on deep and recurrent neural networks.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving accented Mandarin speech recognition by using recurrent neural network based language model adaptation.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

End-to-end keywords spotting based on connectionist temporal classification for Mandarin.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach.
Proceedings of the Interspeech 2016, 2016

The Parameterized Phoneme Identity Feature as a Continuous Real-Valued Vector for Neural Network Based Speech Synthesis.
Proceedings of the Interspeech 2016, 2016

A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions.
Proceedings of the Interspeech 2016, 2016

A Novel Research to Artificial Bandwidth Extension Based on Deep BLSTM Recurrent Neural Networks and Exemplar-Based Sparse Representation.
Proceedings of the Interspeech 2016, 2016

Extraction of tongue contour in real-time magnetic resonance imaging sequences.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Long short term memory recurrent neural network based encoding method for emotion recognition in video.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Recurrent Neural Network Based Language Model Adaptation for Accent Mandarin Speech.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

MEC 2016: The Multimodal Emotion Recognition Challenge of CCPR 2016.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

Improving BLSTM RNN based Mandarin speech recognition using accent dependent bottleneck features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Deep neural network based voice conversion with a large synthesized parallel corpus.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Micro-Expression Recognition Using Color Spaces.
IEEE Trans. Image Processing, 2015

Hierarchical stress modeling and generation in mandarin for expressive Text-to-Speech.
Speech Communication, 2015

User behavior fusion in dialog management with multi-modal history cues.
Multimedia Tools Appl., 2015

Long Short Term Memory Recurrent Neural Network based Multimodal Dimensional Emotion Recognition.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015

Combining extreme learning machine and decision tree for duration prediction in HMM based speech synthesis.
Proceedings of the INTERSPEECH 2015, 2015

A novel method of artificial bandwidth extension using deep architecture.
Proceedings of the INTERSPEECH 2015, 2015

Exploring smart pilot for partial packet recovery in super dense wireless networks.
Proceedings of the IEEE International Conference on Communication, 2015

Estimate articulatory MRI series from acoustic signal using deep architecture.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Evaluation of linear regression for speaker adaptation in HMM-based articulatory movements estimation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Voice quality: Not only about "you" but also about "your interlocutor".
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

From simulated speech to natural speech, what are the robust features for emotion recognition?
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Multi task sequence learning for depression scale prediction from video.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis.
Signal Processing Systems, 2014

Guest Editorial: Machine Learning for Signal Processing.
Signal Processing Systems, 2014

Introduction to the Issue on Statistical Parametric Speech Synthesis.
J. Sel. Topics Signal Processing, 2014

Phonological influences on the realization of final lowering evidence from dialogue Chinese Mandarin.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Multi-scale Temporal Modeling for Dimensional Emotion Recognition in Video.
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014

The expression of emotions by text and speech.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Survey on discriminative feature selection for speech emotion recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Evaluation of parameter generation using high order dynamic features and long span windows for HMM based speech synthesis.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Context features based pre-selection and weight prediction in concatenation speech synthesis system.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Efficient voice activity detection algorithm based on sub-band temporal envelope and sub-band long-term signal variability.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Investigating effect of rich syntactic features on Mandarin prosodic phrase boundaries prediction.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Improving generation performance of speech emotion recognition by denoising autoencoders.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Combining prosodic and spectral features for Mandarin intonation recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A hierarchical viterbi algorithm for Mandarin hybrid speech synthesis system.
Proceedings of the INTERSPEECH 2014, 2014

Improving Mandarin prosodic boundary prediction with rich syntactic features.
Proceedings of the INTERSPEECH 2014, 2014

A novel hybrid mandarin speech synthesis system using different base units for model training and concatenation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Tongue shape conversion with non-parallel training data.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
A novel unit selection method for concatenation speech system using similarity measure.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Stress predicition for Mandarin text-to-speech system using discourse context feature.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Extraction of tongue contour in X-ray videos.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker-independent lips and tongue visualization of vowels.
Proceedings of the IEEE International Conference on Acoustics, 2013

On Constructing a Chinese Task-Oriental Subjectivity Lexicon.
Proceedings of the Chinese Lexical Semantics - 14th Workshop, 2013

Combining emotional history through multimodal fusion methods.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Extended Decision Tree with or Relationship for HMM-Based Speech Synthesis.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

Bayesian Inference Based Temporal Modeling for Naturalistic Affective Expression Classification.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
A multimodal approach of generating 3D human-like talking agent.
J. Multimodal User Interfaces, 2012

Impact investigation of turbulence image features on the computation of flow kinetic energy.
IJSPM, 2012

Emotion and mental state recognition from speech.
EURASIP J. Adv. Sig. Proc., 2012

Statistical modification based post-filtering technique for HMM-based speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis.
Proceedings of the INTERSPEECH 2012, 2012

Pitch-Scaled Analysis based Residual Reconstruction for Speech Analysis and Synthesis.
Proceedings of the INTERSPEECH 2012, 2012

Multimodal emotion estimation and emotional synthesize for interaction virtual agent.
Proceedings of the 2nd IEEE International Conference on Cloud Computing and Intelligence Systems, 2012

2011
Utterance independent bimodal emotion recognition in spontaneous communication.
EURASIP J. Adv. Sig. Proc., 2011

An outlier rejection scheme for optical flow tracking.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

An excitation model based on inverse filtering for speech analysis and synthesis.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Preface.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Animating a Chinese interactive virtual character.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

HMM-based Tianjin Dialect speech synthesis using bilateral question Set.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Inverse Filtering Based Harmonic Plus Noise Excitation Model for HMM-Based Speech Synthesis.
Proceedings of the INTERSPEECH 2011, 2011

Hierarchical Stress Modeling in Mandarin Text-to-Speech.
Proceedings of the INTERSPEECH 2011, 2011

The Stability Analysis of Disyllabic Stress in Mandarin Speech.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011

The CASIA Audio Emotion Recognition Method for Audio/Visual Emotion Challenge 2011.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

2010
Supervisory Data Alignment for Text-Independent Voice Conversion.
IEEE Trans. Audio, Speech & Language Processing, 2010

HMM based speech synthesis with Global Variance Training method.
Proceedings of the 4th International Universal Communication Symposium, 2010

Real-time speech-driven lip synchronization.
Proceedings of the 4th International Universal Communication Symposium, 2010

The duration analysis of the checked tones in Cantonese speech.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

A novel hybrid approach for Mandarin speech synthesis.
Proceedings of the INTERSPEECH 2010, 2010

Text-based unstressed syllable prediction in Mandarin.
Proceedings of the INTERSPEECH 2010, 2010

Mood avatar: automatic text-driven head motion synthesis.
Proceedings of the 12th International Conference on Multimodal Interfaces / 7. International Workshop on Machine Learning for Multimodal Interaction, 2010

Does culture affect the perception of emotion in virtual faces?
Proceedings of the 7th Symposium on Applied Perception in Graphics and Visualization, 2010

2009
Realistic Visual Speech Synthesis Based on Hybrid Concatenation Method.
IEEE Trans. Audio, Speech & Language Processing, 2009

Dimension reducing of LSF parameters based on radial basis function neural network.
Proceedings of the INTERSPEECH 2009, 2009

Prosody modeling for mandarin exclamatory speech.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Prediction of Ground Water Level Based on DE-BP Neutral Network.
Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, 2009

Categorizing terms' subjectivity and polarity manually for opinion mining in Chinese.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

A multiple perception model on emotional speech.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

Face Animation Based on Large Audiovisual Database.
Proceedings of the Affective Information Processing, 2009

Emotional Speech Generation by Using Statistic Prosody Conversion Methods.
Proceedings of the Affective Information Processing, 2009

2008
Improving HMM Based Speech Synthesis by Reducing Over-Smoothing Problems.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Prosody Modification on Mixed-Language Speech Synthesis.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

A Novel Classifier Based on Enhanced Lipschitz Embedding for Speech Emotion Recognition.
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues, 2008

Text-independent voice conversion based on state mapped codebook.
Proceedings of the IEEE International Conference on Acoustics, 2008

Tree-guided transformation-based homograph disambiguation in Mandarin TTS system.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Manifolds Based Emotion Recognition in Speech.
IJCLCLP, 2007

Modeling incompletion phenomenon in Mandarin dialog prosody.
Proceedings of the INTERSPEECH 2007, 2007

Speech Emotion Recognition using an Enhanced Co-Training Algorithm.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Dynamic Audio-Visual Mapping using Fused Hidden Markov Model Inversion Method.
Proceedings of the International Conference on Image Processing, 2007

Speech Emotion Recognition Based on a Fusion of All-Class and Pairwise-Class Feature Selection.
Proceedings of the Computational Science, 2007

A Novel HMM-Based TTS System using Both Continuous HMMS and Discrete HMMS.
Proceedings of the IEEE International Conference on Acoustics, 2007

Expressive Face Animation Synthesis Based on Dynamic Mapping Method.
Proceedings of the Affective Computing and Intelligent Interaction, 2007

What Should a Generic Emotion Markup Language Be Able to Represent?
Proceedings of the Affective Computing and Intelligent Interaction, 2007

Combining Audio and Video by Dominance in Bimodal Emotion Recognition.
Proceedings of the Affective Computing and Intelligent Interaction, 2007

2006
Prosody conversion from neutral speech to emotional speech.
IEEE Trans. Audio, Speech & Language Processing, 2006

Nonlinear Emotional Prosody Generation and Annotation.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Prosodic Word Prediction Using a Maximum Entropy Approach.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Emotional Speech Analysis on Nonlinear Manifold.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Emotion Recognition from Noisy Speech.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A New Pitch Generation Model Based on Internal Dependence of Pitch Contour for Manadrin TTS System.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Applying Pitch Target Model to Convert F0 Contour for Expressive Mandarin Speech Synthesis.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Chinese prosodic phrasing with a constraint-based approach.
Proceedings of the INTERSPEECH 2005, 2005

Automatic 3D Face Modeling from Video.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Dynamic Mapping Method Based Speech Driven Face Animation System.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

Affective Computing: A Review.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

Features Importance Analysis for Emotional Speech Classification.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

Personalized Facial Animation Based on 3D Model Fitting from Two Orthogonal Face Images.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

A Hybrid GMM and Codebook Mapping Method for Spectral Conversion.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

2004
F0 Prediction Model of Speech Synthesis Based on Template and Statistical Method.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Acoustic and Linguistic Information Based Chinese Prosodic Boundary Labelling.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Multi-source based acoustic model for speech synthesis.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Rhythm correlation of speech synthesis system.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Grapheme-to-phoneme conversion in Chinese TTS system.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

A new multicomponent AM-FM demodulation with predicting frequency boundaries and its application to formant estimation.
Proceedings of the INTERSPEECH 2004, 2004

Context based emotion detection from text input.
Proceedings of the INTERSPEECH 2004, 2004

Emotional Chinese talking head system.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

2003
Emotion control of Chinese speech synthesis in natural environment.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Auditive learning based Chinese F0 prediction.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Chinese prosodic phrasing with extended features.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Auditive learning based Chinese F0 prediction.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Prosodic phrasing with inductive learning.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Clustering and feature learning based F0 prediction for Chinese speech synthesis.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Music type classification by spectral contrast feature.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Learning Rules for Chinese Prosodic Phrase Prediction.
Proceedings of the First Workshop on Chinese Language Processing, 2002

2000
Data-driven importance analysis of linguistic and phonetic information.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Data-driven importance analysis of linguistic and phonetic information.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000


  Loading...