Zhiyong Wu

According to our database1, Zhiyong Wu authored at least 100 papers between 2000 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Fast graph centrality computation via sampling: a case study of influence maximisation over OSNs.
IJHPCN, 2019

A robust model predictive control approach for post-disaster relief distribution.
Computers & Industrial Engineering, 2019

Non-Data-Aided Cycle Slip Self-Correcting Carrier Phase Estimation for QPSK Modulation Format of Coherent Wireless Optical Communication System.
IEEE Access, 2019

Towards Discriminative Representation Learning for Speech Emotion Recognition.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Walking with Perception: Efficient Random Walk Sampling via Common Neighbor Awareness.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Speech Emotion Recognition Using Capsule Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Quasi-fully Convolutional Neural Network with Variational Inference for Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2019

NN-based Ordinal Regression for Assessing Fluency of ESL Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams.
Proceedings of the IEEE International Conference on Acoustics, 2019

Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

End-to-end Code-switched TTS with Mix of Monolingual Recordings.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Automatic lexical stress and pitch accent detection for L2 English speech using multi-distribution deep neural networks.
Speech Communication, 2018

An Optimal Task Allocation Approach for Large-Scale Multiple Robotic Systems With Hierarchical Framework and Resource Constraints.
IEEE Systems Journal, 2018

High Precision Position Measurement Method for Laguerre-Gaussian Beams Using a Quadrant Detector.
Sensors, 2018

Hydrologic Evaluation of Multi-Source Satellite Precipitation Products for the Upper Huaihe River Basin, China.
Remote Sensing, 2018

Deep learning-based personality recognition from text posts of online social networks.
Appl. Intell., 2018

Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Speech Super-Resolution Using Parallel WaveNet.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection.
Proceedings of the Interspeech 2018, 2018

Detection of Glottal Closure Instants from Speech Signals: A Convolutional Neural Network Based Method.
Proceedings of the Interspeech 2018, 2018

Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis.
Proceedings of the Interspeech 2018, 2018

Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms.
Proceedings of the Interspeech 2018, 2018

Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Feature Based Adaptation for Speaking Style Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Applying Multitask Learning to Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Discovery of an Extended Phoneme Set in L2 English Speech for Mispronunciation Detection and Diagnosis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Emphatic Speech Generation with Conditioned Input Layer and Bidirectional LSTMS for Expressive Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards Practical Open Knowledge Base Canonicalization.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Modeling SSD RAID reliability under general settings.
Proceedings of the 15th ACM International Conference on Computing Frontiers, 2018

Supervised Discrete Matrix Factorization Hashing For Cross-Modal Retrieval.
Proceedings of the 5th IEEE International Conference on Cloud Computing and Intelligence Systems, 2018

Learning Frame-Level Recurrent Neural Networks Representations for Query-by-Example Spoken Term Detection on Mobile Devices.
Proceedings of the Artificial Intelligence and Mobile Services - AIMS 2018, 2018

Multi-modal Multi-scale Speech Expression Evaluation in Computer-Assisted Language Learning.
Proceedings of the Artificial Intelligence and Mobile Services - AIMS 2018, 2018

2017
A multi-robot cooperative hunting approach based on dynamic prediction of target motion.
Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics, 2017

Movie Recommendation via BLSTM.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space.
Proceedings of the Interspeech 2017, 2017

Spectro-Temporal Modelling with Time-Frequency LSTM and Structured Output Layer for Voice Conversion.
Proceedings of the Interspeech 2017, 2017

Multi-Task Learning for Prosodic Structure Generation Using BLSTM RNN with Structured Output Layer.
Proceedings of the Interspeech 2017, 2017

Learning cross-lingual knowledge with multilingual BLSTM for emphasis detection with limited training data.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multi-task learning of structured output layer bidirectional LSTMS for speech synthesis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Measuring and Maximizing Influence via Random Walk in Social Activity Networks.
Proceedings of the Database Systems for Advanced Applications, 2017

Multi-Task Deep Learning for User Intention Understanding in Speech Interaction Systems.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
A Novel Method for Classification of ECG Arrhythmias Using Deep Belief Networks.
International Journal of Computational Intelligence and Applications, 2016

A Real-Time Gesture-Based Unmanned Aerial Vehicle Control System.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Video Inpainting Based on Joint Gradient and Noise Minimization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

3D modeling based on multiple Unmanned Aerial Vehicles with the optimal paths.
Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2016

DBLSTM-based multi-task learning for pitch transformation in voice conversion.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Analysis on Gated Recurrent Unit Based Question Detection Approach.
Proceedings of the Interspeech 2016, 2016

Combining CNN and BLSTM to Extract Textual and Acoustic Features for Recognizing Stances in Mandarin Ideological Debate Competition.
Proceedings of the Interspeech 2016, 2016

Expressive Speech Driven Talking Avatar Synthesis with DBLSTM Using Limited Amount of Emotional Bimodal Data.
Proceedings of the Interspeech 2016, 2016

Phoneme Embedding and its Application to Speech Driven Talking Avatar Synthesis.
Proceedings of the Interspeech 2016, 2016

Heterogeneity-entropy based unsupervised feature learning for personality prediction with cross-media data.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Recognizing stances in Mandarin social ideological debates with text and acoustic features.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Learning cross-lingual information with multilingual BLSTM for speech synthesis of low-resource languages.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Question detection from acoustic features using recurrent neural network with gated recurrent unit.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Low level descriptors based DBLSTM bottleneck feature for speech driven talking avatar.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Acoustic to articulatory mapping with deep neural network.
Multimedia Tools Appl., 2015

Generating emphatic speech with hidden Markov model for expressive speech synthesis.
Multimedia Tools Appl., 2015

Polyphonic Music Modelling with LSTM-RTRBM.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Using tilt for automatic emphasis detection with Bayesian networks.
Proceedings of the INTERSPEECH 2015, 2015

Modelling High-Dimensional Sequences with LSTM-RTRBM: Application to Polyphonic Music Generation.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

HMM-based emphatic speech synthesis for corrective feedback in computer-aided pronunciation training.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A deep recurrent approach for acoustic-to-articulatory inversion.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Understanding speaking styles of internet speech data with LSTM and low-resource training.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Synthesizing English emphatic speech for multimodal corrective feedback in computer-aided pronunciation training.
Multimedia Tools Appl., 2014

Head and facial gestures synthesis using PAD model for an expressive talking avatar.
Multimedia Tools Appl., 2014

Automatic speech data clustering with human perception based weighted distance.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Multi-channel speech enhancement using sparse coding on local time-frequency structures.
Proceedings of the INTERSPEECH 2014, 2014

Using conditional random fields to predict focus word pair in spontaneous spoken English.
Proceedings of the INTERSPEECH 2014, 2014

Contrastive auto-encoder for phoneme recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Learning dynamic features with neural networks for phoneme recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Automatic Emotion Variation Detection in continuous speech.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
A Novel Ant Colony Optimization Routing Algorithm in Wireless Sensor Network.
Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

Investigation of tandem deep belief network approach for phoneme recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

A real-time speech driven talking avatar based on deep neural network.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Frequency-domain dereverberation on speech signal using surround retinex.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Sparse coding for sound event classification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Comparing feature dimension reduction algorithms for GMM-SVM based speech emotion recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Comparison of adaptation methods for GMM-SVM based speech emotion recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Adaptive named entity recognition based on conditional random fields with automatic updated dynamic gazetteers.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Detection and emphatic realization of contrastive word pairs for expressive text-to-speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Perceptual clustering based unit selection optimization for concatenative text-to-speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Hierarchical English Emphatic Speech Synthesis Based on HMM with Limited Training Data.
Proceedings of the INTERSPEECH 2012, 2012

Modeling the correlation between modality semantics and facial expressions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
A New Fuzzing Method Using Multi Data Samples Combination.
JCP, 2011

Combining Active and Semi-Supervised Learning for Homograph Disambiguation in Mandarin Text-to-Speech Synthesis.
Proceedings of the INTERSPEECH 2011, 2011

2010
Construction of quasi cyclic LDPC with ACE constraint.
Proceedings of the IEEE International Conference on Wireless Communications, 2010

Modeling prosody patterns for Chinese expressive text-to-speech synthesis.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Comparison of Syllable/Phone HMM Based Mandarin TTS.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Financial Input-output Efficiency Analysis of Water-transporting Logistics Enterprises in China Based on DEA.
Proceedings of the International Conference on E-Business and E-Government, 2010

2009
Modeling the Expressivity of Input Text Semantics for Chinese Text-to-Speech Synthesis in a Spoken Dialog System.
IEEE Trans. Audio, Speech & Language Processing, 2009

2008
The Use of Dynamic Deformable Templates for Lip Tracking in an Audio-Visual Corpus with Large Variations in Head Pose, Face Illumination and Lip Shapes.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

A New Prosodic Strength Calculation Method for Prosody Reduction Modeling.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

2007
Head Movement Synthesis Based on Semantic and Prosodic Features for a Chinese Expressive Avatar.
Proceedings of the IEEE International Conference on Acoustics, 2007

Facial Expression Synthesis Using PAD Emotional Parameters for a Chinese Expressive Avatar.
Proceedings of the Affective Computing and Intelligent Interaction, 2007

2006
Modelling the Global acoustic Correlates of Expressivity for Chinese Text-to-speech Synthesis.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

A Corpus-Based Approach for Cooperative Response Generation in a Dialog System.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar.
Proceedings of the INTERSPEECH 2006, 2006

Multi-level Fusion of Audio and Visual Features for Speaker Identification.
Proceedings of the Advances in Biometrics, International Conference, 2006

2000
Research on dynamic characters of Chinese pitch contours.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000


  Loading...