Dong-Yan Huang

According to our database1, Dong-Yan Huang authored at least 58 papers between 1995 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2019
High-quality Speech Synthesis Using Super-resolution Mel-Spectrogram.
CoRR, 2019

Speech Emotion Recognition using Spectral Normalized CycleGAN.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019

2018
ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017
Denoising Recurrent Neural Network for Deep Bidirectional LSTM Based Voice Conversion.
Proceedings of the Interspeech 2017, 2017

Multimodal Prediction of Affective Dimensions via Fusing Multiple Regression Techniques.
Proceedings of the Interspeech 2017, 2017

The Study of the Work Parameters of the Corn Harvester Cutter.
Proceedings of the Computer and Computing Technologies in Agriculture XI, 2017

Application of Growth Curve in Agricultural Scientific Research.
Proceedings of the Computer and Computing Technologies in Agriculture XI, 2017

Audio-visual emotion recognition using deep transfer learning and multiple temporal models.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Voichap: A standalone real-time voice change application on iOS platform.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Facial action recognition using very deep networks for highly imbalanced class distribution.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

I2RNTU at SemEval-2016 Task 4: Classifier Fusion for Polarity Classification in Twitter.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion.
Proceedings of the Interspeech 2016, 2016

Audio and face video emotion recognition in the wild using deep neural networks and small datasets.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Deep neural network derived bottleneck features for accurate audio classification.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Exemplar-based sparse representation of timbre and prosody for voice conversion.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Combining multiple kernel models for automatic intelligibility detection of pathological speech.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation.
Proceedings of the INTERSPEECH 2015, 2015

An alternating optimization approach for phase retrieval.
Proceedings of the INTERSPEECH 2015, 2015

A real-time variable-q non-stationary Gabor transform for pitch shifting.
Proceedings of the INTERSPEECH 2015, 2015

Performance scoring of singing voice.
Proceedings of the 2015 International Conference on Asian Language Processing, 2015

Perceptual speech quality improvement for vocoder based on amplitude spectrum of residual signal.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Non-negative matrix factorization using stable alternating direction method of multipliers for source separation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Mapping frames with DNN-HMM recognizer for non-parallel voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Fundamental frequency modeling using wavelets for emotional voice conversion.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Speaker state classification based on fusion of asymmetric simple partial least squares (SIMPLS) and support vector machines.
Comput. Speech Lang., 2014

Soft constrained leading voice separation with music score guidance.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Acoustic emotion recognition based on fusion of multiple feature-dependent deep Boltzmann machines.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

I<sup>2</sup>r speech2singing perfects everyone's singing.
Proceedings of the INTERSPEECH 2014, 2014

Intelligibility detection of pathological speech using asymmetric sparse kernel partial least squares classifier.
Proceedings of the IEEE International Conference on Acoustics, 2014

Learning optimal features for music transcription.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

Emotional facial expression transfer based on temporal restricted Boltzmann machines.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Ensemble Nyström method for predicting conflict level from speech.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
A dynamic Gaussian process for voice conversion.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

2012
A comparison of SVM and asymmetric SIMPLS in emotion recognition from naturalistic dialogues.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Detecting Intelligibility by Linear Dimensionality Reduction and Normalized Voice Quality Hierarchical Features.
Proceedings of the INTERSPEECH 2012, 2012

2011
Speaker State Classification Based on Fusion of Asymmetric SIMPLS and Support Vector Machines.
Proceedings of the INTERSPEECH 2011, 2011

2010
High level emotional speech morphing using STRAIGHT.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Lombard effect mimicking.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

2009
Biologically inspired algorithm for enhancement of speech intelligibility over telephone channel.
Proceedings of the 2009 IEEE International Workshop on Multimedia Signal Processing, 2009

The Misadjustment of the Cascaded LMS Prediction Filter.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

2008
Cascaded RLS-LMS Prediction in MPEG-4 Lossless Audio Coding.
IEEE Trans. Speech Audio Process., 2008

Convergence Performance of the Cascaded RLS-LMS Prediction.
Proceedings of the 67th IEEE Vehicular Technology Conference, 2008

2006
Eigenstructure algorithms for multirate adaptive lossless FIR filters.
IEEE Trans. Signal Process., 2006

2005
A performance bound for a cascade LMS predictor.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Software simulation tools on forward error correction schemes for the wireless transmission of MPEG4 AAC audio bitstreams.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Characterization of a cascade LMS predictor.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Speech pitch detection in noisy environment using multi-rate adaptive lossless FIR filters.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Sensitivity analysis of a cascade RLS-LMS algorithm for different resolution audio signals.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Performance analysis of an RLS-LMS algorithm for lossless audio compression.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Soft decision unequal error protection scheme for MPEG advanced audio coding.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Integer fast modified cosine transform.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

2002
Robust and Inaudible Multi-echo Audio Watermarking.
Proceedings of the Advances in Multimedia Information Processing, 2002

1999
An Attack Processing of Audio Signal for Reducing Pre-echo in a Low Bit-Rate Audio Coding System.
Proceedings of the Signal and Image Processing (SIP), 1999

Implementation of the MPEG-4 advanced audio coding encoder on ADSP-21060 SHARC.
Proceedings of the 1999 International Symposium on Circuits and Systems, ISCAS 1999, Orlando, Florida, USA, May 30, 1999

1997
Comparison of two eigenstructure algorithms for lossless multirate filter optimization.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
An adaptive projection algorithm for multirate filter bank optimization.
Proceedings of the 8th European Signal Processing Conference, 1996

1995
Attainable error bounds in multirate adaptive lossless FIR filters.
Proceedings of the 1995 International Conference on Acoustics, 1995


  Loading...