Yu Hu

Affiliations:
  • IFLYTEK Research, Hefei, China
  • University of Science and Technology of China, National Engineering Laboratory of Speech and Language Information Processing, Hefei, China (PhD 2009)


According to our database1, Yu Hu authored at least 50 papers between 2000 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Joint optimization for attention-based generation and recognition of chinese characters using tree position embedding.
Pattern Recognit., August, 2023

QDM-SSD: Quality-Aware Dynamic Masking for Separation-Based Speaker Diarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

2021
A Multiple-Integration Encoder for Multi-Turn Text-to-SQL Semantic Parsing.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Robustness of Speech Spoofing Detectors Against Adversarial Post-Processing of Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement.
Neural Networks, 2021

Adversarial Voice Conversion Against Neural Spoofing Detectors.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention.
CoRR, 2020

Adversarial Post-Processing of Voice Conversion against Spoofing Detection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Knowledge Base Question Answering With Attentive Pooling for Question Representation.
IEEE Access, 2019

2017
Nonrecurrent Neural Structure for Long-Term Dependence.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Towards human-like and transhuman perception in AI 2.0: a review.
Frontiers Inf. Technol. Electron. Eng., 2017

Cause-Effect Knowledge Acquisition and Neural Association Model for Solving A Set of Winograd Schema Problems.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Combing Context and Commonsense Knowledge Through Neural Networks for Solving Winograd Schema Problems.
Proceedings of the 2017 AAAI Spring Symposia, 2017

2016
Part-of-Speech Relevance Weights for Learning Word Embeddings.
CoRR, 2016

Probabilistic Reasoning via Deep Learning: Neural Association Models.
CoRR, 2016

Intra-Topic Variability Normalization based on Linear Projection for Topic Classification.
Proceedings of the NAACL HLT 2016, 2016

Modulation spectrum compensation for HMM-based speech synthesis using line spectral pairs.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploring Semantic Representation in Brain Activity Using Word Embeddings.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
State-Clustering Based Multiple Deep Neural Networks Modeling Approach for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency.
CoRR, 2015

Keynote speech 1: Artificial intelligence needs a language cognitive revolution.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2012
Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMS in acoustic modeling.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2011
Trust Region-Based Optimization for Maximum Mutual Information Estimation of HMMs in Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Boosted Mixture Learning of Gaussian Mixture Hidden Markov Models Based on Maximum Likelihood for Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

2010
Robust pronunciation evaluation in adverse environments.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Phonetic clustering based confidence measure for embedded speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2010, 2010

Boosted mixture learning of Gaussian mixture HMMs for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

A bounded trust region optimization for discriminative training of HMMS in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

HMM-based pseudo-clean speech synthesis for splice algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models.
Speech Commun., 2009

A trust region based optimization for maximum mutual information estimation of HMMS in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Investigation on Adaptation Using Different Discriminative Training Criteria Based Linear Regression and Map.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Pronunciation Space Models for Pronunciation Evaluation.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Exploiting Non-Target Region Information for Confidence Measure Based on Bayesian Information Criterion.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Evaluation of a Feature Compensation Approach Using High-Order Vector Taylor Series Approximation of an Explicit Distortion Modelon Aurora2, Aurora3, and Aurora4 Tasks.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Heteronym Verification for Mandarin Speech Synthesis.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Minimum word classification error training of HMMS for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

2006
An HMM Compensation Approach Using Unscented Transformation for Noisy Speech Recognition.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Automatic Mandarin pronunciation scoring for native learners with dialect accent.
Proceedings of the INTERSPEECH 2006, 2006

2005
A Novel Source Analysis Method by Matching Spectral Characters of LF Model with STRAIGHT Spectrum.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

2004
Modeling glottal effect on the spectral envelop of STRAIGHT using mixture of Gaussians.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Hearer model based stress prediction for Chinese TTS system.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Compression of speech database by feature separation and pattern clustering using STRAIGHT.
Proceedings of the INTERSPEECH 2004, 2004

2002
A miniature Chinese TTS system based on tailored corpus.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2000
Prosody generation in Chinese synthesis using the template of quantified prosodic unit and base intonation contour.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

KD2000 Chinese Text-To-Speech System.
Proceedings of the Advances in Multimodal Interfaces, 2000


  Loading...