Hao Li

Orcid: 0009-0005-4319-9026

Affiliations:
  • Kuaishou Technology Co., Beijing, China
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China


According to our database1, Hao Li authored at least 17 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Novel Simulation Method for 3D Digital-Image Correlation: Combining Virtual Stereo Vision and Image Super-Resolution Reconstruction.
Sensors, July, 2024

Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Enhancing Realism in 3D Facial Animation Using Conformer-Based Generation and Automated Post-Processing.
Proceedings of the IEEE International Conference on Acoustics, 2024

High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Learning Speech Representation from Contrastive Token-Acoustic Pretraining.
Proceedings of the IEEE International Conference on Acoustics, 2024

Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
HoloSinger: Semantics and Music Driven Motion Generation with Octahedral Holographic Projection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Improving Spoken Language Understanding with Cross-Modal Contrastive Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2020
DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech.
CoRR, 2020

2018
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2016
Emotional head motion predicting from prosodic and linguistic features.
Multim. Tools Appl., 2016

2015
User behavior fusion in dialog management with multi-modal history cues.
Multim. Tools Appl., 2015

Estimate articulatory MRI series from acoustic signal using deep architecture.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Evaluation of linear regression for speaker adaptation in HMM-based articulatory movements estimation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Tongue shape conversion with non-parallel training data.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Speaker-independent lips and tongue visualization of vowels.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Multimodal emotion estimation and emotional synthesize for interaction virtual agent.
Proceedings of the 2nd IEEE International Conference on Cloud Computing and Intelligence Systems, 2012


  Loading...