Di He

Affiliations:
  • Amazon Alexa, Seattle, WA, USA
  • University of Illinois at Urbana-Champaign, Department of Electrical and Computer Engineering, Coordinated Science Lab, Urbana, IL, USA (PhD 2019)
  • Inspirit IoT, Inc., Champaign, IL, USA


According to our database, Di He authored at least 14 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion.
CoRR, 2024

2023
Personalized Predictive ASR for Latency Reduction in Voice Assistants.
CoRR, 2023

Adaptive Endpointing with Deep Contextual Multi-Armed Bandits.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Towards Accurate and Real-Time End-of-Speech Estimation.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Two-Pass Endpoint Detection for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
VADOI: Voice-Activity-Detection Overlapping Inference for End-To-End Long-Form Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2021
wav2vec-C: A Self-Supervised Model for Speech Representation Learning.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2019
The benefits of acoustic perceptual information for speech processing systems.
PhD thesis, 2019

When CTC Training Meets Acoustic Landmarks.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

2018
Augmenting Input Method Language Model with user Location Type Information.
CoRR, 2018

Improved ASR for Under-resourced Languages through Multi-task Learning with Acoustic Landmarks.
Proceedings of the Interspeech 2018, 2018

2017
Acoustic Landmarks Contain More Information About the Phone String than Other Frames.
CoRR, 2017

Using Approximated Auditory Roughness as a Pre-Filtering Feature for Human Screaming and Affective Speech AED.
Proceedings of the Interspeech 2017, 2017

Machine learning on FPGAs to face the IoT revolution.
Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017

