Wei Han

Orcid: 0000-0002-4201-9645

Affiliations:
  • Google
  • University of Illinois at Urbana-Champaign, Department of Electrical and Computer Engineering, Beckman Institute, Urbana, IL, USA


According to our database1, Wei Han authored at least 45 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Retrieval Augmented End-to-End Spoken Dialog Models.
CoRR, 2024

2023
RoboVQA: Multimodal Long-Horizon Reasoning for Robotics.
CoRR, 2023

Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding.
CoRR, 2023

Label Aware Speech Representation Learning For Language Identification.
CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.
CoRR, 2023

Accelerating RNN-T Training and Inference Using CTC Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

Efficient Domain Adaptation for Speech Foundation Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2022

Speech Aware Dialog System Technology Challenge (DSTC11).
CoRR, 2022

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data.
CoRR, 2022

Universal Paralinguistic Speech Representations Using self-Supervised Conformers.
Proceedings of the IEEE International Conference on Acoustics, 2022


2021
Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models.
CoRR, 2021

Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models.
CoRR, 2021

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Bridging the Gap Between Streaming and Non-Streaming ASR Systems by Distilling Ensembles of CTC and RNN-T Models.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling.
Proceedings of the 9th International Conference on Learning Representations, 2021

FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Better and Faster end-to-end Model for Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Streaming Automatic Speech Recognition with Non-Streaming Model Distillation on Unsupervised Data.
Proceedings of the IEEE International Conference on Acoustics, 2021

w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition.
CoRR, 2020

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling.
CoRR, 2020

Improved Noisy Student Training for Automatic Speech Recognition.
Proceedings of the Interspeech 2020, 2020

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context.
Proceedings of the Interspeech 2020, 2020

Conformer: Convolution-augmented Transformer for Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Streaming Object Detection for 3-D Point Clouds.
Proceedings of the Computer Vision - ECCV 2020, 2020

Scalability in Perception for Autonomous Driving: Waymo Open Dataset.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning compact neural network representations with structural priors
PhD thesis, 2019

StarNet: Targeted Computation for Object Detection in Point Clouds.
CoRR, 2019

A Comparison of End-to-End Models for Long-Form Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Learning 3D-FilterMap for Deep Convolutional Neural Networks.
CoRR, 2018

3D-FilterMap: A Compact Architecture for Deep Convolutional Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Image Super-Resolution via Dual-State Recurrent Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Dilated Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017


Balanced Two-Stage Residual Networks for Image Super-Resolution.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Robust Single Image Super-Resolution via Deep Networks With Sparse Prior.
IEEE Trans. Image Process., 2016

Seq-NMS for Video Object Detection.
CoRR, 2016

2015
Deeply Improved Sparse Coding for Image Super-Resolution.
CoRR, 2015

An Analysis of Unsupervised Pre-training in Light of Recent Advances.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Heterogeneous Network Embedding via Deep Architectures.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Deep Networks for Image Super-Resolution with Sparse Prior.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Self-tuned deep super resolution.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

2014
Multimedia Classification.
Proceedings of the Data Classification: Algorithms and Applications, 2014


  Loading...