Chao Wang

Affiliations:
  • Amazon.com Inc, Alexa Speech, Cambridge, MA, USA (since 2016)
  • Vlingo, Cambridge, MA, USA
  • Massachusetts Institute of Technology, CSAIL, Cambridge, MA, USA (PhD 2001)


According to our database1, Chao Wang authored at least 61 papers between 1997 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Weight-Sharing Supernet for Searching Specialized Acoustic Event Classification Networks Across Device Constraints.
Proceedings of the IEEE International Conference on Acoustics, 2023

FedRPO: Federated Relaxed Pareto Optimization for Acoustic Event Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Improved Representation Learning For Acoustic Event Classification Using Tree-Structured Ontology.
Proceedings of the IEEE International Conference on Acoustics, 2022

Wikitag: Wikipedia-Based Knowledge Embeddings Towards Improved Acoustic Event Classification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Confidence Estimation for Speech Emotion Recognition Based on the Relationship Between Emotion Categories and Primitives.
Proceedings of the IEEE International Conference on Acoustics, 2022

Sentiment-Aware Automatic Speech Recognition Pre-Training for Enhanced Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Federated Self-Supervised Learning for Acoustic Event Classification.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Multi-Task Self-Supervised Pre-Training for Music Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Contrastive Unsupervised Learning for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Unsupervised and Semi-Supervised Few-Shot Acoustic Event Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improved Student Model Training for Acoustic Event Detection Models.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Data Techniques For Online End-to-end Speech Recognition.
CoRR, 2020

Acoustic Scene Analysis with Multi-Head Attention Networks.
Proceedings of the Interspeech 2020, 2020

A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling.
Proceedings of the Interspeech 2020, 2020

Semi-Supervised ASR by End-to-End Self-Training.
Proceedings of the Interspeech 2020, 2020

Intra-Utterance Similarity Preserving Knowledge Distillation for Audio Tagging.
Proceedings of the Interspeech 2020, 2020

Raw Waveform Based End-to-end Deep Convolutional Network for Spatial Localization of Multiple Acoustic Sources.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Few-Shot Acoustic Event Detection Via Meta Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Compression of Acoustic Event Detection Models with Low-rank Matrix Factorization and Quantization Training.
CoRR, 2019

Compression of Acoustic Event Detection Models with Quantized Distillation.
Proceedings of the Interspeech 2019, 2019

Sub-Band Convolutional Neural Networks for Small-Footprint Spoken Term Classification.
Proceedings of the Interspeech 2019, 2019

Hierarchical Residual-pyramidal Model for Large Context Based Media Presence Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019

Semi-supervised Acoustic Event Detection Based on Tri-training.
Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Emotion Classification through Variational Inference of Latent Variables.
Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Embeddings for Rare Audio Event Detection with Imbalanced Data.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multimodal and Multi-view Models for Emotion Recognition.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
A Simple Model for Detection of Rare Sound Events.
Proceedings of the Interspeech 2018, 2018

Detecting Media Sound Presence in Acoustic Scenes.
Proceedings of the Interspeech 2018, 2018

R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection.
Proceedings of the Interspeech 2018, 2018

2011
Your Mobile Virtual Assistant Just Got Smarter!
Proceedings of the INTERSPEECH 2011, 2011

2010
Good grief, i can speak it! preliminary experiments in audio restaurant reviews.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

A hybrid architecture for mobile voice user interfaces.
Proceedings of the INTERSPEECH 2010, 2010

2007
An interactive interpretation game for learning Chinese.
Proceedings of the Workshop on Speech and Language Technology in Education, 2007

Automatic Assessment of Student Translations for Foreign Language Tutoring.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Spoken Dialogue Systems for Language Learning.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Chinese Syntactic Reordering for Statistical Machine Translation.
Proceedings of the EMNLP-CoNLL 2007, 2007

A Spoken Translation Game for Second Language Learning.
Proceedings of the Artificial Intelligence in Education, 2007

2006
High-quality speech-to-speech translation for computer-aided language learning.
ACM Trans. Speech Lang. Process., 2006

Automatic induction of language model data for a spoken dialogue system.
Lang. Resour. Evaluation, 2006

High-quality speech translation in the flight domain.
Proceedings of the INTERSPEECH 2006, 2006

Scalable and portable web-based multimodal dialogue interaction with geographical databases.
Proceedings of the INTERSPEECH 2006, 2006

Combining Linguistic and Statistical Methods for Bi-directional English Chinese Translation in the Flight Domain.
Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, 2006

2005
Statistical modeling of phonological rules through linguistic hierarchies.
Speech Commun., 2005

Language model data filtering via user simulation and dialogue resynthesis.
Proceedings of the INTERSPEECH 2005, 2005

Context-sensitive statistical language modeling.
Proceedings of the INTERSPEECH 2005, 2005

2004
Second language acquisition through human computer dialogue.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

An interactive English pronunciation dictionary for Korean learners.
Proceedings of the INTERSPEECH 2004, 2004

A dynamic vocabulary spoken dialogue interface.
Proceedings of the INTERSPEECH 2004, 2004

Combining linguistic knowledge and acoustic information in automatic pronunciation lexicon generation.
Proceedings of the INTERSPEECH 2004, 2004

2003
Automatic Acquisition of Names Using Speak and Spell Mode in Spoken Dialogue Systems.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Automatic induction of n-gram language models from a natural language grammar.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Empowering end users to personalize dialogue systems through spoken interaction.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2001
Prosodic modeling for improved speech recognition and understanding.
PhD thesis, 2001

Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the jupiter domain.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Voice transformations: from speech synthesis to mammalian vocalizations.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Improved tone recognition by normalizing for coarticulation and intonation effects.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

MUXING: a telephone-access Mandarin conversational system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Robust pitch tracking for prosodic modeling in telephone speech.
Proceedings of the IEEE International Conference on Acoustics, 2000

1998
A study of tones and tempo in continuous Mandarin digit strings and their application in telephone quality speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
YINHE: a Mandarin Chinese version of the GALAXY system.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997


  Loading...