Chao Wang

Affiliations:

Amazon.com Inc, Alexa Speech, Cambridge, MA, USA (since 2016)
Vlingo, Cambridge, MA, USA
Massachusetts Institute of Technology, CSAIL, Cambridge, MA, USA (PhD 2001)

According to our database¹, Chao Wang authored at least 61 papers between 1997 and 2023.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2023

Weight-Sharing Supernet for Searching Specialized Acoustic Event Classification Networks Across Device Constraints.

Proceedings of the IEEE International Conference on Acoustics, 2023

FedRPO: Federated Relaxed Pareto Optimization for Acoustic Event Classification.

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Improved Representation Learning For Acoustic Event Classification Using Tree-Structured Ontology.

Arman Zharmagambetov

Proceedings of the IEEE International Conference on Acoustics, 2022

Wikitag: Wikipedia-Based Knowledge Embeddings Towards Improved Acoustic Event Classification.

Proceedings of the IEEE International Conference on Acoustics, 2022

Confidence Estimation for Speech Emotion Recognition Based on the Relationship Between Emotion Categories and Primitives.

Constantinos Papayiannis, Elizabeth Shriberg

Proceedings of the IEEE International Conference on Acoustics, 2022

Sentiment-Aware Automatic Speech Recognition Pre-Training for Enhanced Speech Emotion Recognition.

Elizabeth Shriberg

Proceedings of the IEEE International Conference on Acoustics, 2022

Federated Self-Supervised Learning for Acoustic Event Classification.

Spyros Matsoukas

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Multi-Task Self-Supervised Pre-Training for Music Classification.

Juan Pablo Bello

Proceedings of the IEEE International Conference on Acoustics, 2021

Contrastive Unsupervised Learning for Speech Emotion Recognition.

Andreas Stolcke, Spyros Matsoukas, Constantinos Papayiannis

Proceedings of the IEEE International Conference on Acoustics, 2021

Unsupervised and Semi-Supervised Few-Shot Acoustic Event Classification.

Hsin-Ping Huang, Krishna C. Puvvada

Proceedings of the IEEE International Conference on Acoustics, 2021

Improved Student Model Training for Acoustic Event Detection Models.

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020

Data Techniques For Online End-to-end Speech Recognition.

CoRR, 2020

Acoustic Scene Analysis with Multi-Head Attention Networks.

Proceedings of the Interspeech 2020, 2020

A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling.

Proceedings of the Interspeech 2020, 2020

Semi-Supervised ASR by End-to-End Self-Training.

Proceedings of the Interspeech 2020, 2020

Intra-Utterance Similarity Preserving Knowledge Distillation for Audio Tagging.

Chun-Chieh Chang

Proceedings of the Interspeech 2020, 2020

Raw Waveform Based End-to-end Deep Convolutional Network for Spatial Localization of Multiple Acoustic Sources.

Harshavardhan Sundar

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Few-Shot Acoustic Event Detection Via Meta Learning.

Krishna C. Puvvada, Spyros Matsoukas

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Compression of Acoustic Event Detection Models with Low-rank Matrix Factorization and Quantization Training.

Spyros Matsoukas

CoRR, 2019

Compression of Acoustic Event Detection Models with Quantized Distillation.

Spyros Matsoukas

Proceedings of the Interspeech 2019, 2019

Sub-Band Convolutional Neural Networks for Small-Footprint Spoken Term Classification.

Shiv Vitaladevuni

Proceedings of the Interspeech 2019, 2019

Hierarchical Residual-pyramidal Model for Large Context Based Media Presence Detection.

Proceedings of the IEEE International Conference on Acoustics, 2019

Semi-supervised Acoustic Event Detection Based on Tri-training.

Spyros Matsoukas

Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Emotion Classification through Variational Inference of Latent Variables.

Srinivas Parthasarathy

Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Embeddings for Rare Audio Event Detection with Imbalanced Data.

Proceedings of the IEEE International Conference on Acoustics, 2019

Multimodal and Multi-view Models for Emotion Recognition.

Gustavo Aguilar

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

A Simple Model for Detection of Rare Sound Events.

Proceedings of the Interspeech 2018, 2018

Detecting Media Sound Presence in Acoustic Scenes.

Constantinos Papayiannis

Proceedings of the Interspeech 2018, 2018

R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection.

Proceedings of the Interspeech 2018, 2018

2011

Your Mobile Virtual Assistant Just Got Smarter!

Enrico Bocchieri, Diamantino Caseiro

Proceedings of the INTERSPEECH 2011, 2011

2010

Good grief, I can speak it! Preliminary experiments in audio restaurant reviews.

Joseph Polifroni, Stephanie Seneff, S. R. K. Branavan, Regina Barzilay

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

A hybrid architecture for mobile voice user interfaces.

Joseph Polifroni, Ghinwa F. Choueiter

Proceedings of the INTERSPEECH 2010, 2010

2007

An interactive interpretation game for learning Chinese.

Stephanie Seneff

Proceedings of the Workshop on Speech and Language Technology in Education, 2007

Automatic Assessment of Student Translations for Foreign Language Tutoring.

Stephanie Seneff

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2007

Spoken Dialogue Systems for Language Learning.

Stephanie Seneff

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2007

Chinese Syntactic Reordering for Statistical Machine Translation.

Michael Collins

Proceedings of the EMNLP-CoNLL 2007, 2007

A Spoken Translation Game for Second Language Learning.

Stephanie Seneff

Proceedings of the Artificial Intelligence in Education, 2007

2006

High-quality speech-to-speech translation for computer-aided language learning.

Stephanie Seneff

ACM Trans. Speech Lang. Process., 2006

Automatic induction of language model data for a spoken dialogue system.

Stephanie Seneff

Lang. Resour. Evaluation, 2006

High-quality speech translation in the flight domain.

Stephanie Seneff

Proceedings of the INTERSPEECH 2006, 2006

Scalable and portable web-based multimodal dialogue interaction with geographical databases.

Alexander Gruenstein, Stephanie Seneff

Proceedings of the INTERSPEECH 2006, 2006

Combining Linguistic and Statistical Methods for Bi-directional English Chinese Translation in the Flight Domain.

Stephanie Seneff

Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, 2006

2005

Statistical modeling of phonological rules through linguistic hierarchies.

Stephanie Seneff

Speech Commun., 2005

Language model data filtering via user simulation and dialogue resynthesis.

Stephanie Seneff

Proceedings of the INTERSPEECH 2005, 2005

Context-sensitive statistical language modeling.

Alexander Gruenstein, Stephanie Seneff

Proceedings of the INTERSPEECH 2005, 2005

2004

Second language acquisition through human computer dialogue.

Stephanie Seneff, Mitchell Peabody

Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

An interactive English pronunciation dictionary for Korean learners.

Mitchell Peabody, Stephanie Seneff

Proceedings of the INTERSPEECH 2004, 2004

A dynamic vocabulary spoken dialogue interface.

Stephanie Seneff, I. Lee Hetherington

Proceedings of the INTERSPEECH 2004, 2004

Combining linguistic knowledge and acoustic information in automatic pronunciation lexicon generation.

Stephanie Seneff

Proceedings of the INTERSPEECH 2004, 2004

2003

Automatic Acquisition of Names Using Speak and Spell Mode in Spoken Dialogue Systems.

Stephanie Seneff

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Automatic induction of n-gram language models from a natural language grammar.

Stephanie Seneff, Timothy J. Hazen

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Empowering end users to personalize dialogue systems through spoken interaction.

Stephanie Seneff

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2001

Prosodic modeling for improved speech recognition and understanding.

PhD thesis, 2001

Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the jupiter domain.

Stephanie Seneff

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Voice transformations: from speech synthesis to mammalian vocalizations.

Stephanie Seneff

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Improved tone recognition by normalizing for coarticulation and intonation effects.

Stephanie Seneff

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

MUXING: a telephone-access Mandarin conversational system.

D. Scott Cyphers, Joseph Polifroni, Stephanie Seneff

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Robust pitch tracking for prosodic modeling in telephone speech.

Stephanie Seneff

Proceedings of the IEEE International Conference on Acoustics, 2000

1998

A study of tones and tempo in continuous Mandarin digit strings and their application in telephone quality speech recognition.

Stephanie Seneff

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997

YINHE: a Mandarin Chinese version of the GALAXY system.

Joseph Polifroni, Stephanie Seneff

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
