Ramon Sanabria

Hao Tang

Sharon Goldwater

Proceedings of the IEEE International Conference on Acoustics, 2023

Measuring the Impact of Domain Factors in Self-Supervised Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training.

[BibT_eX]

[DOI]

CoRR, 2022

2021

On the Difficulty of Segmenting Words with Attention.

[BibT_eX]

[DOI]

Hao Tang

Sharon Goldwater

CoRR, 2021

Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval.

[BibT_eX]

[DOI]

Austin Waters

Jason Baldridge

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Grounded Sequence to Sequence Transduction.

[BibT_eX]

[DOI]

Lucia Specia

Loïc Barrault

Ozan Caglayan

Amanda Cardoso Duarte

IEEE J. Sel. Top. Signal Process., 2020

Transfer learning for multimodal dialog.

[BibT_eX]

[DOI]

Shruti Palaskar

Comput. Speech Lang., 2020

Multimodal Speech Recognition with Unstructured Audio Masking.

[BibT_eX]

[DOI]

CoRR, 2020

Looking Enhances Listening: Recovering Missing Speech Using Images.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Fine-Grained Grounding for Multimodal Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019

Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions.

[BibT_eX]

[DOI]

CoRR, 2019

Grounding Object Detections With Transcriptions.

[BibT_eX]

[DOI]

CoRR, 2019

OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2019 Text Analysis Conference, 2019

MediaEval 2019: Eyes and Ears Together.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019

Multitask Learning For Different Subword Segmentations In Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Spoken Language Translation, 2019

CMU's Machine Translation System for IWSLT 2019.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Spoken Language Translation, 2019

The IWSLT 2019 Evaluation Campaign.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Multimodal Grounding for Sequence-to-sequence Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

How2: A Large-scale Dataset for Multimodal Language Understanding.

[BibT_eX]

[DOI]

CoRR, 2018

Hierarchical Multi Task Learning With CTC.

[BibT_eX]

[DOI]

CoRR, 2018

OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis.

[BibT_eX]

[DOI]

Eduard H. Hovy

Taylor Berg-Kirkpatrick

Jaime G. Carbonell

Hans Chalupsky

Anatole Gershman

Alexander G. Hauptmann

Hector Zhengzhong Liu

Proceedings of the 2018 Text Analysis Conference, 2018

Hierarchical Multitask Learning With CTC.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Eyes and Ears Together: New Task for Multimodal Spoken Content Analysis.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2018 Workshop, 2018

Subword and Crossword Units for CTC Acoustic Models.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

End-to-end Multimodal Speech Recognition.

[BibT_eX]

[DOI]

Shruti Palaskar

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sequence-Based Multi-Lingual Low Resource Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Comparison of Decoding Strategies for CTC Acoustic Models.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Robust end-to-end deep audiovisual speech recognition.

[BibT_eX]

[DOI]