Karen Livescu

According to our database1, Karen Livescu authored at least 121 papers between 2000 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Semantic Speech Retrieval With a Visually Grounded Model of Untranscribed Speech.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

On the Contributions of Visual and Textual Supervision in Low-resource Semantic Speech Retrieval.
CoRR, 2019

Semantic query-by-example speech search using visual grounding.
CoRR, 2019

Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition.
CoRR, 2019

2018
American Sign Language fingerspelling recognition in the wild.
CoRR, 2018

Pre-training on high-resource speech recognition improves low-resource speech-to-text translation.
CoRR, 2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.
CoRR, 2018

Hierarchical Multitask Learning for CTC-based Speech Recognition.
CoRR, 2018

Low-Resource Speech-to-Text Translation.
CoRR, 2018

Acoustic feature learning using cross-domain articulatory measurements.
CoRR, 2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

American Sign Language Fingerspelling Recognition in the Wild.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Low-Resource Speech-to-Text Translation.
Proceedings of the Interspeech 2018, 2018

Acoustic Feature Learning Using Cross-Domain Articulatory Measurements.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Study of All-Convolutional Encoders for Connectionist Temporal Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Variational Sequential Labelers for Semi-Supervised Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Semantic Speech Retrieval With a Visually Grounded Model of Untranscribed Speech.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
End-to-End Neural Segmental Models for Speech Recognition.
J. Sel. Topics Signal Processing, 2017

Lexicon-free fingerspelling recognition from video: Data, models, and signer adaptation.
Computer Speech & Language, 2017

A Study of All-Convolutional Encoders for Connectionist Temporal Classification.
CoRR, 2017

Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition.
CoRR, 2017

Semantic keyword spotting by learning from images and speech.
CoRR, 2017

Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis.
CoRR, 2017

End-to-End Neural Segmental Models for Speech Recognition.
CoRR, 2017

Learning to Embed Words in Context for Syntactic Tasks.
CoRR, 2017

Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing.
CoRR, 2017

Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition.
CoRR, 2017

Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings.
CoRR, 2017

Visually grounded learning of keyword prediction from untranscribed speech.
CoRR, 2017

An embedded segmental k-means model for unsupervised segmentation and clustering of speech.
CoRR, 2017

Learning to Embed Words in Context for Syntactic Tasks.
Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis.
Proceedings of the Interspeech 2017, 2017

Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings.
Proceedings of the Interspeech 2017, 2017

Visually Grounded Learning of Keyword Prediction from Untranscribed Speech.
Proceedings of the Interspeech 2017, 2017

Multi-view Recurrent Neural Acoustic Word Embeddings.
Proceedings of the 5th International Conference on Learning Representations, 2017

Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

An embedded segmental K-means model for unsupervised segmentation and clustering of speech.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Speech Production in Speech Technologies: Introduction to the CSL Special Issue.
Computer Speech & Language, 2016

Articulatory feature-based pronunciation modeling.
Computer Speech & Language, 2016

Charagram: Embedding Words and Sentences via Character n-grams.
CoRR, 2016

Towards Universal Paraphrastic Sentence Embeddings.
Proceedings of the 4th International Conference on Learning Representations, 2016

Deep Variational Canonical Correlation Analysis.
CoRR, 2016

Large-Scale Approximate Kernel Canonical Correlation Analysis.
Proceedings of the 4th International Conference on Learning Representations, 2016

On Deep Multi-View Representation Learning: Objectives and Optimization.
CoRR, 2016

Jointly Learning to Align and Convert Graphemes to Phonemes with Neural Attention Models.
CoRR, 2016

End-to-End Training Approaches for Discriminative Segmental Models.
CoRR, 2016

Efficient Segmental Cascades for Speech Recognition.
CoRR, 2016

Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches.
CoRR, 2016

Multi-view Recurrent Neural Acoustic Word Embeddings.
CoRR, 2016

Signer-independent Fingerspelling Recognition with Deep Neural Network Adaptation.
CoRR, 2016

Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation.
CoRR, 2016

Jointly learning to align and convert graphemes to phonemes with neural attention models.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

End-to-end training approaches for discriminative segmental models.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Discriminative acoustic word embeddings: Tecurrent neural network-based approaches.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Mapping Unseen Words to Task-Trained Embedding Spaces.
Proceedings of the 1st Workshop on Representation Learning for NLP, 2016

Triphone State-Tying via Deep Canonical Correlation Analysis.
Proceedings of the Interspeech 2016, 2016

Efficient Segmental Cascades for Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Nonparametric Canonical Correlation Analysis.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Deep convolutional acoustic word embeddings using word-pair side information.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Signer-independent fingerspelling recognition with deep neural network adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Charagram: Embedding Words and Sentences via Character n-grams.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
From Paraphrase Database to Compositional Paraphrase Model and Back.
TACL, 2015

From Paraphrase Database to Compositional Paraphrase Model and Back.
CoRR, 2015

Stochastic Optimization for Deep CCA via Nonlinear Orthogonal Iterations.
CoRR, 2015

Discriminative Segmental Cascades for Feature-Rich Phone Recognition.
CoRR, 2015

Nonparametric Canonical Correlation Analysis.
CoRR, 2015

Mapping Unseen Words to Task-Trained Embedding Spaces.
CoRR, 2015

Deep convolutional acoustic word embeddings using word-pair side information.
CoRR, 2015

Deep Multilingual Correlation for Improved Word Embeddings.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

On Deep Multi-View Representation Learning.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Unsupervised learning of acoustic features via deep canonical correlation analysis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Discriminative segmental cascades for feature-rich phone recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Stochastic optimization for deep CCA via nonlinear orthogonal iterations.
Proceedings of the 53rd Annual Allerton Conference on Communication, 2015

2014
Reconstruction of articulatory measurements with smoothed low-rank matrix completion.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Revisiting Word Neighborhoods for Speech Recognition.
Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, 2014

A comparison of training approaches for discriminative segmental models.
Proceedings of the INTERSPEECH 2014, 2014

Multi-view learning with supervision for transformed bottleneck features.
Proceedings of the IEEE International Conference on Acoustics, 2014

Tailoring Continuous Word Representations for Dependency Parsing.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Discriminative training of WFST factors with application to pronunciation modeling.
Proceedings of the INTERSPEECH 2013, 2013

Deep Canonical Correlation Analysis.
Proceedings of the 30th International Conference on Machine Learning, 2013

Fingerspelling Recognition with Semi-Markov Conditional Random Fields.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Discriminative articulatory models for spoken term detection in low-resource conversational settings.
Proceedings of the IEEE International Conference on Acoustics, 2013

Multi-view CCA-based acoustic features for phonetic recognition across speakers and domains.
Proceedings of the IEEE International Conference on Acoustics, 2013

Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Subword Modeling for Automatic Speech Recognition: Past, Present, and Emerging Approaches.
IEEE Signal Process. Mag., 2012

American sign language fingerspelling recognition with phonological feature-based tandem models.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Discriminative spoken term detection with limited data.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Kernel CCA for multi-view learning of acoustic features using articulatory measurements.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Discriminatively learning factorized finite state pronunciation models from dynamic Bayesian networks.
Proceedings of the INTERSPEECH 2012, 2012

Stochastic optimization for PCA and PLS.
Proceedings of the 50th Annual Allerton Conference on Communication, 2012

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Articulatory Feature Classification Using Nearest Neighbors.
Proceedings of the INTERSPEECH 2011, 2011

Nearest Neighbors with Learned Distances for Phonetic Frame Classification.
Proceedings of the INTERSPEECH 2011, 2011

Lexical access experiments with context-dependent articulatory feature-based models.
Proceedings of the IEEE International Conference on Acoustics, 2011

A factored conditional random field model for articulatory feature forced transcription.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Audio-visual anticipatory coarticulation modeling by human and machine.
Proceedings of the INTERSPEECH 2010, 2010

Modeling pronunciation variation with context-dependent articulatory feature decision trees.
Proceedings of the INTERSPEECH 2010, 2010

2009
Multistream Articulatory Feature-Based Models for Visual Speech Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Multi-view clustering via canonical correlation analysis.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

On the phonetic information in ultrasonic microphone signals.
Proceedings of the IEEE International Conference on Acoustics, 2009

Multi-view learning of acoustic features for speaker recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Invited talk: Phonological Models in Automatic Speech Recognition.
Proceedings of the Tenth Meeting of ACL Special Interest Group on Computational Morphology and Phonology, 2008

2007
Articulatory feature classifiers trained on 2000 hours of telephone speech.
Proceedings of the INTERSPEECH 2007, 2007

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop.
Proceedings of the IEEE International Conference on Acoustics, 2007

Manual Transcription of Conversational Speech at the Articulatory Feature Level.
Proceedings of the IEEE International Conference on Acoustics, 2007

An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2007

Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPS.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
An Asynchronous DBN for Audio-Visual speech Recognition.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

2005
Feature-based pronunciation modeling for automatic speech recognition.
PhD thesis, 2005

Pronunciation modeling using a finite-state transducer representation.
Speech Communication, 2005

Visual Speech Recognition with Loosely Synchronized Feature Streams.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Production domain modeling of pronunciation for visual speech recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Landmark-Based Speech Recognition: Report of the 2004 Johns Hopkins Summer Workshop.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Feature-based Pronunciation Modeling for Speech Recognition.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Feature-based pronunciation modeling with trainable asynchrony probabilities.
Proceedings of the INTERSPEECH 2004, 2004

2003
Hidden feature models for speech recognition using dynamic Bayesian networks.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Structurally discriminative graphical models for automatic speech recognition - results from the 2001 Johns Hopkins Summer Workshop.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Segment-based recognition on the phonebook task: initial results and observations on duration modeling.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Lexical modeling of non-native speech for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000


  Loading...