Kristina Toutanova

Affiliations:
  • Google Research, Seattle, USA


According to our database, Kristina Toutanova authored at least 84 papers between 1997 and 2023.

Bibliography

2023
Efficient End-to-End Visual Document Understanding with Rationale Distillation.
CoRR, 2023

Anchor Prediction: Automatic Refinement of Internet Links.
CoRR, 2023

From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding.
Proceedings of the International Conference on Machine Learning, 2023

Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Generating recommendations for entity-oriented exploratory search.
CoRR, 2022

Improving Compositional Generalization with Latent Structure and Data Augmentation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Entity-Centric Query Refinement.
Proceedings of the 4th Conference on Automated Knowledge Base Construction, 2022

2021
Sparse, Dense, and Attentional Representations for Text Retrieval.
Trans. Assoc. Comput. Linguistics, 2021

Revisiting the Primacy of English in Zero-shot Cross-lingual Transfer.
CoRR, 2021

Joint Passage Ranking for Diverse Multi-Answer Retrieval.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Representations for Question Answering from Documents with Tables and Text.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Contextualized Representations Using Textual Encyclopedic Knowledge.
CoRR, 2020

Probabilistic Assumptions Matter: Improved Models for Distantly-Supervised Document-Level Question Answering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Natural Questions: a Benchmark for Question Answering Research.
Trans. Assoc. Comput. Linguistics, 2019

Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation.
CoRR, 2019

Language Model Pre-training for Hierarchical Document Representations.
CoRR, 2019

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Zero-Shot Entity Linking by Reading Entity Descriptions.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Latent Retrieval for Weakly Supervised Open Domain Question Answering.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Improving Span-based Question Answering Systems with Coarsely Labeled Data.
CoRR, 2018

2017
Cross-Sentence N-ary Relation Extraction with Graph LSTMs.
Trans. Assoc. Comput. Linguistics, 2017

NLP for Precision Medicine.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

A Nested Attention Neural Hybrid Model for Grammatical Error Correction.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
E-TIPSY: Search Query Corpus Annotated with Entities, Term Importance, POS Tags, and Syntactic Parses.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

A Dataset and Evaluation Metrics for Abstractive Compression of Sentences and Short Paragraphs.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Microsummarization of Online Reviews: An Experimental Study.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Survey of data-selection methods in statistical machine translation.
Mach. Transl., 2015

Distant Supervision for Cancer Pathway Extraction from Text.
Biocomputing 2015: Proceedings of the Pacific Symposium, 2015

Grounded Semantic Parsing for Complex Knowledge Extraction.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Detecting Translation Direction: A Cross-Domain Study.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Representing Text for Joint Embedding of Text and Knowledge Bases.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Model Selection for Type-Supervised Learning with Application to POS Tagging.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Observed versus latent features for knowledge base and text inference.
Proceedings of the 3rd Workshop on Continuous Vector Space Models and their Compositionality, 2015

2014
Asymmetric Features Of Human Generated Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Graph-based Semi-Supervised Learning of Translation Models from Monolingual Data.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Beyond Left-to-Right: Multiple Decomposition Structures for SMT.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Regularized Minimum Error Rate Training.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Learning Non-linear Features for Machine Translation Using Gradient Boosting Machines.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
MSR SPLAT, a language analysis toolkit.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Clickthrough-based latent semantic models for web search.
Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Learning Discriminative Projections for Text Similarity Measures.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

Why Initialization Matters for IBM Model 1: Multiple Optima and Non-Strict Convexity.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Unsupervised Bilingual Morpheme Segmentation and Alignment with Context-rich Hidden Semi-Markov Models.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Translingual Document Representations from Discriminative Projections.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

A Discriminative Lexicon Model for Complex Morphology.
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, 2010

2009
Unsupervised Morphological Segmentation with Log-Linear Models.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Joint Optimization for Machine Translation System Combination.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

A global model for joint lemmatization and part-of-speech prediction.
Proceedings of the ACL 2009, 2009

2008
A Global Joint Model for Semantic Role Labeling.
Comput. Linguistics, 2008

Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation.
Proceedings of the COLING 2008, 2008

Applying Morphology Generation Models to Machine Translation.
Proceedings of the ACL 2008, 2008

2007
A Bayesian LDA-based model for semi-supervised part-of-speech tagging.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Generating Case Markers in Machine Translation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Generating Complex Morphology for Machine Translation.
Proceedings of the ACL 2007, 2007

A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing.
Proceedings of the ACL 2007, 2007

A Discriminative Syntactic Word Order Model for Machine Translation.
Proceedings of the ACL 2007, 2007

2006
Microsoft Research Treelet Translation System: NAACL 2006 Europarl Evaluation.
Proceedings of the Workshop on Statistical Machine Translation, 2006

Automatic Semantic Role Labeling.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Competitive generative models with structure learning for NLP classification tasks.
Proceedings of the EMNLP 2006, 2006

Learning to Predict Case Markers in Japanese.
Proceedings of the ACL 2006, 2006

2005
Effective statistical models for syntactic and semantic disambiguation.
PhD thesis, 2005

A Joint Model for Semantic Role Labeling.
Proceedings of the Ninth Conference on Computational Natural Language Learning, 2005

Joint Learning Improves Semantic Role Labeling.
Proceedings of the ACL 2005, 2005

2004
Learning random walk models for inducing word dependency distributions.
Proceedings of the Machine Learning, 2004

The Leaf Path Projection View of Parse Trees: Exploring String Kernels for HPSG Parse Selection.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 2004

2003
Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Optimizing Local Probability Models for Statistical Parsing.
Proceedings of the Machine Learning: ECML 2003, 2003

2002
Combining Heterogeneous Classifiers for Word Sense Disambiguation.
Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, 2002

CGWorld - Architecture and Features.
Proceedings of the Conceptual Structures: Integration and Interfaces, 2002

Extentions to HMM-based Statistical Word Alignment Models.
Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, 2002

Feature Selection for a Rich HPSG Grammar Using Decision Trees.
Proceedings of the 6th Conference on Natural Language Learning, 2002

The LinGO Redwoods Treebank: Motivation and Preliminary Applications.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

Pronunciation Modeling for Improved Spelling Correction.
Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, 2002

2001
Text Classification in a Hierarchical Mixture Model for Small Training Sets.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

1999
Using Conceptual Graphs to Solve a Resource Allocation Task.
Proceedings of the Conceptual Structures: Standards and Practices, 1999

1997
Menu-Based Interfaces to Conceptual Graphs: The CGLex Approach.
Proceedings of the Conceptual Structures: Fulfilling Peirce's Dream, 1997

