Chris Callison-Burch

Orcid: 0000-0001-8196-1943

Affiliations:
  • University of Pennsylvania, Philadelphia, PA, USA


According to our database, Chris Callison-Burch authored at least 222 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
CoMo: Controllable Motion Generation through Language Guided Pose Code Editing.
CoRR, 2024

PROC2PDDL: Open-Domain Planning Representations from Texts.
CoRR, 2024

FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models.
CoRR, 2024

Calibrating Large Language Models with Sample Consistency.
CoRR, 2024

DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows.
CoRR, 2024

OpenPI2.0: An Improved Dataset for Entity Tracking in Texts.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A deep learning method to detect opioid prescription and opioid use disorder from electronic health records.
Int. J. Medical Informatics, March, 2023

Holodeck: Language Guided Generation of 3D Embodied AI Environments.
CoRR, 2023

Report of the 1st Workshop on Generative AI and Law.
CoRR, 2023

Grounded Intuition of GPT-Vision's Abilities with Scientific Images.
CoRR, 2023

Interpretable-by-Design Text Classification with Iteratively Generated Concept Bottleneck.
CoRR, 2023

CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization.
CoRR, 2023

Choice-75: A Dataset on Decision Branching in Script Learning.
CoRR, 2023

Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications.
CoRR, 2023

CALYPSO: LLMs as Dungeon Masters' Assistants.
CoRR, 2023

Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models.
CoRR, 2023

This Land is {Your, My} Land: Evaluating Geopolitical Biases in Language Models.
CoRR, 2023

Learning Interpretable Style Embeddings via Prompting LLMs.
CoRR, 2023

Exploring the Curious Case of Code Prompts.
CoRR, 2023

Human-in-the-Loop Schema Induction.
CoRR, 2023

Language Models are Drummers: Drum Composition with Natural Language Pre-Training.
CoRR, 2023

Representation of Lexical Stylistic Features in Language Models' Embedding Space.
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics, 2023

Faithful Chain-of-Thought Reasoning.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Bidirectional Language Models Are Also Few-shot Learners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Interpretable Style Embeddings via Prompting LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

PAXQA: Generating Cross-lingual Question Answering Examples at Training Scale.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Causal Reasoning of Entities and Events in Procedural Texts.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Learn With Martian: A Tool For Creating Assignments That Can Write And Re-Write Themselves.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improving Mathematics Tutoring With A Code Scratchpad.
Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications, 2023

Automatically Generated Summaries of Video Lectures May Enhance Students' Learning Experience.
Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications, 2023

Enhancing Human Summaries for Question-Answer Generation in Education.
Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications, 2023

CALYPSO: LLMs as Dungeon Master's Assistants.
Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2023

FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Human-in-the-loop Schema Induction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

Explanation-based Finetuning Makes Models More Robust to Spurious Cues.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CoRRPUS: Code-based Structured Prompting for Neurosymbolic Story Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Real or Fake Text?: Investigating Human Ability to Detect Boundaries between Human-Written and Machine-Generated Text.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Rewriting the Script: Adapting Text Instructions for Voice Interaction.
Proceedings of the 2023 ACM Designing Interactive Systems Conference, 2023

2022
CORRPUS: Detecting Story Inconsistencies via Codex-Bootstrapped Neurosymbolic Reasoning.
CoRR, 2022

An AI Dungeon Master's Guide: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons.
CoRR, 2022

Low-Resource Authorship Style Transfer with In-Context Learning.
CoRR, 2022

Towards Faithful Model Explanation in NLP: A Survey.
CoRR, 2022

Multilingual Bidirectional Unsupervised Translation Through Multilingual Finetuning and Back-Translation.
CoRR, 2022

Empathic Conversations: A Multi-level Dataset of Contextualized Conversations.
CoRR, 2022

Creating Multimedia Summaries Using Tweets and Videos.
CoRR, 2022

A Feasibility Study of Answer-Agnostic Question Generation for Education.
CoRR, 2022

CIS2: A Simplified Commonsense Inference Evaluation for Story Prose.
CoRR, 2022

The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Is "My Favorite New Movie" My Favorite Movie? Probing the Understanding of Recursive Noun Phrases.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Did that happen? Predicting Social Media Posts that are Indicative of what happened in a scene: A case study of a TV show.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Unsupervised Entity Linking with Guided Summarization and Multiple-Choice Selection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

A Recipe for Arbitrary Text Style Transfer with Large Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Deduplicating Training Data Makes Language Models Better.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

A Feasibility Study of Answer-Unaware Question Generation for Education.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Induce, Edit, Retrieve: Language Grounded Multimodal Schema for Instructional Video Retrieval.
CoRR, 2021

SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets.
CoRR, 2021

"Wikily" Neural Machine Translation Tailored to Cross-Lingual Tasks.
CoRR, 2021

SynthBio: A Case Study in Faster Curation of Text Datasets.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, 2021

Cultural and Geographical Influences on Image Translatability of Words across Languages.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Goal-Oriented Script Construction.
Proceedings of the 14th International Conference on Natural Language Generation, 2021

Visual Goal-Step Inference using wikiHow.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

"Wikily" Supervised Neural Translation Tailored to Cross-Lingual Tasks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

BiSECT: Learning to Split and Rephrase Sentences with Bitexts.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

GooAQ: Open Question Answering with Diverse Answer Types.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
The CLASSE GATOR (CLinical Acronym SenSE disambiGuATOR): A Method for predicting acronym sense from neonatal clinical notes.
Int. J. Medical Informatics, 2020

Simple-QE: Better Automatic Quality Estimation for Text Simplification.
CoRR, 2020

Automatic Standardization of Colloquial Persian.
CoRR, 2020

Intent Detection with WikiHow.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Reasoning about Goals, Steps, and Temporal Ordering with WikiHow.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

RoFT: A Tool for Evaluating Human Detection of Machine-Generated Text.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Toward Better Storylines with Sentence-Level Language Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Automatic Detection of Generated Text is Easiest when Humans are Fooled.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Paraphrase-Sense-Tagged Sentences.
Trans. Assoc. Comput. Linguistics, 2019

Human and Automatic Detection of Generated Text.
CoRR, 2019

Bilingual is At Least Monolingual (BALM): A Novel Translation Algorithm that Encodes Monolingual Priors.
CoRR, 2019

Anonymization of Sensitive Information in Medical Health Records.
Proceedings of the Iberian Languages Evaluation Forum co-located with 35th Conference of the Spanish Society for Natural Language Processing, 2019

ChatEval: A Tool for Chatbot Evaluation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Seeing Things from a Different Angle: Discovering Diverse Perspectives about Claims.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Comparison of Context-sensitive Models for Lexical Substitution.
Proceedings of the 13th International Conference on Computational Semantics, 2019

Worker Demographics and Earnings on Amazon Mechanical Turk: An Exploratory Analysis.
Proceedings of the Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

Comparison of Diverse Decoding Methods from Conditional Language Models.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

PerspectroScope: A Window to the World of Diverse Perspectives.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Simplification Using Paraphrases and Context-Based Lexical Substitution.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Comparing Constraints for Taxonomic Organization.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Automated Paraphrase Lattice Creation for HyTER Machine Translation Evaluation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Introducing NIEUW: Novel Incentives and Workflows for Eliciting Linguistic Data.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Magnitude: A Fast, Efficient Universal Vector Embedding Utility Package.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

Learning Scalar Adjective Intensity from Paraphrases.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Data-Driven Analysis of Workers' Earnings on Amazon Mechanical Turk.
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018

Learning Translations via Images with a Massively Multilingual Image Dataset.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Crowd control: Effectively utilizing unscreened crowd workers for biomedical data annotation.
J. Biomed. Informatics, 2017

A Comprehensive Analysis of Bilingual Lexicon Induction.
Comput. Linguistics, 2017


Learning Antonyms with Paraphrases and a Morphology-Aware Neural Network.
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017

Mapping the Paraphrase Database to WordNet.
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017

Learning Translations via Matrix Completion.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

KnowYourNyms? A Game of Semantic Relationships.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

The Language of Place: Semantic Value from Geospatial Context.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Systematically Adapting Machine Translation for Grammatical Error Correction.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

Constructing an Alias List for Named Entities during an Event.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

2016
Optimizing Statistical Machine Translation for Text Simplification.
Trans. Assoc. Comput. Linguistics, 2016

End-to-end statistical machine translation with zero or small parallel texts.
Nat. Lang. Eng., 2016

The Gun Violence Database.
CoRR, 2016

So-Called Non-Subsective Adjectives.
Proceedings of the Fifth Joint Conference on Lexical and Computational Semantics, 2016

Sentential Paraphrasing as Black-Box Machine Translation.
Proceedings of the Demonstrations Session, 2016

Clustering Paraphrases by Word Sense.
Proceedings of the NAACL HLT 2016, 2016

The Gun Violence Database: A new task and data set for NLP.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Tense Manages to Predict Implicative Behavior in Verbs.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Simple PPDB: A Paraphrase Database for Simplification.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Most "babies" are "little" and most "problems" are "huge": Compositional Entailment in Adjective-Nouns.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Problems in Current Text Simplification Research: New Data Can Help.
Trans. Assoc. Comput. Linguistics, 2015

Use of Modality and Negation in Semantically-Informed Syntactic MT.
CoRR, 2015

Ideological Perspective Detection Using Semantic Features.
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, 2015

SemEval-2015 Task 1: Paraphrase and Semantic Similarity in Twitter (PIT).
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Cost Optimization in Crowdsourcing Translation: Low cost translations made even cheaper.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Crowdsourcing for NLP.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Extracting Structured Information via Automatic + Human Computation.
Proceedings of the Third AAAI Conference on Human Computation and Crowdsourcing, 2015

Automatically Scoring Freshman Writing: A Preliminary Investigation.
Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015

FrameNet+: Fast Paraphrastic Tripling of FrameNet.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Domain-Specific Paraphrase Extraction.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Adding Semantics to Data-Driven Paraphrasing.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Effectively Crowdsourcing Radiology Report Annotations.
Proceedings of the Sixth International Workshop on Health Text Mining and Information Analysis, 2015

2014
Extracting Lexically Divergent Paraphrases from Twitter.
Trans. Assoc. Comput. Linguistics, 2014

The Language Demographics of Amazon Mechanical Turk.
Trans. Assoc. Comput. Linguistics, 2014

Arabic Dialect Identification.
Comput. Linguistics, 2014

Using Comparable Corpora to Adapt MT Models to New Domains.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

The American Local News Corpus.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

The Multilingual Paraphrase Database.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Translations of the Callhome Egyptian Arabic corpus for conversational speech translation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Poetry of the Crowd: A Human Computation Algorithm to Convert Prose into Rhyming Verse.
Proceedings of the Second AAAI Conference on Human Computation and Crowdsourcing, 2014

Crowd-Workers: Aggregating Information Across Turkers to Help Them Find Higher Paying Work.
Proceedings of the Second AAAI Conference on Human Computation and Crowdsourcing, 2014

PARADIGM: Paraphrase Diagnostics through Grammar Matching.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Crowdsourcing for grammatical error correction.
Proceedings of the Computer Supported Cooperative Work, 2014

Hallucinating Phrase Translations for Low Resource MT.
Proceedings of the Eighteenth Conference on Computational Natural Language Learning, 2014

Are Two Heads Better than One? Crowdsourced Translation via a Two-Step Collaboration of Non-Professional Translators and Editors.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Learning to translate with products of novices: a suite of open-ended challenge problems for teaching MT.
Trans. Assoc. Comput. Linguistics, 2013

Joshua 5.0: Sparser, Better, Faster, Server.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Combining Bilingual and Comparable Corpora for Low Resource Machine Translation.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Findings of the 2013 Workshop on Statistical Machine Translation.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Answer Extraction as Sequence Tagging with Tree Edit Distance.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Supervised Bilingual Lexicon Induction with Multiple Monolingual Signals.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

PPDB: The Paraphrase Database.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

Semi-Markov Phrase-Based Monolingual Alignment.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

A Lightweight and High Performance Monolingual Word Aligner.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

PARMA: A Predicate Argument Aligner.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Dirt Cheap Web-Scale Parallel Text from the Common Crawl.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Use of Modality and Negation in Semantically-Informed Syntactic MT.
Comput. Linguistics, 2012

Using Categorial Grammar to Label Translation Rules.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Constructing Parallel Corpora for Six Indian Languages via Crowdsourcing.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Joshua 4.0: Packing, PRO, and Paraphrases.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Findings of the 2012 Workshop on Statistical Machine Translation.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Monolingual Distributional Similarity for Text-to-Text Generation.
Proceedings of the First Joint Conference on Lexical and Computational Semantics, 2012

Machine Translation of Arabic Dialects.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Expectations of Word Sense in Parallel Corpora.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012





2011
Joshua 3.0: Syntax-based Machine Translation with the Thrax Grammar Extractor.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Findings of the 2011 Workshop on Statistical Machine Translation.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Learning Sentential Paraphrases from Bilingual Parallel Corpora for Text-to-Text Generation.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

The Arabic Online Commentary Dataset: an Annotated Dataset of Informal Arabic with High Dialectal Content.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Crowdsourcing Translation: Professional Quality from Non-Professionals.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Incremental Syntactic Language Models for Phrase-based Translation.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Evaluating Sentence Compression: Pitfalls and Suggested Remedies.
Proceedings of the Workshop on Monolingual Text-To-Text Generation@ACL, 2011

Paraphrastic Sentence Compression with a Character-based Metric: Tightening without Deletion.
Proceedings of the Workshop on Monolingual Text-To-Text Generation@ACL, 2011

Reranking Bilingually Extracted Paraphrases Using Monolingual Distributional Similarity.
Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics, 2011

Paraphrase Fragment Extraction from Monolingual Comparable Corpora.
Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web, 2011

2010
Visualizing Data Structures in Parsing-Based Machine Translation.
Prague Bull. Math. Linguistics, 2010

Hierarchical Phrase-Based Grammar Extraction in Joshua: Suffix Arrays and Prefix Trees.
Prague Bull. Math. Linguistics, 2010

Integrating Output from Specialized Modules in Machine Translation: Transliterations in Joshua.
Prague Bull. Math. Linguistics, 2010

Joshua 2.0: A Toolkit for Parsing-Based Machine Translation with Syntax, Semirings, Discriminative Training and Other Goodies.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Predicting Human-Targeted Translation Edit Rate via Untrained Human Annotators.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Cheap Facts and Counter-Facts.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

Crowdsourced Accessibility: Elicitation of Wikipedia Articles.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

Cheap, Fast and Good Enough: Automatic Speech Recognition with Non-Expert Transcription.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Stream-based Translation Models for Statistical Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Creating Speech and Language Data With Amazon's Mechanical Turk.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

Using Mechanical Turk to Build Machine Translation Evaluation Sets.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

Transliterating From All Languages.
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, 2010

Semantically-Informed Syntactic Machine Translation: A Tree-Grafting Approach.
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, 2010

Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation.
Proceedings of the ACL 2010, 2010

2009
Decoding in Joshua: Open Source, Parsing-Based Machine Translation.
Prague Bull. Math. Linguistics, 2009

Joshua: An Open Source Toolkit for Parsing-Based Machine Translation.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Findings of the 2009 Workshop on Statistical Machine Translation.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Feasibility of Human-in-the-loop Minimum Error Rate Training.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon's Mechanical Turk.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Improving Translation Lexicon Induction from Monolingual Corpora via Dependency Contexts and Part-of-Speech Equivalences.
Proceedings of the Thirteenth Conference on Computational Natural Language Learning, 2009

Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation.
Proceedings of the ACL 2009, 2009

2008
Constructing Corpora for the Development and Evaluation of Paraphrase Systems.
Comput. Linguistics, 2008

Further Meta-Evaluation of Machine Translation.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

Syntactic Constraints on Paraphrases Extracted from Parallel Corpora.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

ParaMetric: An Automatic Evaluation Metric for Paraphrasing.
Proceedings of the COLING 2008, 2008

2007
(Meta-) Evaluation of Machine Translation.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007


2006
Constraining the Phrase-Based, Joint Probability Statistical Translation Model.
Proceedings of the Workshop on Statistical Machine Translation, 2006

Improved Statistical Machine Translation Using Paraphrases.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Re-evaluating the Role of Bleu in Machine Translation Research.
Proceedings of the EACL 2006, 2006

Paraphrase Substitution for Recognizing Textual Entailment.
Proceedings of the Working Notes for CLEF 2006 Workshop co-located with the 10th European Conference on Digital Libraries (ECDL 2006), 2006

Constraining the Phrase-Based, Joint Probability Statistical Translation Model.
Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, 2006

2005
Edinburgh system description for the 2005 IWSLT speech translation evaluation.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

A compact data structure for searchable translation memories.
Proceedings of the 10th EAMT Conference: Practical applications of machine translation, 2005

Scaling Phrase-Based Statistical Machine Translation to Larger Corpora and Longer Phrases.
Proceedings of the ACL 2005, 2005

Paraphrasing with Bilingual Parallel Corpora.
Proceedings of the ACL 2005, 2005

2004
Improving statistical translation through editing.
Proceedings of the 9th EAMT Workshop: Broadening horizons of machine translation and its applications, 2004

Statistical Machine Translation with Word- and Sentence-Aligned Parallel Corpora.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004

2003
Bootstrapping Parallel Corpora.
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, 2003

2001
A program for automatically selecting the best output from multiple machine translation engines.
Proceedings of Machine Translation Summit VIII, 2001

