Graham Neubig

Orcid: 0000-0002-2072-3789

Affiliations:
  • Carnegie Mellon University, USA


According to our database1, Graham Neubig authored at least 490 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
What Are Tools Anyway? A Survey from the Language Model Perspective.
CoRR, 2024

Wav2Gloss: Generating Interlinear Glossed Text from Speech.
CoRR, 2024

RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems.
CoRR, 2024

SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents.
CoRR, 2024

GlossLM: Multilingual Pretraining for Low-Resource Interlinear Glossing.
CoRR, 2024

What Is Missing in Multilingual Visual Reasoning and How to Fix It.
CoRR, 2024

Repetition Improves Language Model Embeddings.
CoRR, 2024

Instruction-tuned Language Models are Better Knowledge Learners.
CoRR, 2024

Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes.
CoRR, 2024

Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate.
CoRR, 2024

VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks.
CoRR, 2024

TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks.
CoRR, 2024

Fine-grained Hallucination Detection and Editing for Language Models.
CoRR, 2024

Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
DIRE and its Data: Neural Decompiled Variable Renamings with Respect to Software Class.
ACM Trans. Softw. Eng. Methodol., April, 2023

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing.
ACM Comput. Surv., 2023

An In-depth Look at Gemini's Language Abilities.
CoRR, 2023

Alignment for Honesty.
CoRR, 2023

Multitask Learning Can Improve Worst-Group Outcomes.
CoRR, 2023

Program-Aided Reasoners (better) Know What They Know.
CoRR, 2023

Divergences between Language Models and Human Brains.
CoRR, 2023

Learning to Filter Context for Retrieval-Augmented Generation.
CoRR, 2023

DeMuX: Data-efficient Multilingual Learning.
CoRR, 2023

Do LLMs exhibit human-like response biases? A case study in survey design.
CoRR, 2023

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents.
CoRR, 2023

It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk.
CoRR, 2023

The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation.
CoRR, 2023

WebArena: A Realistic Web Environment for Building Autonomous Agents.
CoRR, 2023

FacTool: Factuality Detection in Generative AI - A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios.
CoRR, 2023

Improving Factuality of Abstractive Summarization via Contrastive Reward Learning.
CoRR, 2023

Large Language Models Enable Few-Shot Clustering.
CoRR, 2023

Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction.
CoRR, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
CoRR, 2023

Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation.
CoRR, 2023

A Gold Standard Dataset for the Reviewer Assignment Problem.
CoRR, 2023

User-Centric Evaluation of OCR Systems for Kwak'wala.
CoRR, 2023

Learning Performance-Improving Code Edits.
CoRR, 2023

ChatGPT MT: Competitive for High- (but Not Low-) Resource Languages.
Proceedings of the Eighth Conference on Machine Translation, 2023

The Devil Is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation.
Proceedings of the Eighth Conference on Machine Translation, 2023

Syntax and Semantics Meet in the "Middle": Probing the Syntax-Semantics Interface of LMs Through Agentivity.
Proceedings of the The 12th Joint Conference on Lexical and Computational Semantics, 2023

SigMoreFun Submission to the SIGMORPHON Shared Task on Interlinear Glossing.
Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, 2023

Unlimiformer: Long-Range Transformers with Unlimited Length Input.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Why do Nearest Neighbor Language Models Work?
Proceedings of the International Conference on Machine Learning, 2023

Cross-Modal Fine-Tuning: Align then Refine.
Proceedings of the International Conference on Machine Learning, 2023

PAL: Program-aided Language Models.
Proceedings of the International Conference on Machine Learning, 2023

DocPrompting: Generating Code by Retrieving the Docs.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DiffusER: Diffusion via Edit-based Reconstruction.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Mega: Moving Average Equipped Gated Attention.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Computational Language Acquisition with Theory of Mind.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

AANG : Automating Auxiliary Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Execution-Based Evaluation for Open-Domain Code Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Prompt2Model: Generating Deployable Models from Natural Language Instructions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

T5Score: Discriminative Fine-tuning of Generative Evaluation Metrics.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Active Retrieval Augmented Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

CTC Alignments Improve Autoregressive Translation.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

A Multi-dimensional Evaluation of Tokenizer-free Multilingual Pretrained Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

EXCALIBUR: Encouraging and Evaluating Embodied Exploration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Multi-lingual and Multi-cultural Figurative Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multi-Dimensional Evaluation of Text Summarization with In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

When Does Translation Require Context? A Data-driven, Multilingual Exploration.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023


DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
In-IDE Code Generation from Natural Language: Promise and Challenges.
ACM Trans. Softw. Eng. Methodol., 2022

Evaluating Explanations: How Much Do Explanations from the Teacher Aid Students?
Trans. Assoc. Comput. Linguistics, 2022

Can We Automate Scientific Reviewing?
J. Artif. Intell. Res., 2022

AmericasNLI: Machine translation and natural language inference systems for Indigenous languages of the Americas.
Frontiers Artif. Intell., 2022

NusaCrowd: Open Source Initiative for Indonesian NLP Resources.
CoRR, 2022

Searching for Effective Multilingual Fine-Tuning Methods: A Case Study in Summarization.
CoRR, 2022

DiffusER: Discrete Diffusion via Edit-based Reconstruction.
CoRR, 2022

DocCoder: Generating Code by Retrieving and Reading Docs.
CoRR, 2022

Table Retrieval May Not Necessitate Table-specific Model Design.
CoRR, 2022

Testing the Ability of Language Models to Interpret Figurative Language.
CoRR, 2022

Learning to Scaffold: Optimizing Model Explanations for Teaching.
CoRR, 2022

AUTOLEX: An Automatic Framework for Linguistic Exploration.
CoRR, 2022

Augmenting Decompiler Output with Learned Variable Names and Types.
Proceedings of the 31st USENIX Security Symposium, 2022

A systematic evaluation of large language models of code.
Proceedings of the MAPS@PLDI 2022: 6th ACM SIGPLAN International Symposium on Machine Programming, 2022

Learning to Scaffold: Optimizing Model Explanations for Teaching.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Testing the Ability of Language Models to Interpret Figurative Language.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Quality-Aware Decoding for Neural Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

CMU's IWSLT 2022 Dialect Speech Translation System.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Building African Voices.
Proceedings of the Interspeech 2022, 2022

VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

Symmetric Machine Theory of Mind.
Proceedings of the International Conference on Machine Learning, 2022

Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval.
Proceedings of the International Conference on Machine Learning, 2022

Capturing Structural Locality in Non-parametric Language Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Distributionally Robust Models with Parametric Likelihood Ratios.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards a Unified View of Parameter-Efficient Transfer Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Should We Be Pre-training? An Argument for End-task Aware Training as an Alternative.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Prompt Consistency for Zero-Shot Task Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Interpreting Language Models with Contrastive Explanations.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Paraphrastic Representations at Scale.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

KGxBoard: Explainable and Interactive Leaderboard for Evaluation of Knowledge Graph Completion Models.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

English Contrastive Learning Can Learn Universal Cross-lingual Sentence Embeddings.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learning to Model Editing Processes.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Language Models of Code are Few-Shot Commonsense Learners.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Are representations built from the ground up? An empirical examination of local composition in language models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Retrieval as Attention: End-to-end Learning of Retrieval and Reading within a Single Transformer.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

He Said, She Said: Style Transfer for Shifting the Perspective of Dialogues.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022


Understanding and Improving Zero-shot Multi-hop Reasoning in Generative Question Answering.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

On The Ingredients of an Effective Zero-shot Semantic Parser.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

DataLab: A Platform for Data Analysis and Intervention.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

BRIO: Bringing Order to Abstractive Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Breaking Down Multilingual Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Systematic Inequalities in Language Technology Performance across the World's Languages.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

DEEP: DEnoising Entity Pre-training for Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Lexically Aware Semi-Supervised Learning for OCR Post-Correction.
Trans. Assoc. Comput. Linguistics, 2021

How Can We Know <i>When</i> Language Models Know? On the Calibration of Language Models for Question Answering.
Trans. Assoc. Comput. Linguistics, 2021

WikiAsp: A Dataset for Multi-domain Aspect-based Summarization.
Trans. Assoc. Comput. Linguistics, 2021

Reducing Confusion in Active Learning for Part-Of-Speech Tagging.
Trans. Assoc. Comput. Linguistics, 2021

MasakhaNER: Named Entity Recognition for African Languages.
Trans. Assoc. Comput. Linguistics, 2021

Learning to Superoptimize Real-world Programs.
CoRR, 2021

Hierarchical Control of Situated Agents through Natural Language.
CoRR, 2021

When Does Translation Require Context? A Data-driven, Multilingual Exploration.
CoRR, 2021

AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages.
CoRR, 2021

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation.
CoRR, 2021

Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

Phrase-level Active Learning for Neural Machine Translation.
Proceedings of the Sixth Conference on Machine Translation, 2021

BARTScore: Evaluating Generated Text as Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

MetaXL: Meta Representation Transformation for Low-resource Cross-lingual Learning.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Multi-view Subword Regularization.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

On Learning Text Style Transfer with Direct Rewards.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Explicit Alignment Objectives for Multilingual Bidirectional Encoders.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

GSum: A General Framework for Guided Neural Abstractive Summarization.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Data Augmentation for Sign Language Gloss Translation.
Proceedings of the 1st International Workshop on Automatic Translation for Signed and Spoken Languages, 2021

Phoneme Recognition Through Fine Tuning of Phonetic Representations: A Case Study on Luhya Language Varieties.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Few-shot Language Coordination by Modeling Theory of Mind.
Proceedings of the 38th International Conference on Machine Learning, 2021

Examining and Combating Spurious Features under Distribution Shift.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Structural Edits via Incremental Tree Transformations.
Proceedings of the 9th International Conference on Learning Representations, 2021

Meta Back-Translation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Modeling the Second Player in Distributionally Robust Optimization.
Proceedings of the 9th International Conference on Learning Representations, 2021

Distributionally Robust Multilingual Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Efficient Test Time Adapter Ensembling for Low-resource Language Varieties.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Evaluating the Morphosyntactic Well-formedness of Generated Texts.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Efficient Nearest Neighbor Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Towards More Fine-grained and Reliable NLP Performance Prediction.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Word Alignment by Fine-tuning Embeddings on Parallel Corpora.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Dependency Induction Through the Lens of Visual Perception.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Detecting Hallucinated Content in Conditional Neural Sequence Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Do Context-Aware Translation Models Pay the Right Attention?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

CitationIE: Leveraging the Citation Graph for Scientific Information Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ExplainaBoard: An Explainable Leaderboard for NLP.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Measuring and Increasing Context Usage in Context-Aware Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Speech Technology for Unwritten Languages.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multi-Source Neural Machine Translation With Missing Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

The Return of Lexical Dependencies: Neural Lexicalized PCFGs.
Trans. Assoc. Comput. Linguistics, 2020

Improving Candidate Generation for Low-resource Cross-lingual Entity Linking.
Trans. Assoc. Comput. Linguistics, 2020

How Can We Know What Language Models Know.
Trans. Assoc. Comput. Linguistics, 2020

Improving neural machine translation through phrase-based soft forced decoding.
Mach. Transl., 2020

Optimizing segmentation granularity for neural machine translation.
Mach. Transl., 2020

A Set of Recommendations for Assessing Human-Machine Parity in Language Translation.
J. Artif. Intell. Res., 2020

How Can We Know When Language Models Know?
CoRR, 2020

Evaluating Explanations: How much do explanations from the teacher aid students?
CoRR, 2020

Decoding and Diversity in Machine Translation.
CoRR, 2020

A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos.
CoRR, 2020

Practical Comparable Data Collection for Low-Resource Languages via Images.
CoRR, 2020

Weight Poisoning Attacks on Pre-trained Models.
CoRR, 2020

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization.
CoRR, 2020

Findings of the WMT 2020 Shared Task on Machine Translation Robustness.
Proceedings of the Fifth Conference on Machine Translation, 2020

A Summary of the First Workshop on Language Technology for Language Documentation and Revitalization.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Transliteration for Cross-Lingual Morphological Inflection.
Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, 2020

Learning Sparse Prototypes for Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

AlloVera: A Multilingual Allophone Database.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Optimizing Data Usage via Differentiable Rewards.
Proceedings of the 37th International Conference on Machine Learning, 2020

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Understanding Knowledge Distillation in Non-autoregressive Machine Translation.
Proceedings of the 8th International Conference on Learning Representations, 2020

Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework.
Proceedings of the 8th International Conference on Learning Representations, 2020

A Probabilistic Formulation of Unsupervised Text Style Transfer.
Proceedings of the 8th International Conference on Learning Representations, 2020

Differentiable Reasoning over a Virtual Knowledge Base.
Proceedings of the 8th International Conference on Learning Representations, 2020

Universal Phone Recognition with a Multilingual Allophone System.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Bilingual Generative Transformer for Semantic Sentence Embedding.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

OCR Post Correction for Endangered Language Texts.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Weakly- and Semi-supervised Evidence Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

NeuSpell: A Neural Spelling Correction Toolkit.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Improving Target-side Lexical Transfer in Multilingual Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Interpretable Multi-dataset Evaluation for Named Entity Recognition.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Dynamic Data Selection and Weighting for Iterative Back-Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Automatic Extraction of Rules Governing Morphological Agreement.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Re-evaluating Evaluation in Text Summarization.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TICO-19: the Translation Initiative for COvid-19.
Proceedings of the 1st Workshop on NLP for COVID-19@ EMNLP 2020, Online, December 2020, 2020

Project MAIA: Multilingual AI Agent Assistant.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Endangered Languages meet Modern NLP.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Learning Relation Entailment with Structured and Textual Information.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Findings of the Fourth Workshop on Neural Generation and Translation.
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020

TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Incorporating External Knowledge through Pre-training for Natural Language to Code Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Predicting Performance for Natural Language Processing Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Balancing Training for Multilingual Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Soft Gazetteers for Low-Resource Named Entity Recognition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning to Deceive with Attention-Based Explanations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Politeness Transfer: A Tag and Generate Approach.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Weight Poisoning Attacks on Pretrained Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Generalizing Natural Language Analysis through Span-relation Representations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Should All Cross-Lingual Embeddings Speak English?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Merging Weak and Active Supervision for Semantic Parsing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

What Makes A Good Story? Designing Composite Rewards for Visual Storytelling.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Latent Relation Language Models.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation.
Trans. Assoc. Comput. Linguistics, 2019

The ARIEL-CMU Systems for LoReHLT18.
CoRR, 2019

An Adversarial Approach to High-Quality, Sentiment-Controlled Neural Dialogue Generation.
CoRR, 2019

Improving Robustness of Neural Machine Translation with Multi-task Learning.
Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the First Shared Task on Machine Translation Robustness.
Proceedings of the Fourth Conference on Machine Translation, 2019

Contextualized Representations for Low-resource Utterance Tagging.
Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019

Are Sixteen Heads Really Better than One?
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Density Matching for Bilingual Word Embedding.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Lost in Interpretation: Predicting Untranslated Terminology in Simultaneous Interpretation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Improving Robustness of Machine Translation with Synthetic Noise.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Competence-based Curriculum Learning for Neural Machine Translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

compare-mt: A Tool for Holistic Comparison of Language Generation Systems.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Learning to Describe Unknown Phrases with Local and Global Contexts.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

DIRE: A Neural Approach to Decompiled Identifier Naming.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Mitigating Noisy Inputs for Question Answering.
Proceedings of the Interspeech 2019, 2019

Learning to Represent Edits.
Proceedings of the 7th International Conference on Learning Representations, 2019

Multilingual Neural Machine Translation With Soft Decoupled Encoding.
Proceedings of the 7th International Conference on Learning Representations, 2019

Lagging Inference Networks and Posterior Collapse in Variational Autoencoders.
Proceedings of the 7th International Conference on Learning Representations, 2019

Handling Syntactic Divergence in Low-resource Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Findings of the Third Workshop on Neural Generation and Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Domain Differential Adaptation for Neural Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Little Annotation does a Lot of Good: A Study in Bootstrapping Low-resource Named Entity Recognizers.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Pushing the Limits of Low-Resource Morphological Inflection.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Comparing Top-Down and Bottom-Up Neural Generative Dependency Models.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Reranking for Neural Semantic Parsing.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Generalized Data Augmentation for Low-Resource Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Simple and Effective Paraphrastic Similarity from Parallel Translations.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Beyond BLEU: Training Neural Machine Translation with Semantic Similarity.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Target Conditioned Sampling: Optimizing Data Selection for Multilingual Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Self-Attentional Models for Lattice Inputs.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Bilingual Lexicon Induction with Semi-supervision in Non-Isometric Embedding Spaces.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Choosing Transfer Languages for Cross-Lingual Learning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Improving Open Information Extraction via Iterative Rank-Aware Learning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Cross-Lingual Syntactic Transfer through Unsupervised Adaptation of Invertible Projections.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Towards Zero-resource Cross-lingual Entity Linking.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

Zero-Shot Neural Transfer for Cross-Lingual Entity Linking.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Neural Lattice Language Models.
Trans. Assoc. Comput. Linguistics, 2018

An end-to-end model for cross-lingual transformation of paralinguistic information.
Mach. Transl., 2018

Learning to Generate Corrective Patches using Neural Machine Translation.
CoRR, 2018

Towards a General-Purpose Linguistic Annotation Backend.
CoRR, 2018

Learning to Describe Phrases with Local and Global Contexts.
CoRR, 2018

Parameter Sharing Methods for Multilingual Self-Attentional Translation Models.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

Contextual Encoding for Translation Quality Estimation.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Cavs: An Efficient Runtime System for Dynamic Neural Networks.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

Guiding Neural Machine Translation with Retrieved Translation Pieces.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

When and Why Are Pre-Trained Word Embeddings Useful for Neural Machine Translation?
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Modelling Natural Language, Programs, and their Intersection.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Using Morphological Knowledge in Open-Vocabulary Neural Language Models.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Handling Homographs in Neural Machine Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Attentive Interaction Model: Modeling Changes in View in Argumentation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Learning to mine aligned code and natural language pairs from stack overflow.
Proceedings of the 15th International Conference on Mining Software Repositories, 2018

Evaluation Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Multi-Source Neural Machine Translation with Data Augmentation.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

Self-Attentional Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Learning to mine parallel natural language/source code corpora from stack overflow.
Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, 2018

Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

TRANX: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

Neural Cross-lingual Named Entity Recognition with Minimal Resources.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Tree-based Decoder for Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

SwitchOut: an Efficient Data Augmentation Algorithm for Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Contextual Parameter Generation for Universal Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Rapid Adaptation of Neural Machine Translation to New Languages.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

MTNT: A Testbed for Machine Translation of Noisy Text.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Unsupervised Learning of Syntactic Structure with Invertible Neural Projections.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Retrieval-Based Neural Code Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Adapting Word Embeddings to New Languages with Morphological and Phonological Subword Representations.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Stress Test Evaluation for Natural Language Inference.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

XNMT: The eXtensible Neural Machine Translation Toolkit.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

Findings of the Second Workshop on Neural Machine Translation and Generation.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

Automatic Estimation of Simultaneous Interpreter Performance.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

StructVAE: Tree-structured Latent Variable Models for Semi-supervised Semantic Parsing.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Neural Factor Graph Models for Cross-lingual Morphological Tagging.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Extreme Adaptation for Personalized Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Learning to Generate Move-by-Move Commentary for Chess Games from Large-Scale Social Forum Data.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Stack-Pointer Networks for Dependency Parsing.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

A Continuous Relaxation of Beam Search for End-to-End Training of Neural Sequence Models.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Preserving Word-Level Emphasis in Speech-to-Speech Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Transcribing against time.
Speech Commun., 2017

Cavs: A Vertex-centric Programming Interface for Dynamic Neural Networks.
CoRR, 2017

Convolutional Neural Networks for Medical Diagnosis from Admission Notes.
CoRR, 2017

DyNet: The Dynamic Neural Network Toolkit.
CoRR, 2017

Neural Machine Translation and Sequence-to-sequence Models: A Tutorial.
CoRR, 2017

Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML.
CoRR, 2017

NICT-NAIST System for WMT17 Multimodal Translation Task.
Proceedings of the Second Conference on Machine Translation, 2017

Tree as a Pivot: Syntactic Matching Methods in Pivot Translation.
Proceedings of the Second Conference on Machine Translation, 2017

How Would You Say It? Eliciting Lexically Diverse Dialogue for Supervised Semantic Parsing.
Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue, 2017

Controllable Invariance through Adversarial Feature Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

On-the-fly Operation Batching in Dynamic Computation Graphs.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Adaptive Spelling Error Correction Models for Learner English.
Proceedings of the Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 21st International Conference KES-2017, 2017

Semi-Supervised Learning of a Pronunciation Dictionary from Disjoint Phonemic Transcripts and Text.
Proceedings of the Interspeech 2017, 2017

Improving Neural Machine Translation through Phrase-based Forced Decoding.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Neural Lattice-to-Sequence Models for Uncertain Inputs.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Learning Language Representations for Typology Prediction.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Charmanteau: Character Embedding Models For Portmanteau Creation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

What Do Recurrent Neural Network Grammars Learn About Syntax?
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Learning to Translate in Real-time with Neural Machine Translation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Cross-Lingual Word Embeddings for Low-Resource Language Modeling.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Morphological Inflection Generation with Multi-space Variational Encoder-Decoders.
Proceedings of the CoNLL SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection, 2017

An investigation of how to design control parameters for statistical voice timbre control.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Overview of the 4th Workshop on Asian Translation.
Proceedings of the 4th Workshop on Asian Translation, 2017

An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation.
Proceedings of the First Workshop on Neural Machine Translation, 2017

Stronger Baselines for Trustable Results in Neural Machine Translation.
Proceedings of the First Workshop on Neural Machine Translation, 2017

Multi-space Variational Encoder-Decoders for Semi-supervised Labeled Sequence Transduction.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

A Syntactic Neural Model for General-Purpose Code Generation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Neural Machine Translation via Binary Code Prediction.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Learning Character-level Compositionality with Visual Features.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Phonemic Transcription of Low-Resource Tonal Languages.
Proceedings of the Australasian Language Technology Association Workshop, 2017

2016
Teaching Social Communication Skills Through Human-Agent Interaction.
ACM Trans. Interact. Intell. Syst., 2016

Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Learning cooperative persuasive dialogue policies using framing.
Speech Commun., 2016

Learning local word reorderings for hierarchical phrase-based statistical machine translation.
Mach. Transl., 2016

A comparative study of dictionaries and corpora as methods for language resource addition.
Lang. Resour. Evaluation, 2016

A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models.
IEICE Trans. Inf. Syst., 2016

Non-Native Text-to-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics.
IEICE Trans. Inf. Syst., 2016

Neural Network Approaches to Dialog Response Retrieval and Generation.
IEICE Trans. Inf. Syst., 2016

Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior.
IEICE Trans. Inf. Syst., 2016

Optimization for Statistical Machine Translation: A Survey.
Comput. Linguistics, 2016

Deep bottleneck features and sound-dependent i-vectors for simultaneous recognition of speech and environmental sounds.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Analyzing the Effect of Entrainment on Dialogue Acts.
Proceedings of the SIGDIAL 2016 Conference, 2016

Selecting Syntactic, Non-redundant Segments in Active Learning for Machine Translation.
Proceedings of the NAACL HLT 2016, 2016

Morphological Inflection Generation Using Character Sequence to Sequence Learning.
Proceedings of the NAACL HLT 2016, 2016

Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Active Learning for Example-Based Dialog Systems.
Proceedings of the Dialogues with Social Robots, 2016

Unsupervised Phoneme Segmentation of Previously Unseen Languages.
Proceedings of the Interspeech 2016, 2016

Unsupervised Joint Estimation of Grapheme-to-Phoneme Conversion Systems and Acoustic Model Adaptation for Non-Native Speech Recognition.
Proceedings of the Interspeech 2016, 2016

A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training.
Proceedings of the Interspeech 2016, 2016

Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models.
Proceedings of the Interspeech 2016, 2016

Learning a Translation Model from Word Lattices.
Proceedings of the Interspeech 2016, 2016

Personalized unknown word detection in non-native language reading using eye gaze.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Real-time vibration control of an electrolarynx based on statistical F0 contour prediction.
Proceedings of the 24th European Signal Processing Conference, 2016

Generalizing and Hybridizing Count-based and Neural Language Models.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Controlling Output Length in Neural Encoder-Decoders.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Incorporating Discrete Translation Lexicons into Neural Machine Translation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Learning a Lexicon and Translation Model from Phoneme Lattices.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Automated social skills training with audiovisual information.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016

Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016

Lightly Supervised Quality Estimation.
Proceedings of the COLING 2016, 2016

Lexicons and Minimum Risk Training for Neural Machine Translation: NAIST-CMU at WAT2016.
Proceedings of the 3rd Workshop on Asian Translation, 2016

Overview of the 3rd Workshop on Asian Translation.
Proceedings of the 3rd Workshop on Asian Translation, 2016

A Continuous Space Rule Selection Model for Syntax-based Statistical Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Semantic Parsing of Ambiguous Input through Paraphrasing and Verification.
Trans. Assoc. Comput. Linguistics, 2015

NOCOA+: Multimodal Computer-Based Training for Social and Communication Skills.
IEICE Trans. Inf. Syst., 2015

An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Pointwise Prediction and Sequence-Based Reranking for Adaptable Part-of-Speech Tagging.
Proceedings of the Computational Linguistics, 2015

Construction and analysis of social-affective interaction corpus in English and Indonesian.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Ckylark: A More Robust PCFG-LA Parser.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Multi-Target Machine Translation with Multi-Synchronous Context-free Grammars.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Learning to Generate Pseudo-Code from Source Code Using Statistical Machine Translation (T).
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015

Pseudogen: A Tool to Automatically Generate Pseudo-Code from Source Code.
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015

Parser self-training for syntax-based machine translation.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

The NAIST English speech recognition system for IWSLT 2015.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Improving translation of emphasis with pause prediction in speech-to-speech translation systems.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Automated Social Skills Trainer.
Proceedings of the 20th International Conference on Intelligent User Interfaces, 2015

Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential.
Proceedings of the INTERSPEECH 2015, 2015

Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments.
Proceedings of the INTERSPEECH 2015, 2015

Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics.
Proceedings of the INTERSPEECH 2015, 2015

A latent variable model for joint pause prediction and dependency parsing.
Proceedings of the INTERSPEECH 2015, 2015

Speed or accuracy? a study in evaluation of simultaneous speech translation.
Proceedings of the INTERSPEECH 2015, 2015

Statistical singing voice conversion based on direct waveform modification with global variance.
Proceedings of the INTERSPEECH 2015, 2015

Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs.
Proceedings of the INTERSPEECH 2015, 2015

Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

EEG signal enhancement using multi-channel wiener filter with a spatial correlation prior.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A Binarized Neural Network Joint Model for Machine Translation.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

An evaluation of EEG ocular artifact removal with a multi-channel wiener filter based on probabilistic generative model.
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction.
Proceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility, 2015

Incremental sentence compression using LSTM recurrent networks.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Adaptive selection from multiple response candidates in example-based dialogue.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

A study of social-affective communication: Automatic prediction of emotion triggers and responses in television talk shows.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT2015.
Proceedings of the 2nd Workshop on Asian Translation, 2015

Overview of the 2nd Workshop on Asian Translation.
Proceedings of the 2nd Workshop on Asian Translation, 2015

Syntax-based Simultaneous Translation through Prediction of Unseen Syntactic Constituents.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Improving Pivot Translation by Remembering the Pivot.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

An Analysis Towards Dialogue-Based Deception Detection.
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

Unknown Word Detection Based on Event-Related Brain Desynchronization Responses.
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

Linguistic Individuality Transformation for Spoken Language.
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

A Study on Natural Expressive Speech: Automatic Memorable Spoken Quote Detection.
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

Evaluation of a Fully Automatic Cooperative Persuasive Dialogue System.
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

2014
Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff.
Trans. Assoc. Comput. Linguistics, 2014

Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis.
IEEE J. Sel. Top. Signal Process., 2014

A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation.
IEICE Trans. Inf. Syst., 2014

Utilizing Human-to-Human Conversation Examples for a Multi Domain Chat-Oriented Dialog System.
IEICE Trans. Inf. Syst., 2014

Structured Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model.
IEICE Trans. Inf. Syst., 2014

Voice Timbre Control Based on Perceived Age in Singing Voice Conversion.
IEICE Trans. Inf. Syst., 2014

Rule-based Syntactic Preprocessing for Syntax-based Machine Translation.
Proceedings of SSST@EMNLP 2014, 2014

On-the-fly user modeling for cost-sensitive correction of speech transcripts.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Improving the robustness of example-based dialog retrieval using recursive neural network paraphrase identification.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Conversation dialog corpora from television and movie scripts.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Building a free, general-domain paraphrase database for Japanese.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Memorable spoken quote corpora of TED public speaking.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Collection and analysis of a Japanese-English emphasized speech corpora.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Collection of a Simultaneous Translation Corpus for Comparative Analysis.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Language Resource Addition: Dictionary or Corpus?
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

NTT-NAIST syntax-based SMT systems for IWSLT 2014.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

The NAIST-NTT TED talk treebank.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Emotion and Its Triggers in Human Spoken Dialogue: Recognition and Analysis.
Proceedings of the Situated Dialog in Speech-Based Human-Computer Interaction, 2014

Construction and Analysis of a Persuasive Dialogue Corpus.
Proceedings of the Situated Dialog in Speech-Based Human-Computer Interaction, 2014

Articulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models.
Proceedings of the INTERSPEECH 2014, 2014

Direct F<sub>0</sub> control of an electrolarynx based on statistical excitation feature prediction and its evaluation through simulation.
Proceedings of the INTERSPEECH 2014, 2014

Data-driven generation of text balloons based on linguistic and acoustic features of a comics-anime corpus.
Proceedings of the INTERSPEECH 2014, 2014

Structured soft margin confidence weighted learning for grapheme-to-phoneme conversion.
Proceedings of the INTERSPEECH 2014, 2014

Statistical singing voice conversion with direct waveform modification based on the spectrum differential.
Proceedings of the INTERSPEECH 2014, 2014

A hearing impairment simulation method using audiogram-based approximation of auditory charatecteristics.
Proceedings of the INTERSPEECH 2014, 2014

An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014

A postfilter to modify the modulation spectrum in HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Narrow Adaptive Regularization of weights for grapheme-to-phoneme conversion.
Proceedings of the IEEE International Conference on Acoustics, 2014

Regression approaches to perceptual age control in singing voice conversion.
Proceedings of the IEEE International Conference on Acoustics, 2014

Acquiring a Dictionary of Emotion-Provoking Events.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Reinforcement Learning of Cooperative Persuasive Dialogue Policies using Framing.
Proceedings of the COLING 2014, 2014

Discriminative Language Models as a Tool for Machine Translation Error Analysis.
Proceedings of the COLING 2014, 2014

Unnecessary utterance detection for avoiding digressions in discussion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

An inter-speaker evaluation through simulation of electrolarynx control based on statistical F0 prediction.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

An event-related brain potential study on the impact of speech recognition errors.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Recursive neural network paraphrase identification for example-based dialog retrieval.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

The use of semantic and acoustic features for open-domain TED talk summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Gender-dependent spectrum differential models for perceived age control based on direct waveform modification in singing voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Forest-to-String SMT for Asian Language Translation: NAIST at WAT 2014.
Proceedings of the 1st Workshop on Asian Translation, 2014

Optimizing Segmentation Strategies for Simultaneous Speech Translation.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

On the Elements of an Accurate Tree-to-String Machine Translation System.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Linguistic and Acoustic Features for Automatic Identification of Autism Spectrum Disorders in Children's Narrative.
Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2014

2013
Substring-based machine translation.
Mach. Transl., 2013

Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

NTT-NAIST SMT systems for IWSLT 2013.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Constructing a speech translation system using simultaneous interpretation data.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

The NAIST English speech recognition system for IWSLT 2013.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion.
Proceedings of the INTERSPEECH 2013, 2013

Improvements to HMM-based speech synthesis based on parameter generation with rich context models.
Proceedings of the INTERSPEECH 2013, 2013

Efficient speech transcription through respeaking.
Proceedings of the INTERSPEECH 2013, 2013

An empirical comparison of joint optimization techniques for speech translation.
Proceedings of the INTERSPEECH 2013, 2013

A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion.
Proceedings of the INTERSPEECH 2013, 2013

Grapheme-to-phoneme conversion based on adaptive regularization of weight vectors.
Proceedings of the INTERSPEECH 2013, 2013

An investigation of acoustic features for singing voice conversion based on perceptual age.
Proceedings of the INTERSPEECH 2013, 2013

Generalizing continuous-space translation of paralinguistic information.
Proceedings of the INTERSPEECH 2013, 2013

Simple, lexicalized choice of translation timing for simultaneous speech translation.
Proceedings of the INTERSPEECH 2013, 2013

A Framework and Tool for Collaborative Extraction of Reliable Information.
Proceedings of the Workshop on Language Processing and Crisis Information@IJCNLP 2013, 2013

Modality and contextual differences in computer based non-verbal communication training.
Proceedings of the IEEE 4th International Conference on Cognitive Infocommunications, 2013

NAIST at the CLEF 2013 QA4MRE Pilot Task.
Proceedings of the Working Notes for CLEF 2013 Conference , 2013

Inter-Sentence Features and Thresholded Minimum Error Rate Training: NAIST at CLEF 2013 QA4MRE.
Proceedings of the Working Notes for CLEF 2013 Conference , 2013

Dialogue management for leading the conversation in persuasive dialogue systems.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Towards High-Reliability Speech Translation in the Medical Domain.
Proceedings of the First Workshop on Natural Language Processing for Medical and Healthcare Fields@IJCNLP 2013, 2013

Travatar: A Forest-to-String Machine Translation Engine based on Tree Transducers.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Adaptation Data Selection using Neural Language Models: Experiments in Machine Translation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

How Much Is Said in a Tweet? A Multilingual, Information-theoretic Perspective.
Proceedings of the Analyzing Microtext, 2013

2012
Unsupervised learning of lexical information for language processing systems.
PhD thesis, 2012

Joint Phrase Alignment and Extraction for Statistical Machine Translation.
J. Inf. Process., 2012

Bayesian Learning of a Language Model from Continuous Speech.
IEICE Trans. Inf. Syst., 2012

A monotonic statistical machine translation approach to speaking style transformation.
Comput. Speech Lang., 2012

The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The NAIST machine translation system for IWSLT2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

A method for translation of paralinguistic information.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The KIT-NAIST (contrastive) English ASR system for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Developing Non-goal Dialog System Based on Examples of Drama Television.
Proceedings of the Natural Interaction with Robots, 2012

Inducing a Discriminative Parser to Optimize Machine Translation Reordering.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Non-verbal cognitive skills and autistic conditions: An analysis and training tool.
Proceedings of the IEEE 3rd International Conference on Cognitive Infocommunications, 2012

Machine Translation without Words through Substring Alignment.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Searching Translation Memories for Paraphrases.
Proceedings of Machine Translation Summit XIII: Papers, 2011

The NICT translation system for IWSLT 2011.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

A Pointwise Approach to Pronunciation Estimation for a TTS Front-End.
Proceedings of the INTERSPEECH 2011, 2011

Safety Information Mining - What can NLP do in a disaster -.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Training Dependency Parsers from Partially Annotated Corpora.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

An Unsupervised Model for Joint Phrase Alignment and Extraction.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Word-based Partial Annotation for Efficient Corpus Construction.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Learning a language model from continuous speech.
Proceedings of the INTERSPEECH 2010, 2010

Semi-automated update of automatic transcription system for the Japanese national congress.
Proceedings of the INTERSPEECH 2010, 2010

Improved statistical models for SMT-based speaking style transformation.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
A WFST-based log-linear framework for speaking-style transformation.
Proceedings of the INTERSPEECH 2009, 2009


  Loading...