David Chiang

Affiliations:
  • University of Notre Dame, IN, USA
  • University of Southern California, Marina del Rey, CA, USA (2006 - 2014)
  • University of Maryland, College Park, MD, USA (2004 - 2005)
  • University of Pennsylvania, Philadelphia, PA, USA (PhD 2004)


According to our database, David Chiang authored at least 105 papers between 2000 and 2023.

Bibliography

2023
Exact Recursive Probabilistic Programming.
Proc. ACM Program. Lang., April, 2023

Bridging Graph Position Encodings for Transformers with Weighted Graph-Walking Automata.
Trans. Mach. Learn. Res., 2023

Named Tensor Notation.
Trans. Mach. Learn. Res., 2023

Transformers as Recognizers of Formal Languages: A Survey on Expressivity.
CoRR, 2023

Masked Hard-Attention Transformers and Boolean RASP Recognize Exactly the Star-Free Languages.
CoRR, 2023

Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns.
CoRR, 2023

Universal Automatic Phonetic Transcription into the International Phonetic Alphabet.
CoRR, 2023

Fine-Tuning BERT with Character-Level Noise for Zero-Shot Transfer to Dialects and Closely-Related Languages.
Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

Tighter Bounds on the Expressivity of Transformer Encoders.
Proceedings of the International Conference on Machine Learning, 2023

The Surprising Computational Power of Nondeterministic Stack RNNs.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Introducing Rhetorical Parallelism Detection: A New Task with Datasets, Metrics, and Baselines.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Convergence and Diversity in the Control Hierarchy.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Measuring Human Perception to Improve Handwritten Document Transcription.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning Hierarchical Structures with Differentiable Nondeterministic Stacks.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Algorithms for Weighted Pushdown Automata.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Continuum of Generation Tasks for Investigating Length Bias and Degenerate Repetition.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

Overcoming a Theoretical Limitation of Self-Attention.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Syntax-Based Attention Masking for Neural Machine Translation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, 2021

Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

2020
Translating Recursive Probabilistic Programs to Factor Graph Grammars.
CoRR, 2020

Representing Unordered Data Using Multiset Automata and Complex Numbers.
CoRR, 2020

Look It Up: Bilingual and Monolingual Dictionaries Improve Neural Machine Translation.
Proceedings of the Fifth Conference on Machine Translation, 2020

Factor Graph Grammars.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Representing Unordered Data Using Complex-Weighted Multiset Automata.
Proceedings of the 37th International Conference on Machine Learning, 2020

Learning Context-free Languages with Nondeterministic Stack RNNs.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

2019
Learning Hyperedge Replacement Grammars for Graph Generation.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Neural Machine Translation of Text from Non-Native Speakers.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Efficiency through Auto-Sizing: Notre Dame NLP's Submission to the WNGT 2019 Efficiency Task.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Accelerating Sparse Matrix Operations in Neural Networks on Graphics Processing Units.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Incident-Driven Machine Translation and Name Tagging for Low-resource Languages.
Mach. Transl., 2018

Neural Machine Translation of Text from Non-Native Speakers.
CoRR, 2018

Growing Better Graphs With Latent-Variable Probabilistic Graph Grammars.
CoRR, 2018

Weighted DAG Automata for Semantic Graphs.
Comput. Linguistics, 2018

Correcting Length Bias in Neural Machine Translation.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

Algorithms and Training for Weighted Multiset Automata and Regular Expressions.
Proceedings of the Implementation and Application of Automata, 2018

Improving Lexical Choice in Neural Machine Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Combining Character and Word Information in Neural Machine Translation Using a Multi-Level Attention.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Tied Multitask Learning for Neural Speech Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Leveraging Translations for Speech Transcription in Low-resource Settings.
Proceedings of the Interspeech 2018, 2018

Synchronous Hyperedge Replacement Graph Grammars.
Proceedings of the Graph Transformation - 11th International Conference, 2018

Part-of-Speech Tagging on an Endangered Language: a Parallel Griko-Italian Resource.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Composing Finite State Transducers on GPUs.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
DyNet: The Dynamic Neural Network Toolkit.
CoRR, 2017

A case study on using speech-to-translation alignments for language documentation.
CoRR, 2017

Transfer Learning across Low-Resource, Related Languages for Neural Machine Translation.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Spoken Term Discovery for Language Documentation using Translations.
Proceedings of the Workshop on Speech-Centric Natural Language Processing, 2017

Decoding with Finite-State Transducers on GPUs.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Top-Rank Enhanced Listwise Optimization for Statistical Machine Translation.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Growing Graphs with Hyperedge Replacement Graph Grammars.
CoRR, 2016

An Attentional Model for Speech Translation Without Transcription.
Proceedings of the NAACL HLT 2016, 2016

An Unsupervised Probability Model for Speech-to-Translation Alignment of Low-Resource Languages.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Growing Graphs from Hyperedge Replacement Graph Grammars.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
Model Invertibility Regularization: Sequence Alignment With or Without Parallel Data.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Multi-Task Word Alignment Triangulation for Low-Resource Languages.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Auto-Sizing Neural Networks: With Applications to n-gram Language Models.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Supervised Phrase Table Triangulation with Neural Word Embeddings for Low-Resource Languages.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
Improving Word Alignment using Word Similarity.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Kneser-Ney Smoothing on Expected Counts.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Decoding with Large-Scale Neural Language Models Improves Translation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Parsing Graphs with Hyperedge Replacement Grammars.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Grammars for Language and Genes - Theoretical and Empirical Investigations.
Theory and Applications of Natural Language Processing, Springer, ISBN: 978-3-642-20444-9, 2012

Soft syntactic constraints for Arabic-English hierarchical phrase-based translation.
Mach. Transl., 2012

Hope and Fear for Discriminative Training of Statistical Translation Models.
J. Mach. Learn. Res., 2012

Machine Translation for Language Preservation.
Proceedings of the COLING 2012, 2012

An Exploration of Forest-to-String Translation: Does Translation Help or Hurt Parsing?
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the l0-norm.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Rule Markov Models for Fast Tree-to-String Translation.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Models and Training for Unsupervised Preposition Sense Disambiguation.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Two Easy Improvements to Lexical Weighting.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Language-Independent Parsing with Empty Elements.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Unsupervised Syntactic Alignment with Inversion Transduction Grammars.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Bayesian Inference for Finite-State Transducers.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Fast, Greedy Model Minimization for Unsupervised Tagging.
Proceedings of the COLING 2010, 2010

Efficient Optimization of an MDL-Inspired Objective Function for Unsupervised Part-Of-Speech Tagging.
Proceedings of the ACL 2010, 2010

Learning to Translate with Source and Target Syntax.
Proceedings of the ACL 2010, 2010

2009
Introduction to the Special Issue on Machine Translation of Asian Languages.
ACM Trans. Asian Lang. Inf. Process., 2009

11,001 New Features for Statistical Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Fast Consensus Decoding over Translation Forests.
Proceedings of the ACL 2009, 2009

2008
Flexible Composition and Delayed Tree-Locality.
Proceedings of the Ninth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2008

Online Large-Margin Training of Syntactic and Structural Translation Features.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Decomposability of Translation Metrics for Improved Evaluation and Efficient Algorithms.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Extracting Synchronous Grammar Rules From Word-Level Alignments in Linear Time.
Proceedings of the COLING 2008, 2008

2007
Hierarchical Phrase-Based Translation.
Comput. Linguistics, 2007

Forest Rescoring: Faster Decoding with Integrated Language Models.
Proceedings of the ACL 2007, 2007

Word Sense Disambiguation Improves Statistical Machine Translation.
Proceedings of the ACL 2007, 2007

2006
Grammatical Representations of Macromolecular Structure.
J. Comput. Biol., 2006

A Grammatical Theory for the Conformational Changes of Simple Helix Bundles.
J. Comput. Biol., 2006

The Hidden TAG Model: Synchronous Grammars for Parsing Resource-Poor Languages.
Proceedings of the Eighth International Workshop on Tree Adjoining Grammar and Related Formalisms, 2006

The Weak Generative Capacity of Linear Tree-Adjoining Grammars.
Proceedings of the Eighth International Workshop on Tree Adjoining Grammar and Related Formalisms, 2006

Parsing Arabic Dialects.
Proceedings of the EACL 2006, 2006

2005
The Hiero Machine Translation System: Extensions, Evaluation, and Analysis.
Proceedings of the HLT/EMNLP 2005, 2005

Better k-best Parsing.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

A Hierarchical Phrase-Based Model for Statistical Machine Translation.
Proceedings of the ACL 2005, 2005

2004
Uses and abuses of intersected languages.
Proceedings of the 7th International Workshop on Tree Adjoining Grammar and Related Formalisms, 2004

2002
Putting Some Weakly Context-Free Formalisms in Order.
Proceedings of the Sixth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2002

Recovering Latent Information in Treebanks.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

2001
Facilitating Treebank Annotation Using a Statistical Parser.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Constraints on Strong Generative Power.
Proceedings of the Association for Computational Linguistics, 2001

2000
Some remarks on an extension of synchronous TAG.
Proceedings of the Fifth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2000

Multi-Component TAG and Notions of Formal Power.
Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 2000

Statistical Parsing with an Automatically-Extracted Tree Adjoining Grammar.
Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 2000

