Dongdong Zhang

Orcid: 0000-0003-0833-7903

Affiliations:
  • Microsoft Research Asia, Beijing, China


According to our database, Dongdong Zhang authored at least 86 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Revamping Multilingual Agreement Bidirectionally via Switched Back-translation for Multilingual Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Syntax-Aware Data Augmentation for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing.
CoRR, 2023

Discourse Centric Evaluation of Machine Translation with a Densely Annotated Parallel Corpus.
CoRR, 2023

Chain-of-Dictionary Prompting Elicits Translation in Large Language Models.
CoRR, 2023

On the Pareto Front of Multilingual Neural Machine Translation.
CoRR, 2023

Findings of the WMT 2023 Shared Task on Machine Translation with Terminologies.
Proceedings of the Eighth Conference on Machine Translation, 2023

Are More Layers Beneficial to Graph Transformers?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

TRIP: Accelerating Document-level Multilingual Pre-training via Triangular Document-level Pre-training on Parallel Data Triplets.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

HanoiT: Enhancing Context-aware Translation via Selective Context.
Proceedings of the Database Systems for Advanced Applications, 2023

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Discourse-Centric Evaluation of Document-level Machine Translation with a New Densely Annotated Parallel Corpus of Novels.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
TRIP: Triangular Document-level Pre-training for Multilingual Language Models.
CoRR, 2022

LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation.
CoRR, 2022

A Bilingual Parallel Corpus with Discourse Annotations.
CoRR, 2022

Towards Multilingual Transitivity and Bidirectional Multilingual Agreement for Multilingual Document-level Machine Translation.
CoRR, 2022

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation.
CoRR, 2022

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation.
CoRR, 2022

DeepNet: Scaling Transformers to 1,000 Layers.
CoRR, 2022

Phrase-level Adversarial Example Generation for Neural Machine Translation.
CoRR, 2022

SMDT: Selective Memory-Augmented Neural Document Translation.
CoRR, 2022

BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

High-resource Language-specific Training for Multilingual Neural Machine Translation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Zero-shot Cross-lingual Transfer of Prompt-based Tuning with a Unified Multilingual Prompt.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation.
CoRR, 2021

DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders.
CoRR, 2021

BlonD: An Automatic Evaluation Metric for Document-level Machine Translation.
CoRR, 2021

Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Learning to Select Relevant Knowledge for Neural Machine Translation.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

XGPT: Cross-modal Generative Pre-Training for Image Captioning.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

Smart-Start Decoding for Neural Machine Translation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Improving Multilingual Neural Machine Translation with Auxiliary Source Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Zero-Shot Cross-Lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multilingual Agreement for Multilingual Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

How Does Distilled Data Complexity Impact the Quality and Confidence of Non-Autoregressive Machine Translation?
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
XLM-T: Scaling up Multilingual Machine Translation with Pretrained Cross-lingual Transformer Encoders.
CoRR, 2020

M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training.
CoRR, 2020

XGPT: Cross-modal Generative Pre-Training for Image Captioning.
CoRR, 2020

BiGCNN: Bidirectional Gated Convolutional Neural Network for Chinese Named Entity Recognition.
Proceedings of the Database Systems for Advanced Applications, 2020

Improving Neural Machine Translation with Soft Template Prediction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Simple and Effective Unified Encoder for Document-Level Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Alternating Language Modeling for Cross-Lingual Pre-Training.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Learning Unsupervised Word Mapping via Maximum Mean Discrepancy.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Effective Soft-Adaptation for Neural Machine Translation.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

2018
Dependency-to-Dependency Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Learning Unsupervised Word Mapping by Maximizing Mean Discrepancy.
CoRR, 2018

Achieving Human Parity on Automatic Chinese to English News Translation.
CoRR, 2018

Improved Neural Machine Translation with Chinese Phonologic Features.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

2017
Modeling Indicative Context for Statistical Machine Translation.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Improved Neural Machine Translation with Source Syntax.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Sequence-to-Dependency Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Better Addressing Word Deletion for Statistical Machine Translation.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

2015
Efficient Disfluency Detection with Transition-based Parsing.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Woodpecker: An Automatic Methodology for Machine Translation Diagnosis with Rich Linguistic Knowledge.
J. Inf. Sci. Eng., 2014

Soft Dependency Matching for Hierarchical Phrase-based Machine Translation.
Proceedings of the COLING 2014, 2014

A Lexicalized Reordering Model for Hierarchical Phrase-based Translation.
Proceedings of the COLING 2014, 2014

Learning Topic Representation for SMT with Neural Networks.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Machine Translation with Real-Time Web Search.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Collective Corpus Weighting and Phrase Scoring for SMT Using Graph-Based Random Walk.
Proceedings of the Natural Language Processing and Chinese Computing, 2013

Multi-Domain Adaptation for SMT Using Multi-Task Learning.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Punctuation Prediction with Transition-based Parsing.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Bilingual Data Cleaning for SMT using Graph-based Random Walk.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
A Ranking-based Approach to Word Reordering for Statistical Machine Translation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Translation Model Size Reduction for Hierarchical Phrase-based Statistical Machine Translation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Hierarchical Chunk-to-String Translation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Function Word Generation in Statistical Machine Translation Systems.
Proceedings of Machine Translation Summit XIII: Papers, 2011

2010
Adaptive Development Data Selection for Log-linear Model in Statistical Machine Translation.
Proceedings of the COLING 2010, 2010

Mixture Model-based Minimum Bayes Risk Decoding using Multiple Machine Translation Systems.
Proceedings of the COLING 2010, 2010

Hybrid Decoding: Decoding with Partial Hypotheses Combination over Multiple SMT Systems.
Proceedings of the COLING 2010, 2010

A Joint Rule Selection Model for Hierarchical Phrase-Based Translation.
Proceedings of the ACL 2010, 2010

2009
Introduction to China's CWMT2008 Machine Translation Evaluation.
Proceedings of Machine Translation Summit XII: Papers, 2009

Better Synchronous Binarization for Machine Translation.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Collaborative Decoding: Partial Hypothesis Re-ranking Using Translation Consensus between Decoders.
Proceedings of the ACL 2009, 2009

2008
An Empirical Study in Source Word Deletion for Phrase-Based Statistical Machine Translation.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

Diagnostic Evaluation of Machine Translation Systems Using Automatically Constructed Linguistic Check-Points.
Proceedings of the COLING 2008, 2008

Measure Word Generation for English-Chinese SMT Systems.
Proceedings of the ACL 2008, 2008

2007
Phrase Reordering Model Integrating Syntactic Knowledge for SMT.
Proceedings of the EMNLP-CoNLL 2007, 2007

A Probabilistic Approach to Syntax-based Reordering for Statistical Machine Translation.
Proceedings of the ACL 2007, 2007

