Yong Cheng

Affiliations:
  • Google DeepMind, China
  • Tsinghua University, Institute for Interdisciplinary Information Sciences, Beijing, China


According to our database1, Yong Cheng authored at least 43 papers between 2014 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MedGemma 1.5 Technical Report.
CoRR, April, 2026

2025
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model.
CoRR, October, 2025

Complementary Human-AI Clinical Reasoning in Ophthalmology.
CoRR, October, 2025

Advancing Conversational Diagnostic AI with Multimodal Reasoning.
CoRR, May, 2025

Towards Conversational AI for Disease Management.
CoRR, March, 2025

2024
Exploring Large Language Models for Specialist-level Oncology Care.
CoRR, 2024

Towards Democratization of Subspeciality Medical Expertise.
CoRR, 2024

Understanding the performance gap between online and offline alignment algorithms.
CoRR, 2024

Capabilities of Gemini Models in Medicine.
CoRR, 2024

Towards Conversational Diagnostic AI.
CoRR, 2024


Language Model Beats Diffusion - Tokenizer is key to visual generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Towards Accurate Differential Diagnosis with Large Language Models.
CoRR, 2023

PaLM 2 Technical Report.
CoRR, 2023

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mu<sup>2</sup>SLAM: Multitask, Multilingual Speech and Language Models.
Proceedings of the International Conference on Machine Learning, 2023

MAGVIT: Masked Generative Video Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improving Multilingual and Code-Switching ASR Using Large Language Model Generated Text.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
mSLAM: Massively multilingual joint pre-training for speech and text.
CoRR, 2022

Examining Scaling and Transfer of Language Model Architectures for Machine Translation.
Proceedings of the International Conference on Machine Learning, 2022

Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Self-supervised and Supervised Joint Training for Resource-rich Machine Translation.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Towards Web-based Etymological Hanzi Learning.
Proceedings of the Companion of The 2020 Web Conference 2020, 2020

Living Jiagu: Enabling Constructive Etymology for Chinese Learning.
Proceedings of the Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems, 2020

AdvAug: Robust Adversarial Augmentation for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
An End-to-End Generative Architecture for Paraphrase Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Reducing Word Omission Errors in Neural Machine Translation: A Contrastive Learning Approach.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Robust Neural Machine Translation with Doubly Adversarial Inputs.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Joint Training for Neural Machine Translation
Springer, ISBN: 978-981-32-9747-0, 2019

2018
Neural Machine Translation with Key-Value Memory-Augmented Attention.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Towards Robust Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Neural Parse Combination.
J. Comput. Sci. Technol., 2017

THUMT: An Open Source Toolkit for Neural Machine Translation.
CoRR, 2017

HiText: Text Reading with Dynamic Salience Marking.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Maximum Expected Likelihood Estimation for Zero-resource Neural Machine Translation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Joint Training for Pivot-based Neural Machine Translation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

A Teacher-Student Framework for Zero-Resource Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Maximum Reconstruction Estimation for Generative Latent-Variable Models.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Neural Machine Translation with Pivot Languages.
CoRR, 2016

Agreement-Based Joint Training for Bidirectional Attention-Based Neural Machine Translation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Minimum Risk Training for Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Semi-Supervised Learning for Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2014
Query Lattice for Translation Retrieval.
Proceedings of the COLING 2014, 2014


  Loading...