Hyung Won Chung

According to our database, Hyung Won Chung authored at least 27 papers between 2015 and 2023.

Bibliography

2023
PaLM: Scaling Language Modeling with Pathways.
J. Mach. Learn. Res., 2023

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts.
CoRR, 2023

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning.
Proceedings of the International Conference on Machine Learning, 2023

UL2: Unifying Language Learning Paradigms.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Language models are multilingual chain-of-thought reasoners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

UniMax: Fairer and More Effective Language Sampling for Large-Scale Multilingual Pretraining.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Transcending Scaling Laws with 0.1% Extra Compute.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them.
Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Large Language Models Encode Clinical Knowledge.
CoRR, 2022

Scaling Instruction-Finetuned Language Models.
CoRR, 2022

What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?
CoRR, 2022

Scaling Up Models and Data with t5x and seqio.
CoRR, 2022

What Language Model Architecture and Pretraining Objective Works Best for Zero-Shot Generalization?
Proceedings of the International Conference on Machine Learning, 2022

Charformer: Fast Character Transformers via Gradient-based Subword Tokenization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Scale Efficiently: Insights from Pretraining and Finetuning Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers.
CoRR, 2021

Demystifying the Better Performance of Position Encoding Variants for Transformer.
CoRR, 2021

Neural Data Augmentation via Example Extrapolation.
CoRR, 2021

Rethinking Embedding Coupling in Pre-trained Language Models.
Proceedings of the 9th International Conference on Learning Representations, 2021

Learning Compact Metrics for MT.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Do Transformer Modifications Transfer Across Implementations and Applications?
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Simple and Effective Positional Encoding for Transformers.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition.
CoRR, 2020

Learning to Evaluate Translation Beyond English: BLEURT Submissions to the WMT Metrics 2020 Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

Improving Multilingual Models with Language-Clustered Vocabularies.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2015
Entropy Generation of Desalination Powered by Variable Temperature Waste Heat.
Entropy, 2015

