Yeyun Gong

ORCID: 0000-0001-9954-9674

According to our database, Yeyun Gong authored at least 92 papers between 2011 and 2024.

Bibliography

2024
Exploring the Mystery of Influential Data for Mathematical Reasoning.
CoRR, 2024

Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models.
CoRR, 2024

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning.
CoRR, 2024

LEAD: Liberal Feature-based Distillation for Dense Retrieval.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

2023
Competition-Level Problems are Effective LLM Evaluators.
CoRR, 2023

Adapting LLM Agents Through Communication.
CoRR, 2023

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving.
CoRR, 2023

Think-on-Graph: Deep and Responsible Reasoning of Large Language Model with Knowledge Graph.
CoRR, 2023

CMMLU: Measuring massive multitask language understanding in Chinese.
CoRR, 2023

BeamSearchQA: Large Language Models are Strong Zero-Shot QA Solver.
CoRR, 2023

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing.
CoRR, 2023

PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization.
CoRR, 2023

Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models.
CoRR, 2023

AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators.
CoRR, 2023

PROD: Progressive Distillation for Dense Retrieval.
Proceedings of the ACM Web Conference 2023, 2023

MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders Are Better Dense Retrievers.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On-the-Fly Adapting Code Summarization on Trainable Cost-Effective Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models.
Proceedings of the International Conference on Machine Learning, 2023

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise.
Proceedings of the International Conference on Machine Learning, 2023

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Query Rewriting in Retrieval-Augmented Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Noisy Pair Corrector for Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Allies: Prompting Large Language Model with Beam Search.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Joint Generator-Ranker Learning for Natural Language Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
GENIE: Large Scale Pre-training for Text Generation with Diffusion Model.
CoRR, 2022

Curriculum Sampling for Dense Retrieval with Document Expansion.
CoRR, 2022

APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning.
CoRR, 2022

LEAD: Liberal Feature-based Distillation for Dense Retrieval.
CoRR, 2022

GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation.
CoRR, 2022

P3LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training.
CoRR, 2022

SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval.
CoRR, 2022

A Self-Paced Mixed Distillation Method for Non-Autoregressive Generation.
CoRR, 2022

Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings.
CoRR, 2022

CodeRetriever: Unimodal and Bimodal Contrastive Learning.
CoRR, 2022

Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

CULG: Commercial Universal Language Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

Adversarial Retriever-Ranker for Dense Text Retrieval.
Proceedings of the Tenth International Conference on Learning Representations, 2022

SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

CodeRetriever: A Large Scale Contrastive Pre-Training Method for Code Search.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

P3LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning.
CoRR, 2021

FastSeq: Make Sequence Generation Faster.
CoRR, 2021

Question Generation from Code Snippets and Programming Error Messages.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

Mask Attention Networks: Rethinking and Strengthen Transformer.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Poolingformer: Long Document Modeling with Pooling Attention.
Proceedings of the 38th International Conference on Machine Learning, 2021

EL-Attention: Memory Efficient Lossless Attention for Generation.
Proceedings of the 38th International Conference on Machine Learning, 2021

BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining.
Proceedings of the 38th International Conference on Machine Learning, 2021

KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

FastSeq: Make Sequence Generation Faster.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GLGE: A New General Language Generation Evaluation Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.
CoRR, 2020

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation.
CoRR, 2020

ProphetNet-Ads: A Looking Ahead Strategy for Generative Retrieval Models in Sponsored Search Engine.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

Leveraging Document-Level Label Consistency for Named Entity Recognition.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Uncertainty-Aware Label Refinement for Sequence Labeling.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-level Alignment Pretraining for Multi-lingual Semantic Parsing.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

An Enhanced Knowledge Injection Model for Commonsense Generation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

RikiNet: Reading Wikipedia Pages for Natural Question Answering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Neural Semantic Parsing in Low-Resource Settings with Back-Translation and Meta-Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Graph-Based Transformer with Cross-Candidate Verification for Semantic Parsing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Pretraining-Based Natural Language Generation for Text Summarization.
CoRR, 2019

Weakly Supervised Multi-task Learning for Semantic Parsing.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Aggregating Bidirectional Encoder Representations Using MatchLSTM for Sequence Matching.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Joint Type Inference on Entities and Relations via Graph Convolutional Networks.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Question Generation With Doubly Adversarial Nets.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Hashtag recommendation for multimodal microblog posts.
Neurocomputing, 2018

2017
Phrase-based hashtag recommendation for microblog posts.
Sci. China Inf. Sci., 2017

Hierarchical Dirichlet Processes with Social Influence.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Hashtag Recommendation for Multimodal Microblog Using Co-Attention Network.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016
Keyphrase Extraction Using Deep Recurrent Neural Networks on Twitter.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Hashtag Recommendation Using End-To-End Memory Networks with Hierarchical Attention.
Proceedings of the COLING 2016, 2016

Retweet Prediction with Attention-based Deep Neural Network.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
Hashtag Recommendation Using Dirichlet Process Mixture Models Incorporating Types of Hashtags.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Who Will You "@"?
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

Retweet Behavior Prediction Using Hierarchical Dirichlet Process.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Time-aware Personalized Hashtag Recommendation on Social Media.
Proceedings of the COLING 2014, 2014

A Generative Model for Identifying Target Companies of Microblogs.
Proceedings of the COLING 2014, 2014

2013
Detecting Spammers in Community Question Answering.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Map search via a factor graph model.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2011
Classical Mongolian Words Recognition in Historical Document.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011
