Yelong Shen

According to our database1, Yelong Shen authored at least 81 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models.
CoRR, 2024

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning.
CoRR, 2024

Multi-LoRA Composition for Image Generation.
CoRR, 2024

2023
Competition-Level Problems are Effective LLM Evaluators.
CoRR, 2023

Language Models can be Logical Solvers.
CoRR, 2023

Adapting LLM Agents Through Communication.
CoRR, 2023

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving.
CoRR, 2023

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models.
CoRR, 2023

Efficient RLHF: Reducing the Memory Usage of PPO.
CoRR, 2023

GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions.
CoRR, 2023

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing.
CoRR, 2023

Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models.
CoRR, 2023

What Matters In The Structured Pruning of Generative Language Models?
CoRR, 2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

In-Context Learning Unlocked for Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models.
Proceedings of the International Conference on Machine Learning, 2023

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise.
Proceedings of the International Conference on Machine Learning, 2023

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Joint Generator-Ranker Learning for Natural Language Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
GENIE: Large Scale Pre-training for Text Generation with Diffusion Model.
CoRR, 2022

Generation-Augmented Query Expansion For Code Retrieval.
CoRR, 2022

GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation.
CoRR, 2022

SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval.
CoRR, 2022

Explanations from Large Language Models Make Small Reasoners Better.
CoRR, 2022

A Self-Paced Mixed Distillation Method for Non-Autoregressive Generation.
CoRR, 2022

CodeRetriever: Unimodal and Bimodal Contrastive Learning.
CoRR, 2022

Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Adversarial Retriever-Ranker for Dense Text Retrieval.
Proceedings of the Tenth International Conference on Learning Representations, 2022

LoRA: Low-Rank Adaptation of Large Language Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

CodeRetriever: A Large Scale Contrastive Pre-Training Method for Code Search.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Controllable Natural Language Generation with Contrastive Prefixes.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Finding the Dominant Winning Ticket in Pre-Trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
LoRA: Low-Rank Adaptation of Large Language Models.
CoRR, 2021

Poolingformer: Long Document Modeling with Pooling Attention.
Proceedings of the 38th International Conference on Machine Learning, 2021

Integrated Defense for Resilient Graph Matching.
Proceedings of the 38th International Conference on Machine Learning, 2021

CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding.
Proceedings of the 9th International Conference on Learning Representations, 2021

Memory-Efficient Differentiable Transformer Architecture Search.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Reader-Guided Passage Reranking for Open-Domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Generation-Augmented Retrieval for Open-Domain Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

UnitedQA: A Hybrid Approach for Open Domain Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Improving Self-supervised Pre-training via a Fully-Explored Masked Language Model.
CoRR, 2020

A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation.
CoRR, 2020

Adversarial Attacks on Deep Graph Matching.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020


MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Unsupervised Deep Structured Semantic Models for Commonsense Reasoning.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

StoryGAN: A Sequential Conditional GAN for Story Visualization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Hybrid Retrieval-Generation Neural Conversation Model.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Multi-Task Learning for Machine Reading Comprehension.
CoRR, 2018

M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

ReinforceWalk: Learning to Walk in Graph with Monte Carlo Tree Search.
Proceedings of the 6th International Conference on Learning Representations, 2018

FusionNet: Fusing via Fully-aware Attention with Application to Machine Comprehension.
Proceedings of the 6th International Conference on Learning Representations, 2018

Language-Based Image Editing With Recurrent Attentive Models.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Stochastic Answer Networks for Machine Reading Comprehension.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Towards Human-level Machine Reading Comprehension: Reasoning and Inference with Multiple Strategies.
CoRR, 2017

Modeling Large-Scale Structured Relationships with Shared Memory for Knowledge Base Completion.
Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

ReasoNet: Learning to Stop Reading in Machine Comprehension.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

An Empirical Analysis of Multiple-Turn Reasoning Strategies in Reading Comprehension Tasks.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Deep Context Modeling for Web Query Entity Disambiguation.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Dynamic socialized Gaussian process models for human behavior prediction in a health social network.
Knowl. Inf. Syst., 2016

Implicit ReasoNet: Modeling Large-Scale Structured Relationships with Shared Memory.
CoRR, 2016

2015
Deep Sentence Embedding Using the Long Short Term Memory Network: Analysis and Application to Information Retrieval.
CoRR, 2015

End-to-end Learning of Latent Dirichlet Allocation by Mirror-Descent Back Propagation.
CoRR, 2015

End-to-end Learning of LDA by Mirror-Descent Back Propagation over a Deep Architecture.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

A Deep Embedding Model for Co-occurrence Learning.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

2014
Semantic Modelling with Long-Short-Term Memory for Information Retrieval.
CoRR, 2014

Learning semantic representations using convolutional neural networks for web search.
Proceedings of the 23rd International World Wide Web Conference, 2014

A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

2013
Limiting the Neighborhood: De-Small-World Network for Outbreak Prevention
CoRR, 2013

2012
Learning personal + social latent factor model for social recommendation.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Socialized Gaussian Process Model for Human Behavior Prediction in a Health Social Network.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Reliable Clustering on Uncertain Graphs.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

2011
Sparse hidden-dynamics conditional random fields for user intent understanding.
Proceedings of the 20th International Conference on World Wide Web, 2011

Learning to rank audience for behavioral targeting in display ads.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011


  Loading...