Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Language Models can be Deductive Solvers.

Jiazhan Feng

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Competition-Level Problems are Effective LLM Evaluators.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Language Models can be Logical Solvers.

CoRR, 2023

Adapting LLM Agents Through Communication.

CoRR, 2023

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models.

CoRR, 2023

Efficient RLHF: Reducing the Memory Usage of PPO.

CoRR, 2023

GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions.

Ahmed Hassan Awadallah

Damien Jose

Xiang Ren

CoRR, 2023

What Matters In The Structured Pruning of Generative Language Models?

CoRR, 2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

In-Context Learning Unlocked for Diffusion Models.

Zhangyang (Atlas) Wang

Mingyuan Zhou

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models.

Proceedings of the International Conference on Machine Learning, 2023

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise.

Proceedings of the International Conference on Machine Learning, 2023

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Joint Generator-Ranker Learning for Natural Language Generation.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

GENIE: Large Scale Pre-training for Text Generation with Diffusion Model.

CoRR, 2022

Generation-Augmented Query Expansion For Code Retrieval.

CoRR, 2022

GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation.

CoRR, 2022

SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval.

CoRR, 2022

Explanations from Large Language Models Make Small Reasoners Better.

CoRR, 2022

A Self-Paced Mixed Distillation Method for Non-Autoregressive Generation.

CoRR, 2022

CodeRetriever: Unimodal and Bimodal Contrastive Learning.

CoRR, 2022

Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Adversarial Retriever-Ranker for Dense Text Retrieval.

Proceedings of the Tenth International Conference on Learning Representations, 2022

LoRA: Low-Rank Adaptation of Large Language Models.

Proceedings of the Tenth International Conference on Learning Representations, 2022

SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

CodeRetriever: A Large Scale Contrastive Pre-Training Method for Code Search.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Controllable Natural Language Generation with Contrastive Prefixes.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Finding the Dominant Winning Ticket in Pre-Trained Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

LoRA: Low-Rank Adaptation of Large Language Models.

CoRR, 2021

Poolingformer: Long Document Modeling with Pooling Attention.

Proceedings of the 38th International Conference on Machine Learning, 2021

Integrated Defense for Resilient Graph Matching.

Proceedings of the 38th International Conference on Machine Learning, 2021

CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding.

Proceedings of the 9th International Conference on Learning Representations, 2021

Memory-Efficient Differentiable Transformer Architecture Search.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Reader-Guided Passage Reranking for Open-Domain Question Answering.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Generation-Augmented Retrieval for Open-Domain Question Answering.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

UnitedQA: A Hybrid Approach for Open Domain Question Answering.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Improving Self-supervised Pre-training via a Fully-Explored Masked Language Model.

CoRR, 2020

A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation.

CoRR, 2020

Adversarial Attacks on Deep Graph Matching.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned.

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension.

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Unsupervised Deep Structured Semantic Models for Commonsense Reasoning.

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

StoryGAN: A Sequential Conditional GAN for Story Visualization.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Hybrid Retrieval-Generation Neural Conversation Model.

Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018

Multi-Task Learning for Machine Reading Comprehension.

CoRR, 2018

M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

ReinforceWalk: Learning to Walk in Graph with Monte Carlo Tree Search.

Proceedings of the 6th International Conference on Learning Representations, 2018

FusionNet: Fusing via Fully-aware Attention with Application to Machine Comprehension.

Proceedings of the 6th International Conference on Learning Representations, 2018

Language-Based Image Editing With Recurrent Attentive Models.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Stochastic Answer Networks for Machine Reading Comprehension.

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Towards Human-level Machine Reading Comprehension: Reasoning and Inference with Multiple Strategies.

CoRR, 2017

Modeling Large-Scale Structured Relationships with Shared Memory for Knowledge Base Completion.

Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

An Empirical Analysis of Multiple-Turn Reasoning Strategies in Reading Comprehension Tasks.

Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Deep Context Modeling for Web Query Entity Disambiguation.

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016

Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval.

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Dynamic socialized Gaussian process models for human behavior prediction in a health social network.

Knowl. Inf. Syst., 2016

Implicit ReasoNet: Modeling Large-Scale Structured Relationships with Shared Memory.

CoRR, 2016

ReasoNet: Learning to Stop Reading in Machine Comprehension.

Proceedings of the Workshop on Cognitive Computation: Integrating neural and symbolic approaches 2016 co-located with the 30th Annual Conference on Neural Information Processing Systems (NIPS 2016), 2016

2015

Deep Sentence Embedding Using the Long Short Term Memory Network: Analysis and Application to Information Retrieval.
