Hao Cheng

CoRR, 2024

GRIN: GRadient-INformed MoE.

CoRR, 2024

Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering.

CoRR, 2024

Encode Once and Decode in Parallel: Efficient Transformer Decoding.

CoRR, 2024

ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking.

Chia-Hsuan Lee

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Fast-ELECTRA for Efficient Pre-training.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents.

Proceedings of the Computer Vision - ECCV 2024, 2024

Language Models as Inductive Reasoners.

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

DocLens: Multi-aspect Fine-grained Medical Text Evaluation.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Fine-tuning large neural language models for biomedical natural language processing.

Patterns, April, 2023

InSCIt: Information-Seeking Conversations with Mixed-Initiative Interactions.

Prithviraj Ammanabrolu

Hannaneh Hajishirzi

Trans. Assoc. Comput. Linguistics, 2023

Enhancing Medical Text Evaluation with GPT-4.

CoRR, 2023

Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks.

CoRR, 2023

MathVista: Evaluating Math Reasoning in Visual Contexts with GPT-4V, Bard, and Other Large Multimodal Models.

CoRR, 2023

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations.

CoRR, 2023

Self-Verification Improves Few-Shot Clinical Information Extraction.

CoRR, 2023

Chain-of-Skills: A Configurable Model for Open-domain Question Answering.

CoRR, 2023

Pre-training Transformers for Knowledge Graph Completion.

CoRR, 2023

Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback.

CoRR, 2023

Augmenting Language Models with Long-Term Memory.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Understand and Modularize Generator Optimization in ELECTRA-style Pretraining.

Proceedings of the International Conference on Machine Learning, 2023

Visually-Augmented Language Modeling.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Chain-of-Skills: A Configurable Model for Open-Domain Question Answering.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022

Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing.

ACM Trans. Comput. Heal., 2022

A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models.

CoRR, 2022

Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Open-domain Question Answering via Chain of Reasoning over Heterogeneous Knowledge.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Unsupervised Learning of Hierarchical Conversation Structure.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Knowledge-Rich Self-Supervision for Biomedical Entity Linking.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Open Domain Question Answering with A Unified Knowledge Interface.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Knowledge-Rich Self-Supervised Entity Linking.

CoRR, 2021

CLUES: Few-Shot Learning Evaluation in Natural Language Understanding.

Ahmed Hassan Awadallah

Jianfeng Gao

CoRR, 2021

Open Domain Question Answering over Virtual Documents: A Unified Approach for Data and Text.

CoRR, 2021

Few-Shot Learning Evaluation in Natural Language Understanding.

Ahmed Hassan Awadallah

Jianfeng Gao

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Targeted Adversarial Training for Natural Language Understanding.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Posterior Differential Regularization with f-divergence for Improving Model Robustness.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature.

Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Dialogue State Tracking with a Language Model using Schema-Driven Prompting.

Chia-Hsuan Lee

Michael Sejr Schlichtkrull

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

UnitedQA: A Hybrid Approach for Open Domain Question Answering.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Adversarial Training for Large Neural Language Models.

CoRR, 2020

NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned.

Sewon Min

Jordan L. Boyd-Graber

Sonal Gupta

Yashar Mehdad

Wen-tau Yih

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

Probabilistic Assumptions Matter: Improved Models for Distantly-Supervised Document-Level Question Answering.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

A Dynamic Speaker Model for Conversational Interactions.

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018

Improving Span-based Question Answering Systems with Coarsely Labeled Data.

CoRR, 2018

Sounding Board: A User-Centric and Content-Driven Social Chatbot.

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

2017

A Factored Neural Network Model for Characterizing Online Discussions in Vector Space.

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016

Learning Latent Local Conversation Modes for Predicting Community Endorsement in Online Discussions.

CoRR, 2016

Bi-directional Attention with Agreement for Dependency Parsing.

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Scalable and Sound Low-Rank Tensor Learning.

Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

Learning Latent Local Conversation Modes for Predicting Comment Endorsement in Online Discussions.

Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, 2016

2015

Open-Domain Name Error Detection using a Multi-Task RNN.

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Language Models for Image Captioning: The Quirks and What Works.

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2013

Convex Relaxations of Bregman Divergence Clustering.
