Robin Jia

According to our database1, Robin Jia authored at least 48 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Proving membership in LLM pretraining data via data watermarks.
CoRR, 2024

2023
Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions?
CoRR, 2023

Efficient End-to-End Visual Document Understanding with Rationale Distillation.
CoRR, 2023

Do Localization Methods Actually Localize Memorized Data in LLMs?
CoRR, 2023

Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.
CoRR, 2023

Operationalizing content moderation "accuracy" in the Digital Services Act.
CoRR, 2023

How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Estimating Large Language Model Capabilities without Labeled Test Data.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Benchmarking Long-tail Generalization with Likelihood Splits.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Contrastive Novelty-Augmented Learning: Anticipating Outliers with Large Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Do Question Answering Modeling Improvements Hold Across Benchmarks?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Are Sample-Efficient NLP Models More Robust?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Data Curation Alone Can Stabilize In-context Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Careful Data Curation Stabilizes In-context Learning.
CoRR, 2022

CoNAL: Anticipating Outliers with Large Language Models.
CoRR, 2022

On the Robustness of Reading Comprehension Models to Entity Renaming.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Knowledge Base Question Answering by Case-based Reasoning over Subgraphs.
Proceedings of the International Conference on Machine Learning, 2022

Generalization Differences between End-to-End and Neuro-Symbolic Vision-Language Reasoning Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Analyzing Dynamic Adversarial Training Data in the Limit.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

On Continual Model Refinement in Out-of-Distribution Data Streams.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Question Answering Infused Pre-training of General-Purpose Contextualized Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Can Small and Synthetic Benchmarks Drive Modeling Innovation? A Retrospective Study of Question Answering Modeling Approaches.
CoRR, 2021

Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Swords: A Benchmark for Lexical Substitution with Improved Data Coverage and Quality.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Dynabench: Rethinking Benchmarking in NLP.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

To what extent do human explanations of model behavior align with actual model behavior?
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

The statistical advantage of automatic NLG metrics at the system level.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Evaluation Examples are not Equally Informative: How should that change NLP Leaderboards?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Do Explanations Help Users Detect Errors in Open-Domain QA? An Evaluation of Spoken vs. Visual Explanations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Building robust natural language processing systems.
PhD thesis, 2020

Human Evaluation of Spoken vs. Visual Explanations for Open-Domain QA.
CoRR, 2020

On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

With Little Power Comes Great Responsibility.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Selective Question Answering under Domain Shift.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Robust Encodings: A Framework for Combating Adversarial Typos.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Document-Level N-ary Relation Extraction with Multiscale Representation Learning.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Certified Robustness to Adversarial Word Substitutions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

2018
Delete, Retrieve, Generate: a Simple Approach to Sentiment and Style Transfer.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Know What You Don't Know: Unanswerable Questions for SQuAD.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Learning concepts through conversations in spoken dialogue systems.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Adversarial Examples for Evaluating Reading Comprehension Systems.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Data Recombination for Neural Semantic Parsing.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016


  Loading...