Shi Feng

Orcid: 0009-0001-1685-8912

Affiliations:
  • New York University, NY, USA
  • University of Chicago, IL, USA
  • University of Maryland, MD, USA
  • Shanghai Jiaotong University, China


According to our database1, Shi Feng authored at least 31 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Unsupervised Elicitation of Language Models.
CoRR, June, 2025

Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Language Models Learn to Mislead Humans via RLHF.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Spontaneous Reward Hacking in Iterative Self-Refinement.
CoRR, 2024

LLM Evaluators Recognize and Favor Their Own Generations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Large Language Models Help Humans Verify Truthfulness - Except When They Are Convincingly Wrong.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Machine Explanations and Human Understanding.
Trans. Mach. Learn. Res., 2023

Learning Human-Compatible Representations for Case-Based Decision Support.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Active Example Selection for In-Context Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learning to Explain Selectively: A Case Study on Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Calibrate Before Use: Improving Few-Shot Performance of Language Models.
CoRR, 2021

Concealed Data Poisoning Attacks on NLP Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Calibrate Before Use: Improving Few-shot Performance of Language Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Customizing Triggers with Concealed Data Poisoning.
CoRR, 2020

2019
Trick Me If You Can: Human-in-the-loop Generation of Adversarial Question Answering Examples.
Trans. Assoc. Comput. Linguistics, 2019

Universal Adversarial Triggers for NLP.
CoRR, 2019

Quizbowl: The Case for Incremental Question Answering.
CoRR, 2019

What can AI do for me?: evaluating machine learning interpretations in cooperative play.
Proceedings of the 24th International Conference on Intelligent User Interfaces, 2019

Understanding Impacts of High-Order Loss Approximations and Features in Deep Learning Interpretation.
Proceedings of the 36th International Conference on Machine Learning, 2019

Universal Adversarial Triggers for Attacking and Analyzing NLP.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Misleading Failures of Partial-input Baselines.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Trick Me If You Can: Adversarial Writing of Trivia Challenge Questions.
CoRR, 2018

Right Answer for the Wrong Reason: Discovery and Mitigation.
CoRR, 2018

Interpreting Neural Networks with Nearest Neighbors.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Pathologies of Neural Models Make Interpretation Difficult.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task.
Proceedings of the Second Conference on Machine Translation, 2017


  Loading...