Taiwei Shi

According to our database¹, Taiwei Shi authored at least 22 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

Skill Reuse as Compression in Agentic RL.

CoRR, May, 2026

Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks.

CoRR, April, 2026

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents.

CoRR, April, 2026

Video-Based Reward Modeling for Computer-Use Agents.

CoRR, March, 2026

DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning.

With Virginia Estellers, Tadas Baltrusaitis, Jennifer Neville, et al.

CoRR, February, 2026

Experiential Reinforcement Learning.

CoRR, February, 2026

One Model, All Roles: Multi-Turn, Multi-Agent Self-Play Reinforcement Learning for Conversational Social Intelligence.

With Camillo J. Taylor, et al.

CoRR, February, 2026

2025

CoAct-1: Computer-using Agents with Coding as Actions.

With Silvio Savarese, et al.

CoRR, August, 2025

The Hallucination Tax of Reinforcement Finetuning.

CoRR, May, 2025

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning.

CoRR, April, 2025

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base.

With Ryotaro Shimizu, et al.

CoRR, March, 2025

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective.

With Bryan Hooi Kuen-Yew, Elias Stengel-Eskin, Swabha Swayamdipta, Or Cohen Sasson, Prasanna Sattigeri, Nitesh V. Chawla, Neil Zhenqiang Gong, Xiangliang Zhang, et al.

CoRR, February, 2025

Detecting and Filtering Unsafe Training Data via Data Attribution.

CoRR, February, 2025

The Hallucination Tax of Reinforcement Finetuning.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

STEER-BENCH: A Benchmark for Evaluating the Steerability of Large Language Models.

With Kristina Lerman, et al.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback.

With Sujay Kumar Jauhar, Jennifer Neville, et al.

CoRR, 2024

Safer-Instruct: Aligning Language Models with Automated Preference Data.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Can Language Model Moderators Improve the Health of Online Discourse?

With Jonathan Gratch, et al.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

How Susceptible are Large Language Models to Ideological Manipulation?

With Kristina Lerman, et al.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Decentralized Inventory Transshipments with Quantal Response Equilibrium.

Syst., July, 2023

CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

Neural Story Planning.

With Christopher Cui, et al.

CoRR, 2022
