Taiwei Shi

According to our database1, Taiwei Shi authored at least 22 papers between 2022 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Skill Reuse as Compression in Agentic RL.
CoRR, May, 2026

Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks.
CoRR, April, 2026

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents.
CoRR, April, 2026

Video-Based Reward Modeling for Computer-Use Agents.
CoRR, March, 2026

DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning.
CoRR, February, 2026

Experiential Reinforcement Learning.
CoRR, February, 2026

One Model, All Roles: Multi-Turn, Multi-Agent Self-Play Reinforcement Learning for Conversational Social Intelligence.
CoRR, February, 2026

2025
CoAct-1: Computer-using Agents with Coding as Actions.
CoRR, August, 2025

The Hallucination Tax of Reinforcement Finetuning.
CoRR, May, 2025

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning.
CoRR, April, 2025

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base.
CoRR, March, 2025

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective.
CoRR, February, 2025

Detecting and Filtering Unsafe Training Data via Data Attribution.
CoRR, February, 2025

The Hallucination Tax of Reinforcement Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

STEER-BENCH: A Benchmark for Evaluating the Steerability of Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback.
CoRR, 2024

Safer-Instruct: Aligning Language Models with Automated Preference Data.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Can Language Model Moderators Improve the Health of Online Discourse?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

How Susceptible are Large Language Models to Ideological Manipulation?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Decentralized Inventory Transshipments with Quantal Response Equilibrium.
Syst., July, 2023

CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Neural Story Planning.
CoRR, 2022


  Loading...