Fengshuo Bai

According to our database1, Fengshuo Bai authored at least 21 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling.
CoRR, October, 2025

DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation.
CoRR, September, 2025

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective.
CoRR, July, 2025

EuroCon: Benchmarking Parliament Deliberation for Political Consensus Finding.
CoRR, May, 2025

Retrieval Dexterity: Efficient Object Retrieval in Clutters with Dexterous Hand.
CoRR, February, 2025

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation.
CoRR, February, 2025

STAR: Efficient Preference-based Reinforcement Learning via Dual Regularization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

β-DQN: Improving Deep Q-Learning By Evolving the Behavior.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AdaptFlow: Adaptive Workflow Optimization via Meta-Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems.
Proceedings of the Artificial General Intelligence - 18th International Conference, 2025

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Efficient Model-agnostic Alignment via Bayesian Persuasion.
CoRR, 2024

Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation.
CoRR, 2024

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects.
CoRR, 2024

PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Measuring Value Understanding in Language Models through Discriminator-Critique Gap.
CoRR, 2023

Zero-shot Preference Learning for Offline RL via Optimal Transport.
CoRR, 2023

PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


  Loading...