Fengshuo Bai

According to our database, Fengshuo Bai authored at least 18 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
AdaptFlow: Adaptive Workflow Optimization via Meta-Learning.
CoRR, August, 2025

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective.
CoRR, July, 2025

EuroCon: Benchmarking Parliament Deliberation for Political Consensus Finding.
CoRR, May, 2025

Retrieval Dexterity: Efficient Object Retrieval in Clutters with Dexterous Hand.
CoRR, February, 2025

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation.
CoRR, February, 2025

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

β-DQN: Improving Deep Q-Learning By Evolving the Behavior.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems.
Proceedings of the Artificial General Intelligence - 18th International Conference, 2025

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Efficient Model-agnostic Alignment via Bayesian Persuasion.
CoRR, 2024

Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation.
CoRR, 2024

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects.
CoRR, 2024

PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Measuring Value Understanding in Language Models through Discriminator-Critique Gap.
CoRR, 2023

Zero-shot Preference Learning for Offline RL via Optimal Transport.
CoRR, 2023

PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

