Fengshuo Bai

According to our database¹, Fengshuo Bai authored at least 21 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling.

[BibT_eX]

[DOI]

CoRR, October, 2025

DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation.

[BibT_eX]

[DOI]

CoRR, September, 2025

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective.

[BibT_eX]

[DOI]

CoRR, July, 2025

EuroCon: Benchmarking Parliament Deliberation for Political Consensus Finding.

[BibT_eX]

[DOI]

CoRR, May, 2025

Retrieval Dexterity: Efficient Object Retrieval in Clutters with Dexterous Hand.

[BibT_eX]

[DOI]

CoRR, February, 2025

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation.

[BibT_eX]

[DOI]

CoRR, February, 2025

STAR: Efficient Preference-based Reinforcement Learning via Dual Regularization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

β-DQN: Improving Deep Q-Learning By Evolving the Behavior.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AdaptFlow: Adaptive Workflow Optimization via Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems.

[BibT_eX]

[DOI]

Proceedings of the Artificial General Intelligence - 18th International Conference, 2025

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Efficient Model-agnostic Alignment via Bayesian Persuasion.

[BibT_eX]

[DOI]

CoRR, 2024

Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation.

[BibT_eX]

[DOI]

CoRR, 2024

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects.

[BibT_eX]

[DOI]

CoRR, 2024

PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Measuring Value Understanding in Language Models through Discriminator-Critique Gap.

[BibT_eX]

[DOI]

CoRR, 2023

Zero-shot Preference Learning for Offline RL via Optimal Transport.

[BibT_eX]

[DOI]

CoRR, 2023

PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Fengshuo Bai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...