Takumi Tanabe

According to our database, Takumi Tanabe authored at least 8 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning.
CoRR, March, 2026

Cost-Minimized Label-Flipping Poisoning Attack to LLM Alignment.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
A Provable Approach for End-to-End Safe Reinforcement Learning.
CoRR, May, 2025

Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing.
CoRR, February, 2025

2024
Stepwise Alignment for Constrained Language Model Policy Optimization.
CoRR, 2024

Stepwise Alignment for Constrained Language Model Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Level generation for Angry Birds with sequential VAE and latent variable evolution.
Proceedings of the GECCO '21: Genetic and Evolutionary Computation Conference, 2021
