Takumi Tanabe

According to our database, Takumi Tanabe authored at least 6 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
A Provable Approach for End-to-End Safe Reinforcement Learning.
CoRR, May, 2025

Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing.
CoRR, February, 2025

2024
Stepwise Alignment for Constrained Language Model Policy Optimization.
CoRR, 2024

Stepwise Alignment for Constrained Language Model Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Level generation for Angry Birds with sequential VAE and latent variable evolution.
Proceedings of the GECCO '21: Genetic and Evolutionary Computation Conference, 2021

