Takumi Tanabe

According to our database¹, Takumi Tanabe authored at least 8 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, March, 2026

Cost-Minimized Label-Flipping Poisoning Attack to LLM Alignment.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

A Provable Approach for End-to-End Safe Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing.

[BibT_eX]

[DOI]

CoRR, February, 2025

2024

Stepwise Alignment for Constrained Language Model Policy Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

Stepwise Alignment for Constrained Language Model Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2022

Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Level generation for angry birds with sequential VAE and latent variable evolution.

[BibT_eX]

[DOI]

Proceedings of the GECCO '21: Genetic and Evolutionary Computation Conference, 2021

Takumi Tanabe

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...