Seungyub Han

Orcid: 0009-0001-8704-8968

According to our database1, Seungyub Han authored at least 13 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Lyapunov-Guided Self-Alignment: Test-Time Adaptation for Offline Safe Reinforcement Learning.
CoRR, April, 2026

Learning graph based individual intrinsic reward for multi-agent reinforcement learning.
ICT Express, 2026

2025
Deterministic Uncertainty Estimation for Multi-Modal Regression With Deep Neural Networks.
IEEE Access, 2025

Pareto Optimal Risk-Agnostic Distributional Bandits with Heavy-Tail Rewards.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Policy-labeled Preference Learning: Is Preference Enough for RLHF?
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation.
CoRR, 2024

D2NAS: Efficient Neural Architecture Search With Performance Improvement and Model Size Reduction for Diverse Tasks.
IEEE Access, 2024

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Learning to Learn Unlearned Feature for Brain Tumor Segmentation.
CoRR, 2023

On the Convergence of Continual Learning with Adaptive Methods.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


  Loading...