Yuki Ichihara

According to our database1, Yuki Ichihara authored at least 8 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Reliable Chain-of-Thought via Prefix Consistency.
CoRR, May, 2026

CITE: Anytime-Valid Statistical Inference in LLM Self-Consistency.
CoRR, May, 2026

Consensus Group Relative Policy Optimization for Text Generation.
CoRR, February, 2026

2025
MO-GRPO: Mitigating Reward Hacking of Group Relative Policy Optimization on Multi-Objective Problems.
CoRR, September, 2025

Evaluation of Best-of-N Sampling Strategies for Language Model Alignment.
Trans. Mach. Learn. Res., 2025

Auto-Weighted Group Relative Preference Optimization for Multi-Objective Text Generation Tasks.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Theoretical Guarantees for Minimum Bayes Risk Decoding.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees.
CoRR, 2024


  Loading...