Koki Wataoka

According to our database, Koki Wataoka authored at least 6 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2026
Predict, Don't React: Value-Based Safety Forecasting for LLM Streaming.
CoRR, April, 2026

2025
Foundation Models as Guardrails: LLM- and VLM-Based Approaches to Safety and Alignment.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

MergePrint: Merge-Resistant Fingerprints for Robust Black-box Ownership Verification of Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Self-Preference Bias in LLM-as-a-Judge.
CoRR, 2024

MergePrint: Robust Fingerprinting against Merging Large Language Models.
CoRR, 2024

2023
Verbosity Bias in Preference Labeling by Large Language Models.
CoRR, 2023

