Ziran Yang

According to our database1, Ziran Yang authored at least 14 papers between 2023 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI.
CoRR, May, 2026

Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning.
CoRR, May, 2026

Goedel-Code-Prover: Hierarchical Proof Search for Open State-of-the-Art Code Verification.
CoRR, March, 2026

AlgoVeri: An Aligned Benchmark for Verified Code Generation on Classical Algorithms.
CoRR, February, 2026

2025
Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction.
CoRR, August, 2025

Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Offline Reinforcement Learning for LLM Multi-step Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Offline Reinforcement Learning for LLM Multi-Step Reasoning.
CoRR, 2024

From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding.
CoRR, 2024

ChemSafetyBench: Benchmarking LLM Safety on Chemistry Domain.
CoRR, 2024

SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset.
CoRR, 2024

Panacea: Pareto Alignment via Preference Adaptation for LLMs.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models.
CoRR, 2023


  Loading...