Yushi Bai

ORCID: 0009-0009-7611-1093

According to our database, Yushi Bai authored at least 39 papers between 2021 and 2025.

Bibliography

2025
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models.
CoRR, August, 2025

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning.
CoRR, June, 2025

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models.
CoRR, June, 2025

How does Transformer Learn Implicit Reasoning?
CoRR, May, 2025

Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models.
CoRR, May, 2025

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes.
CoRR, April, 2025

Shifting Long-Context LLMs Research from Input to Output.
CoRR, March, 2025

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models.
CoRR, February, 2025

Long Context vs. RAG: Strategies for Processing Long Documents in LLMs.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-Context QA.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Pre-training Distillation for Large Language Models: A Design Space Exploration.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs.
CoRR, 2024

Finding Safety Neurons in Large Language Models.
CoRR, 2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools.
CoRR, 2024

DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning.
CoRR, 2024

Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation.
CoRR, 2024

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations.
CoRR, 2024

Text-image conditioned diffusion for consistent text-to-3D generation.
Comput. Aided Geom. Des., 2024

Automating Dataset Updates Towards Reliable and Timely Evaluation of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024


Large Language Models Can Be Contextual Privacy Protection Learners.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LongAlign: A Recipe for Long Context Alignment of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs.
CoRR, 2023

T³Bench: Benchmarking Current Progress in Text-to-3D Generation.
CoRR, 2023

Large Language Models Can Be Good Privacy Protection Learners.
CoRR, 2023

Benchmarking Foundation Models with Language-Model-as-an-Examiner.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Answering Complex Logical Queries on Knowledge Graphs via Query Computation Tree Optimization.
Proceedings of the International Conference on Machine Learning, 2023

2022
Fair Allocations for Smoothed Utilities.
Proceedings of the EC '22: The 23rd ACM Conference on Economics and Computation, Boulder, CO, USA, July 11, 2022

Envy-Free and Pareto-Optimal Allocations for Agents with Asymmetric Random Valuations.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

SQUIRE: A Sequence-to-sequence Framework for Multi-hop Knowledge Graph Reasoning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Envy-Free and Pareto-Optimal Allocations for Asymmetric Agents.
CoRR, 2021

Modeling Heterogeneous Hierarchies with Relation-specific Hyperbolic Cones.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

