Zhuoshi Pan

According to our database1, Zhuoshi Pan authored at least 23 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs.
CoRR, April, 2026

ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch.
CoRR, January, 2026

2025
OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value.
CoRR, December, 2025

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning.
CoRR, October, 2025

ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning.
CoRR, September, 2025

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning.
CoRR, July, 2025

REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once.
CoRR, July, 2025

Lightweight Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast.
CoRR, May, 2025

IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment.
CoRR, May, 2025

MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion.
CoRR, March, 2025

MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer.
CoRR, March, 2025

On Memory Construction and Retrieval for Personalized Conversational Agents.
CoRR, February, 2025

SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes Under Herd Behavior.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LEMMA: Learning from Errors for MatheMatical Advancement in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection.
CoRR, 2024

From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
From Trojan Horses to Castle Walls: Unveiling Bilateral Backdoor Effects in Diffusion Models.
CoRR, 2023


  Loading...