Hejian Sang

Orcid: 0009-0000-2001-677X

According to our database, Hejian Sang authored at least 18 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training.
CoRR, May, 2026

TIP: Token Importance in On-Policy Distillation.
CoRR, April, 2026

SODA: Semi On-Policy Black-Box Distillation for Large Language Models.
CoRR, April, 2026

PACED: Distillation and Self-Distillation at the Frontier of Student Competence.
CoRR, March, 2026

Not all tokens are needed (NAT): token efficient reinforcement learning.
CoRR, March, 2026

On-Policy Self-Distillation for Reasoning Compression.
CoRR, March, 2026

AriadneMem: Threading the Maze of Lifelong Memory for LLM Agents.
CoRR, March, 2026

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning.
CoRR, February, 2026

Scaling In-Context Online Learning Capability of LLMs via Cross-Episode Meta-RL.
CoRR, February, 2026

2025
Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation.
CoRR, December, 2025

Aligning Diffusion Language Models via Unpaired Preference Optimization.
CoRR, October, 2025

Scaling Up Efficient Small Language Models Serving and Deployment for Semantic Job Search.
CoRR, October, 2025

Debunk the Myth of SFT Generalization.
CoRR, October, 2025

Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs.
CoRR, September, 2025

Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications.
CoRR, February, 2025

PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications.
Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, 2025

Scaling Down, Serving Fast: Compressing and Deploying Efficient LLMs for Recommendation Systems.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2018
Adaptive Stochastic Gradient Langevin Dynamics: Taming Convergence and Saddle Point Escape Time.
CoRR, 2018

