Senjie Jin

According to our database¹, Senjie Jin authored at least 29 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees.

[BibT_eX]

[DOI]

CoRR, March, 2026

Locate, steer, and improve: A practical survey of actionable mechanistic interpretability in large language models.

[BibT_eX]

[DOI]

Comput. Sci. Rev., 2026

AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2026, 2026

MetaAct-RL: Training Language Models for Reasoning Through Meta-Action-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

What Makes a Good Speech Tokenizer for LLM-Centric Speech Generation? A Systematic Study.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Memory in the Age of AI Agents.

[BibT_eX]

[DOI]

CoRR, December, 2025

The Role of Entropy in Visual Grounding: Analysis and Optimization.

[BibT_eX]

[DOI]

CoRR, December, 2025

DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training.

[BibT_eX]

[DOI]

CoRR, December, 2025

AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress.

[BibT_eX]

[DOI]

CoRR, November, 2025

Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization.

[BibT_eX]

[DOI]

CoRR, September, 2025

VRPO: Rethinking Value Modeling for Robust RL Training under Noisy Supervision.

[BibT_eX]

[DOI]

CoRR, August, 2025

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.

[BibT_eX]

[DOI]

CoRR, July, 2025

Reinforcement Fine-Tuning Enables MLLMs Learning Novel Tasks Stably.

[BibT_eX]

[DOI]

CoRR, June, 2025

Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction.

[BibT_eX]

[DOI]

CoRR, June, 2025

EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection.

[BibT_eX]

[DOI]

CoRR, March, 2025

The rise and potential of large language model based agents: a survey.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2025

Parrot: A Training Pipeline Enhances Both Program CoT and Natural Language CoT for Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

MouSi: Poly-Visual-Expert Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Secrets of RLHF in Large Language Models Part II: Reward Modeling.

[BibT_eX]

[DOI]

CoRR, 2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

The Rise and Potential of Large Language Model Based Agents: A Survey.

[BibT_eX]

[DOI]

CoRR, 2023

Secrets of RLHF in Large Language Models Part I: PPO.

[BibT_eX]

[DOI]

CoRR, 2023

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement.

[BibT_eX]

[DOI]

CoRR, 2023

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Senjie Jin

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...