Jiajun Chai

Orcid: 0000-0002-7611-064X

According to our database, Jiajun Chai authored at least 23 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Tacit mechanism: Bridging pre-training of individuality to multi-agent adversarial coordination.
Neural Networks, 2026

2025
MTIR-SQL: Multi-turn Tool-Integrated Reasoning Reinforcement Learning for Text-to-SQL.
CoRR, October, 2025

SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning.
CoRR, October, 2025

ResT: Reshaping Token-Level Policy Gradients for Tool-Use Large Language Models.
CoRR, September, 2025

RLFactory: A Plug-and-Play Reinforcement Learning Post-Training Framework for LLM Multi-Turn Tool-Use.
CoRR, September, 2025

Meta Learning Task Representation in Multiagent Reinforcement Learning: From Global Inference to Local Inference.
IEEE Trans. Neural Networks Learn. Syst., August, 2025

Promoting Efficient Reasoning with Verifiable Stepwise Reward.
CoRR, August, 2025

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning.
CoRR, June, 2025

DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy.
CoRR, June, 2025

RLAE: Reinforcement Learning-Assisted Ensemble for LLMs.
CoRR, June, 2025

LDR: Learning Discrete Representation to Improve Noise Robustness in Multiagent Tasks.
IEEE Trans. Syst. Man Cybern. Syst., January, 2025

Learning Pre-Trained Tacit Behavior for Efficient Multi-Agent Adversarial Coordination.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

INS: Interaction-aware Synthesis to Enhance Offline Multi-agent Reinforcement Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Empowering LLM Agents with Zero-Shot Optimal Decision-Making through Q-learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., December, 2024

CPEG: Leveraging Consistency Policy with Consensus Guidance for Multi-agent Exploration.
CoRR, 2024

Aligning Credit for Multi-Agent Cooperation via Model-based Counterfactual Imagination.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat.
IEEE Trans. Syst. Man Cybern. Syst., September, 2023

UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios.
IEEE Trans. Neural Networks Learn. Syst., April, 2023

2022
NVIF: Neighboring Variational Information Flow for Large-Scale Cooperative Multi-Agent Scenarios.
CoRR, 2022

UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios.
CoRR, 2022

Learning Continuous 3-DoF Air-to-Air Close-in Combat Strategy using Proximal Policy Optimization.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

LILAC: Learning a Leader for Cooperative Reinforcement Learning.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

