Jiajun Chai
ORCID: 0000-0002-7611-064X
According to our database, Jiajun Chai authored at least 37 papers between 2022 and 2026.
Bibliography
2026
AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning.
CoRR, April, 2026
π-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data.
CoRR, April, 2026
CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling.
CoRR, March, 2026
CoRR, March, 2026
SAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training.
CoRR, March, 2026
CPIG: Leveraging Consistency Policy With Intention Guidance for Multiagent Exploration.
IEEE Trans. Cogn. Dev. Syst., February, 2026
CoRR, February, 2026
Tacit mechanism: Bridging pre-training of individuality to multi-agent adversarial coordination.
Neural Networks, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration of Reasoning Rewards.
CoRR, December, 2025
CoRR, December, 2025
CoRR, December, 2025
CoRR, December, 2025
CoRR, November, 2025
MTIR-SQL: Multi-turn Tool-Integrated Reasoning Reinforcement Learning for Text-to-SQL.
CoRR, October, 2025
SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning.
CoRR, October, 2025
CoRR, September, 2025
RLFactory: A Plug-and-Play Reinforcement Learning Post-Training Framework for LLM Multi-Turn Tool-Use.
CoRR, September, 2025
Meta Learning Task Representation in Multiagent Reinforcement Learning: From Global Inference to Local Inference.
IEEE Trans. Neural Networks Learn. Syst., August, 2025
SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning.
CoRR, June, 2025
LDR: Learning Discrete Representation to Improve Noise Robustness in Multiagent Tasks.
IEEE Trans. Syst. Man Cybern. Syst., January, 2025
Learning Pre-Trained Tacit Behavior for Efficient Multi-Agent Adversarial Coordination.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
INS: Interaction-aware Synthesis to Enhance Offline Multi-agent Reinforcement Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
UIOrchestra: Generating High-Fidelity Code from UI Designs with a Multi-agent System.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
2024
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., December, 2024
CPEG: Leveraging Consistency Policy with Consensus Guidance for Multi-agent Exploration.
CoRR, 2024
Aligning Credit for Multi-Agent Cooperation via Model-based Counterfactual Imagination.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
2023
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat.
IEEE Trans. Syst. Man Cybern. Syst., September, 2023
IEEE Trans. Neural Networks Learn. Syst., April, 2023
2022
NVIF: Neighboring Variational Information Flow for Large-Scale Cooperative Multi-Agent Scenarios.
CoRR, 2022
CoRR, 2022
Learning Continuous 3-DoF Air-to-Air Close-in Combat Strategy using Proximal Policy Optimization.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022