Junwu Xiong

Orcid: 0009-0008-2028-510X

According to our database, Junwu Xiong authored at least 20 papers between 2009 and 2026.

Bibliography

2026
Understanding Agentic AI: Algorithms and Infrastructure.
IEEE CAA J. Autom. Sinica, April, 2026

Thousand-GPU Large-Scale Training and Optimization Recipe for AI-Native Cloud Embodied Intelligence Infrastructure.
CoRR, March, 2026

RL-VLA³: Reinforcement Learning VLA Accelerating via Full Asynchronism.
CoRR, February, 2026

2025
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways.
ACM Comput. Surv., July, 2025

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs.
CoRR, June, 2025

How Social is It? A Benchmark for LLMs' Capabilities in Multi-user Multi-turn Social Agent Tasks.
CoRR, May, 2025

2024
Model-Based Off-Policy Deep Reinforcement Learning With Model-Embedding.
IEEE Trans. Emerg. Top. Comput. Intell., August, 2024

Hummer: Towards Limited Competitive Preference Dataset.
CoRR, 2024

Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems.
CoRR, 2024

2022
Digital Human Interactive Recommendation Decision-Making Based on Reinforcement Learning.
CoRR, 2022

2021
Unit Ball Model for Hierarchical Embeddings in Complex Hyperbolic Space.
CoRR, 2021

2020
Model Embedding Model-Based Reinforcement Learning.
CoRR, 2020

Intention Propagation for Multi-agent Reinforcement Learning.
CoRR, 2020

Cost-Effective Incentive Allocation via Structured Counterfactual Inference.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Latent Dirichlet Allocation for Internet Price War.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
A Policy Gradient Method with Variance Reduction for Uplift Modeling.
CoRR, 2018

Personalized Behavior Prediction with Encoder-to-Decoder Structure.
Proceedings of the 2018 IEEE International Conference on Networking, 2018

2013
COCA: Constructing optimal clustering architecture to maximize sensor network lifetime.
Comput. Commun., 2013

2009
Constructing Optimal Clustering Architecture for Maximizing Lifetime in Large Scale Wireless Sensor Networks.
Proceedings of the 15th IEEE International Conference on Parallel and Distributed Systems, 2009

