Weixun Wang

Orcid: 0000-0002-2727-8948

According to our database1, Weixun Wang authored at least 43 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization.
CoRR, 2024

PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ASN: action semantics network for multiagent reinforcement learning.
Auton. Agents Multi Agent Syst., October, 2023

Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Off-Beat Multi-Agent Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents.
Frontiers Inf. Technol. Electron. Eng., 2022

MARLlib: Extending RLlib for Multi-agent Reinforcement Learning.
CoRR, 2022

A2C is a special case of PPO.
CoRR, 2022

API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks.
CoRR, 2022

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization.
CoRR, 2022

Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Individual Reward Assisted Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

2021
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment.
CoRR, 2021

An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework.
CoRR, 2020

Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Efficient Deep Reinforcement Learning via Adaptive Policy Transfer.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Action Semantics Network: Considering the Effects of Actions in Multiagent Systems.
Proceedings of the 8th International Conference on Learning Representations, 2020

Efficient Deep Reinforcement Learning through Policy Transfer.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Agent Game Abstraction via Graph Attention Neural Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas.
Proceedings of the First International Conference on Distributed Artificial Intelligence, 2019

Learning Adaptive Display Exposure for Real-Time Advertising.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning.
CoRR, 2018

Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach.
CoRR, 2018

2012
Energy-Aware Scheduling and Dynamic Reconfiguration in Real-Time Systems.
Proceedings of the Handbook of Energy-Aware and Green Computing - Two Volume Set., 2012

System-Wide Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Multitasking Systems.
IEEE Trans. Very Large Scale Integr. Syst., 2012

Dynamic Cache Reconfiguration for Soft Real-Time Systems.
ACM Trans. Embed. Comput. Syst., 2012

TCEC: Temperature and Energy-Constrained Scheduling in Real-Time Multitasking Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2012

Energy-aware dynamic slack allocation for real-time multitasking systems.
Sustain. Comput. Informatics Syst., 2012

A Novel Approach for Handling Misbehaving Nodes in Behavior-Aware Mobile Networking
CoRR, 2012

2011
Energy-aware dynamic reconfiguration algorithms for real-time multitasking systems.
Sustain. Comput. Informatics Syst., 2011

Dynamic Reconfiguration of Two-Level Cache Hierarchy in Real-Time Embedded Systems.
J. Low Power Electron., 2011

A General Algorithm for Energy-Aware Dynamic Reconfiguration in Multitasking Systems.
Proceedings of the VLSI Design 2011: 24th International Conference on VLSI Design, 2011

Dynamic cache reconfiguration and partitioning for energy optimization in real-time multi-core systems.
Proceedings of the 48th Design Automation Conference, 2011

2010
Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Real-Time Systems.
Proceedings of the VLSI Design 2010: 23rd International Conference on VLSI Design, 2010

Temperature- and energy-constrained scheduling in multitasking systems: a model checking approach.
Proceedings of the 2010 International Symposium on Low Power Electronics and Design, 2010

PreDVS: preemptive dynamic voltage scaling for real-time systems using approximation scheme.
Proceedings of the 47th Design Automation Conference, 2010

2009
SACR: Scheduling-Aware Cache Reconfiguration for Real-Time Embedded Systems.
Proceedings of the VLSI Design 2009: Improving Productivity through Higher Abstraction, 2009

Dynamic Reconfiguration of Two-Level Caches in Soft Real-Time Embedded Systems.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2009


  Loading...