Bei Peng

Orcid: 0000-0003-0152-3180

Affiliations:
  • University of Liverpool, UK
  • University of Oxford, UK (former)
  • Washington State University, Pullman, WA, USA (PhD 2018)


According to our database1, Bei Peng authored at least 34 papers between 2012 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Deep Reinforcement Learning for Continuous Control of Material Thickness.
Proceedings of the Artificial Intelligence XL, 2023

2022
Curriculum Learning for Relative Overgeneralization.
CoRR, 2022

Accelerating Laboratory Automation Through Robot Skill Learning For Sample Scraping.
CoRR, 2022

Dependable learning-enabled multiagent systems.
AI Commun., 2022

2021
Special issue on adaptive and learning agents 2018.
Knowl. Eng. Rev., 2021

Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients.
CoRR, 2021

Softmax with Regularization: Better Value Estimation in Multi-Agent Reinforcement Learning.
CoRR, 2021

FACMAC: Factored Multi-Agent Centralised Policy Gradients.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Regularized Softmax Deep Multi-Agent Q-Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

RODE: Learning Roles to Decompose Multi-Agent Tasks.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Special issue on adaptive and learning agents 2019.
Knowl. Eng. Rev., 2020

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey.
J. Mach. Learn. Res., 2020

Weighted QMIX: Expanding Monotonic Value Function Factorisation.
CoRR, 2020

AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning.
CoRR, 2020

Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control.
CoRR, 2020

Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Optimistic Exploration even with a Pessimistic Initialisation.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
VIABLE: Fast Adaptation via Backpropagating Learned Loss.
CoRR, 2019

Interactive Learning of Environment Dynamics for Sequential Tasks.
CoRR, 2019

2018
Curriculum Design for Machine Learners in Sequential Decision Tasks.
IEEE Trans. Emerg. Top. Comput. Intell., 2018

2017
Interactive Learning from Policy-Dependent Human Feedback.
CoRR, 2017

Interactive Learning from Policy-Dependent Human Feedback.
Proceedings of the 34th International Conference on Machine Learning, 2017

How Do Humans Teach: On Curriculum Design for Machine Learners.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

2016
Learning behaviors via human-delivered discrete feedback: modeling implicit feedback strategies to speed up learning.
Auton. Agents Multi Agent Syst., 2016

A Need for Speed: Adapting Agent Action Speed to Improve Task Learning from Non-Expert Humans.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Towards Behavior-Aware Model Learning from Human-Generated Trajectories.
Proceedings of the 2016 AAAI Fall Symposia, Arlington, Virginia, USA, November 17-19, 2016, 2016

2015
Towards Integrating Real-Time Crowd Advice with Reinforcement Learning.
Proceedings of the 20th International Conference on Intelligent User Interfaces Companion, 2015

On the Ability to Provide Demonstrations on a UAS: Observing 90 Untrained Participants Abusing a Flying Robot.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

Generating Real-Time Crowd Advice to Improve Reinforcement Learning Agents.
Proceedings of the Learning for General Competency in Video Games, 2015

2014
Learning something from nothing: Leveraging implicit human feedback strategies.
Proceedings of the 23rd IEEE International Symposium on Robot and Human Interactive Communication, 2014

A Strategy-Aware Technique for Learning Behaviors from Discrete Human Feedback.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2012
A GPU-Based Accelerator for Chinese Word Segmentation.
Proceedings of the Web Technologies and Applications - 14th Asia-Pacific Web Conference, 2012


  Loading...