Peter Vamplew

Orcid: 0000-0002-8687-4424

Affiliations:
  • University of Ballarat, Victoria, Australia


According to our database, Peter Vamplew authored at least 89 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Value function interference and greedy action selection in value-based multi-objective reinforcement learning.
CoRR, 2024

Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning.
CoRR, 2024

An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments.
CoRR, 2024

2023
Persistent rule-based interactive reinforcement learning.
Neural Comput. Appl., November, 2023

Explainable robotic systems: understanding goal-driven actions in a reinforcement learning scenario.
Neural Comput. Appl., September, 2023

Human engagement providing evaluative and informative advice for interactive reinforcement learning.
Neural Comput. Appl., September, 2023

AI apology: interactive multi-objective reinforcement learning for human-aligned AI.
Neural Comput. Appl., August, 2023

Explainable reinforcement learning for broad-XAI: a conceptual framework and survey.
Neural Comput. Appl., August, 2023

A conceptual framework for externally-influenced agents: an assisted reinforcement learning review.
J. Ambient Intell. Humaniz. Comput., 2023

Intent-aligned AI systems deplete human agency: the need for agency foundations research in AI safety.
CoRR, 2023

Elastic step DDPG: Multi-step reinforcement learning for improved sample efficiency.
Proceedings of the International Joint Conference on Neural Networks, 2023

A Brief Guide to Multi-Objective Reinforcement Learning and Planning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Scalar Reward is Not Enough.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
The impact of environmental stochasticity on value-based multiobjective reinforcement learning.
Neural Comput. Appl., 2022

Discrete-to-deep reinforcement learning methods.
Neural Comput. Appl., 2022

An online scalarization multi-objective reinforcement learning algorithm: TOPSIS Q-learning.
Knowl. Eng. Rev., 2022

Broad-persistent Advice for Interactive Reinforcement Learning Scenarios.
CoRR, 2022

Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep QNetworks.
CoRR, 2022

Scalar reward is not enough: a response to Silver, Singh, Precup and Sutton (2021).
Auton. Agents Multi Agent Syst., 2022

A practical guide to multi-objective reinforcement learning and planning.
Auton. Agents Multi Agent Syst., 2022

Evaluating Human-like Explanations for Robot Actions in Reinforcement Learning Scenarios.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

2021
A Prioritized objective actor-critic method for deep reinforcement learning.
Neural Comput. Appl., 2021

Potential-based multiobjective reinforcement learning approaches to low-impact agents for AI safety.
Eng. Appl. Artif. Intell., 2021

Levels of explainable artificial intelligence for human-aligned conversational explanations.
Artif. Intell., 2021

Language Representations for Generalization in Reinforcement Learning.
Proceedings of the Asian Conference on Machine Learning, 2021

2020
A multi-objective deep reinforcement learning framework.
Eng. Appl. Artif. Intell., 2020

Explainable robotic systems: Interpreting outcome-focused actions in a reinforcement learning scenario.
CoRR, 2020

Discrete-to-Deep Supervised Policy Learning.
CoRR, 2020

A Demonstration of Issues with Value-Based Multiobjective Reinforcement Learning Under Stochastic State Transitions.
CoRR, 2020

Identifying Cross-Version Function Similarity Using Contextual Features.
Proceedings of the 19th IEEE International Conference on Trust, 2020

Unified Expression Ripple Down Rules based Fraud Detection Technique for Scalable Data.
Proceedings of the Advances in Data Mining, 2020

API Based Discrimination of Ransomware and Benign Cryptographic Programs.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

Motivational Factors of Australian Mobile Gamers.
Proceedings of the Australasian Computer Science Week, 2020

2019
Griefing in MMORPGs.
Proceedings of the Encyclopedia of Computer Graphics and Games, 2019

Survey of intrusion detection systems: techniques, datasets and challenges.
Cybersecur., 2019

Evolved Similarity Techniques in Malware Analysis.
Proceedings of the 18th IEEE International Conference On Trust, 2019

Enhancing Model Performance for Fraud Detection by Feature Engineering and Compact Unified Expressions.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2019

An Empirical Study of Reward Structures for Actor-Critic Reinforcement Learning in Air Combat Manoeuvring Simulation.
Proceedings of the AI 2019: Advances in Artificial Intelligence, 2019

Memory-Based Explainable Reinforcement Learning.
Proceedings of the AI 2019: Advances in Artificial Intelligence, 2019

Integrating Biological Heuristics and Gene Expression Data for Gene Regulatory Network Inference.
Proceedings of the Australasian Computer Science Week Multiconference, 2019

2018
Non-functional regression: A new challenge for neural networks.
Neurocomputing, 2018

Human-aligned artificial intelligence is a multiobjective problem.
Ethics Inf. Technol., 2018

SoniFight: Software to Provide Additional Sonification Cues to Video Games for Visually Impaired Players.
Comput. Games J., 2018

Correction to: Griefers Versus the Griefed - What Motivates Them to Play Massively Multiplayer Online Role-Playing Games?
Comput. Games J., 2018

Rapid Anomaly Detection Using Integrated Prudence Analysis (IPA).
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2018

An Anomaly Intrusion Detection System Using C5 Decision Tree Classifier.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2018

Categorical Features Transformation with Compact One-Hot Encoder for Fraud Detection in Distributed Environment.
Proceedings of the Data Mining - 16th Australasian Conference, AusDM 2018, Bathurst, NSW, 2018

2017
Steering approaches to Pareto-optimal multiobjective reinforcement learning.
Neurocomputing, 2017

Softmax exploration strategies for multiobjective reinforcement learning.
Neurocomputing, 2017

Special issue on multi-objective reinforcement learning.
Neurocomputing, 2017

A taxonomy of griefer type by motivation in massively multiplayer online role-playing games.
Behav. Inf. Technol., 2017

Evaluating Accuracy in Prudence Analysis for Cyber Security.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

An Agile Group Aware Process beyond CRISP-DM: A Hospital Data Mining Case Study.
Proceedings of the International Conference on Compute and Data Analysis, 2017

2016
A Heuristic Gene Regulatory Networks Model for Cardiac Function and Pathology.
Proceedings of the Computing in Cardiology, CinC 2016, Vancouver, 2016

2015
Patient admission prediction using a pruned fuzzy min-max neural network with rule extraction.
Neural Comput. Appl., 2015

Reinforcement Learning of Pareto-Optimal Multiobjective Policies Using Steering.
Proceedings of the AI 2015: Advances in Artificial Intelligence, 2015

2014
Griefers versus the Griefed - what motivates them to play Massively Multiplayer Online Role-Playing Games?
Comput. Games J., 2014

2013
A Survey of Multi-Objective Sequential Decision-Making.
J. Artif. Intell. Res., 2013

Ganking, corpse camping and ninja looting from the perception of the MMORPG community: acceptable behavior or unacceptable griefing?
Proceedings of the 9th Australasian Conference on Interactive Entertainment, 2013

2012
Using psycholinguistic features for profiling first language of authors.
J. Assoc. Inf. Sci. Technol., 2012

RM and RDM, a Preliminary Evaluation of Two Prudent RDR Techniques.
Proceedings of the Knowledge Management and Acquisition for Intelligent Systems, 2012

An Empirical Comparison of Two Common Multiobjective Reinforcement Learning Algorithms.
Proceedings of the AI 2012: Advances in Artificial Intelligence, 2012

2011
Empirical evaluation methods for multiobjective reinforcement learning algorithms.
Mach. Learn., 2011

Reinforcement Learning Approach to AIBO Robot's Decision Making Process in Robosoccer's Goal Keeper Problem.
Proceedings of the 12th ACIS International Conference on Software Engineering, 2011

2010
Automated opinion detection: Implications of the level of agreement between human raters.
Inf. Process. Manag., 2010

The Ballarat Incremental Knowledge Engine.
Proceedings of the Knowledge Management and Acquisition for Smart Systems and Services, 2010

2009
Incorporating Expert Advice into Reinforcement Learning Using Constructive Neural Networks.
Proceedings of the Constructive Neural Networks, 2009

Weblogs for market research: finding more relevant opinion documents using system fusion.
Online Inf. Rev., 2009

Applying Clustering and Ensemble Clustering Approaches to phishing Profiling.
Proceedings of the Eighth Australasian Data Mining Conference, AusDM 2009, Melbourne, 2009

Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks.
Proceedings of the AI 2009: Advances in Artificial Intelligence, 2009

Inference of Gene Expression Networks Using Memetic Gene Expression Programming.
Proceedings of the Computer Science 2009, 2009

2008
Unsupervised Segmentation of Industrial Images Using Markov Random Field Model.
Proceedings of the Technological Developments in Education and Automation, 2008

MRF Model Based Unsupervised Color Textured Image Segmentation Using Multidimensional Spatially Variant Finite Mixture Model.
Proceedings of the Technological Developments in Education and Automation, 2008

On the Limitations of Scalarisation for Multi-objective Reinforcement Learning of Pareto Fronts.
Proceedings of the AI 2008: Advances in Artificial Intelligence, 2008

Using Stereotypes to Improve Early-Match Poker Play.
Proceedings of the AI 2008: Advances in Artificial Intelligence, 2008

2007
Portal-based sound propagation for first-person computer games.
Proceedings of the 4th Australasian Conference on Interactive Entertainment, 2007

Unsupervised Color Textured Image Segmentation Using Cluster Ensembles and MRF Model.
Proceedings of the Advances in Computer and Information Sciences and Engineering, 2007

Using Corpus Analysis to Inform Research into Opinion Detection in Blogs.
Proceedings of the Data Mining and Analytics 2007, 2007

2006
More Effective Web Search Using Bigrams and Trigrams.
Webology, 2006

An efficient approach to unbounded bi-objective archives: introducing the mak_tree algorithm.
Proceedings of the Genetic and Evolutionary Computation Conference, 2006

Enhanced Temporal Difference Learning Using Compiled Eligibility Traces.
Proceedings of the AI 2006: Advances in Artificial Intelligence, 2006

2005
Concurrent Q-learning: Reinforcement learning for dynamic goals and environments.
Int. J. Intell. Syst., 2005

On-Line Reinforcement Learning Using Cascade Constructive Neural Networks.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005

The Combative Accretion Model - Multiobjective Optimisation Without Explicit Pareto Ranking.
Proceedings of the Evolutionary Multi-Criterion Optimization, 2005

Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning.
Proceedings of the AI 2005: Advances in Artificial Intelligence, 2005

Accelerating Real-Valued Genetic Algorithms Using Mutation-with-Momentum.
Proceedings of the AI 2005: Advances in Artificial Intelligence, 2005

An Anti-Plagiarism Editor for Software Development Courses.
Proceedings of the Seventh Australasian Computing Education Conference (ACE 2005), 2005

2003
A simplified artificial life model for multiobjective optimisation: a preliminary report.
Proceedings of the IEEE Congress on Evolutionary Computation, 2003

1995
Recognition and anticipation of hand motions using a recurrent neural network.
Proceedings of International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, November 27, 1995

