Kazuteru Miyazaki

Orcid: 0000-0001-8175-213X

According to our database, Kazuteru Miyazaki authored at least 39 papers between 1997 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Proposal of a Course-Classification Support System Using Deep Learning and its Evaluation When Combined with Reinforcement Learning.
J. Adv. Comput. Intell. Intell. Informatics, March, 2024

Editorial: Cutting Edge of Reinforcement Learning and its Hybrid Methods.
J. Adv. Comput. Intell. Intell. Informatics, March, 2024

2022
Traffic Signal Control System Using Deep Reinforcement Learning With Emphasis on Reinforcing Successful Experiences.
IEEE Access, 2022

Modeling of placebo effect in stochastic reward tasks by reinforcement learning.
Proceedings of the 2022 Annual International Conference on Brain-Inspired Cognitive Architectures for Artificial Intelligence, 2022

2021
Proposal and evaluation of deep exploitation-oriented learning under multiple reward environment.
Cogn. Syst. Res., 2021

Home Energy Management Algorithm Based on Deep Reinforcement Learning Using Multistep Prediction.
IEEE Access, 2021

2020
Classification of Medical Data using Character-level CNN.
Proceedings of the ICISS 2020: The 3rd International Conference on Information Science and System, 2020

Application of Deep Reinforcement Learning to Decision-Making System based on Consciousness.
Proceedings of the 2020 Annual International Conference on Brain-Inspired Cognitive Architectures for Artificial Intelligence, 2020

2019
Deep Reinforcement Learning with Dual Targeting Algorithm.
Proceedings of the International Joint Conference on Neural Networks, 2019

2018
Consistency Assessment between Diploma Policy and Curriculum Policy using Character-Level CNN.
Proceedings of the 2018 Joint 10th International Conference on Soft Computing and Intelligent Systems (SCIS) and 19th International Symposium on Advanced Intelligent Systems (ISIS), 2018

Proposal of Detour Path Suppression Method in PS Reinforcement Learning and Its Application to Altruistic Multi-agent Environment.
Proceedings of the PRIMA 2018: Principles and Practice of Multi-Agent Systems - 21st International Conference, Tokyo, Japan, October 29, 2018

Proposal and Evaluation of an Indirect Reward Assignment Method for Reinforcement Learning by Profit Sharing Method.
Proceedings of the Intelligent Systems and Applications, 2018

A Proposal for Reducing the Number of Trial-and-Error Searches for Deep Q-Networks Combined with Exploitation-Oriented Learning.
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

2017
Proposal of PSwithEFP and its Evaluation in Multi-Agent Reinforcement Learning.
J. Adv. Comput. Intell. Intell. Informatics, 2017

Exploitation-Oriented Learning with Deep Learning - Introducing Profit Sharing to a Deep Q-Network -.
J. Adv. Comput. Intell. Intell. Informatics, 2017

Proposal of a Deep Q-network with Profit Sharing.
Proceedings of the 8th Annual International Conference on Biologically Inspired Cognitive Architectures, 2017

2016
Proposal and Evaluation of an Action Selection Strategy with Expected Failure Probability in Multi-agent Learning.
Proceedings of the IEEE International Conference on Agents, 2016

Proposal of an Action Selection Strategy with Expected Failure Probability and Its Evaluation in Multi-agent Reinforcement Learning.
Proceedings of the Multi-Agent Systems and Agreement Technologies, 2016

A Study of an Indirect Reward on Multi-agent Environments.
Proceedings of the 7th Annual International Conference on Biologically Inspired Cognitive Architectures, 2016

2014
The Necessity of a Secondary System in Machine Consciousness.
Proceedings of the 5th Annual International Conference on Biologically Inspired Cognitive Architectures, 2014

2013
Proposal of an Exploitation-oriented Learning Method on Multiple Rewards and Penalties Environments and the Design Guideline.
J. Comput., 2013

2012
Proposal of the Continuous-Valued Penalty Avoiding Rational Policy Making Algorithm.
J. Adv. Comput. Intell. Intell. Informatics, 2012

Introduction of Fixed Mode States into Online Reinforcement Learning with Penalties and Rewards and its Application to Biped Robot Waist Trajectory Generation.
J. Adv. Comput. Intell. Intell. Informatics, 2012

Proposal of an Active Course Classification Support system with Exploitation-oriented Learning extended by positive and negative examples.
Proceedings of the 6th International Conference on Soft Computing and Intelligent Systems (SCIS), 2012

Evaluation of the Improved Penalty Avoiding Rational Policy Making Algorithm in Real World Environment.
Proceedings of the Intelligent Information and Database Systems - 4th Asian Conference, 2012

2011
Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

2010
The Penalty Avoiding Rational Policy Making Algorithm in Continuous Action Spaces.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2010

2009
A New Improved Penalty Avoiding Rational Policy Making Algorithm for Keepaway with Continuous State Spaces.
J. Adv. Comput. Intell. Intell. Informatics, 2009

Exploitation-Oriented Learning PS-r#.
J. Adv. Comput. Intell. Intell. Informatics, 2009

2008
Proposal of Exploitation-Oriented Learning PS-r#.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2008

2007
Reinforcement Learning for Penalty Avoidance in Continuous State Spaces.
J. Adv. Comput. Intell. Intell. Informatics, 2007

2006
Multi User Learning Agent on the Distribution of MDPs.
Proceedings of the 15th IEEE International Symposium on Robot and Human Interactive Communication, 2006

2004
Development of a reinforcement learning system to play Othello.
Artif. Life Robotics, 2004

2001
Rationality of Reward Sharing in Multi-agent Reinforcement Learning.
New Gener. Comput., 2001

2000
Reinforcement learning for penalty avoiding policy making.
Proceedings of the IEEE International Conference on Systems, 2000

1999
Multi-agent Reinforcement Learning for Crane Control Problem: Designing Rewards for Conflict Resolution.
Proceedings of the Fourth International Symposium on Autonomous Decentralized Systems, 1999

1997
k-Certainty Exploration Method: An Action Selector to Identify the Environment in Reinforcement Learning.
Artif. Intell., 1997

Reinforcement Learning in POMDPs with Function Approximation.
Proceedings of the Fourteenth International Conference on Machine Learning (ICML 1997), 1997

