Kenji Doya

According to our database1, Kenji Doya authored at least 134 papers between 1989 and 2021.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2021
Maintaining the Publication Infrastructure in a Worldwide Pandemic.
Neural Networks, 2021

2020
Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks.
Neural Networks, 2020

Announcement of the Neural Networks Best Paper Award.
Neural Networks, 2020

Diffusion functional MRI reveals global brain network functional abnormalities driven by targeted local activity in a neuropsychiatric disease mouse model.
NeuroImage, 2020

Imitation learning based on entropy-regularized forward and inverse reinforcement learning.
CoRR, 2020

Variational Recurrent Models for Solving Partially Observable Control Tasks.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
MarmoNet: a pipeline for automated projection mapping of the common marmoset brain from whole-brain serial two-photon tomography.
CoRR, 2019

Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning.
CoRR, 2019

Emergence of Hierarchy via Reinforcement Learning Using a Multiple Timescale Stochastic RNN.
CoRR, 2019

Model-based Empowerment Computation for Dynamical Agents.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

Theoretical Analysis of Efficiency and Robustness of Softmax and Gap-Increasing Operators in Reinforcement Learning.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

An Experimental Study of Emergence of Communication of Reinforcement Learning Agents.
Proceedings of the Artificial General Intelligence - 12th International Conference, 2019

2018
Sigmoid-weighted linear units for neural network function approximation in reinforcement learning.
Neural Networks, 2018

Fostering deep learning and beyond.
Neural Networks, 2018

Connectivity inference from neural recording data: Challenges, mathematical bases and research directions.
Neural Networks, 2018

Unbounded Output Networks for Classification.
CoRR, 2018

Robustness of linearly solvable Markov games employing inaccurate dynamics model.
Artif. Life Robotics, 2018

PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos.
Proceedings of the 35th International Conference on Machine Learning, 2018

Online meta-learning by parallel algorithm competition.
Proceedings of the Genetic and Evolutionary Computation Conference, 2018

2017
Promoting Further Developments of Neural Networks.
Neural Networks, 2017

Adaptive Baseline Enhances EM-Based Policy Search: Validation in a View-Based Positioning Task of a Smartphone Balancer.
Frontiers Neurorobotics, 2017

Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming.
CoRR, 2017

Sparse kernel canonical correlation analysis for discovery of nonlinear interactions in high-dimensional data.
BMC Bioinform., 2017

Average Reward Optimization with Multiple Discounting Reinforcement Learners.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

2016
From free energy to expected energy: Improving energy-based value function approximation in reinforcement learning.
Neural Networks, 2016

State of Neural Networks Is Strong.
Neural Networks, 2016

EM-based policy hyper parameter exploration: application to standing and balancing of a two-wheeled smartphone robot.
Artif. Life Robotics, 2016

Emergence of communication among reinforcement learning agents under coordination environment.
Proceedings of the 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, 2016

2015
Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum.
PLoS Comput. Biol., 2015

Expected energy-based restricted Boltzmann machine for classification.
Neural Networks, 2015

Computational Complexity Reduction for Functional Connectivity Estimation in Large Scale Neural Network.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Resting state functional connectivity explains individual scores of multiple clinical measures for major depression.
Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

2014
Combining learned controllers to achieve new goals based on linearly solvable MDPs.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Inter Subject Correlation of Brain Activity during Visuo-Motor Sequence Learning.
Proceedings of the Neural Information Processing - 21st International Conference, 2014

Inverse reinforcement learning using Dynamic Policy Programming.
Proceedings of the 4th International Conference on Development and Learning and on Epigenetic Robotics, 2014

2013
Evaluation of linearly solvable Markov decision process with dynamic model learning in a mobile robot navigation task.
Frontiers Neurorobotics, 2013

Scaled free-energy based reinforcement learning for robust and efficient learning in high-dimensional state spaces.
Frontiers Neurorobotics, 2013

A model-based prediction of the calcium responses in the striatal synaptic spines depending on the timing of cortical and dopaminergic inputs and post-synaptic spikes.
Frontiers Comput. Neurosci., 2013

Reinforcement learning with state-dependent discount factor.
Proceedings of the 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics, 2013

2012
Loss of a Co-Editor-in-Chief and friend.
Neural Networks, 2012

Expedited review process.
Neural Networks, 2012

Changing the structure of complex visuo-motor sequences selectively activates the fronto-parietal network.
NeuroImage, 2012

MOSAIC for Multiple-Reward Environments.
Neural Comput., 2012

Neural Computations Supporting Cognition: Rumelhart Prize Symposium in Honor of Peter Dayan.
Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012

2011
Multi-scale, multi-modal neural modeling and simulation.
Neural Networks, 2011

An excellent year and a transition.
Neural Networks, 2011

Neurocomputational models of brain disorders.
Neural Networks, 2011

Darwinian embodied evolution of the learning ability for survival.
Adapt. Behav., 2011

2010
A Kinetic Model of Dopamine- and Calcium-Dependent Striatal Synaptic Plasticity.
PLoS Comput. Biol., 2010

A computational neural model of goal-directed utterance selection.
Neural Networks, 2010

Editorial for 2010.
Neural Networks, 2010

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
Neural Comput., 2010

Adaptive Selection of Learning Strategy for Autonomous Sequence Learning in Rats.
Aust. J. Intell. Inf. Process. Syst., 2010

Toward a Spiking-Neuron Model of the Oculomotor System.
Proceedings of the From Animals to Animats 11, 2010

Free-Energy Based Reinforcement Learning for Vision-Based Navigation with High-Dimensional Sensory Inputs.
Proceedings of the Neural Information Processing. Theory and Algorithms, 2010

Free-energy-based reinforcement learning in a partially observable environment.
Proceedings of the ESANN 2010, 2010

2009
Co-evolution of Rewards and Meta-parameters in Embodied Evolution.
Proceedings of the Creating Brain-Like Intelligence: From Basic Principles to Complex Intelligent Systems, 2009

New action editors join the journal! Five exciting special issues in the works!
Neural Networks, 2009

A hierarchical Bayesian method to resolve an inverse problem of MEG contaminated with eye movement artifacts.
NeuroImage, 2009

A Generalized Natural Actor-Critic Algorithm.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Emergence of Different Mating Strategies in Artificial Embodied Evolution.
Proceedings of the Neural Information Processing, 16th International Conference, 2009

Calcium Responses Model in Striatum Dependent on Timed Input Sources.
Proceedings of the Artificial Neural Networks, 2009

2008
Finding intrinsic rewards by embodied evolution and constrained reinforcement learning.
Neural Networks, 2008

Mini-special issue: ICONIP 2007.
Neural Networks, 2008

Neural Networks goes electronic at twenty!
Neural Networks, 2008

Combining Modalities with Different Latencies for Optimal Motor Control.
J. Cogn. Neurosci., 2008

Learning how, what, and whether to communicate: emergence of protocommunication in reinforcement learning agents.
Artif. Life Robotics, 2008

Natural actor-critic with baseline adjustment for variance reduction.
Artif. Life Robotics, 2008

Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning.
Adapt. Behav., 2008

A New Natural Policy Gradient by Stationary Distribution Metric.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

NeuroEvolution Based on Reusable and Hierarchical Modular Representation.
Proceedings of the Advances in Neuro-Information Processing, 15th International Conference, 2008

Robust Population Coding in Free-Energy-Based Reinforcement Learning.
Proceedings of the Artificial Neural Networks, 2008

2007
Evolutionary Development of Hierarchical Learning Structures.
IEEE Trans. Evol. Comput., 2007

Learning a dynamic policy by using policy gradient: application to biped walking.
Systems and Computers in Japan, 2007

Nitric Oxide Regulates Input Specificity of Long-Term Depression and Context Dependence of Cerebellar Learning.
PLoS Comput. Biol., 2007

Erratum to "Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics" [Neural Networks 19 (8) (2006) 1233-1241].
Neural Networks, 2007

Multiple model-based reinforcement learning explains dopamine neuronal activity.
Neural Networks, 2007

Reinforcement Learning State Estimator.
Neural Comput., 2007

Bayesian System Identification of Molecular Cascades.
Proceedings of the Neural Information Processing, 14th International Conference, 2007

Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents.
Proceedings of the Neural Information Processing, 14th International Conference, 2007

Estimating Internal Variables of a Decision Maker's Brain: A Model-Based Approach for Neuroscience.
Proceedings of the Neural Information Processing, 14th International Conference, 2007

Designing the Reward System: Computational and Biological Principles.
Proceedings of the IEEE Symposium on Foundations of Computational Intelligence, 2007

2006
Learning CPG-based biped locomotion with a policy gradient method.
Robotics Auton. Syst., 2006

Switching particle filters for efficient visual tracking.
Robotics Auton. Syst., 2006

Humans Can Adopt Optimal Discounting Strategy under Real-Time Constraints.
PLoS Comput. Biol., 2006

Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics.
Neural Networks, 2006

fMRI investigation of cortical and subcortical networks in the learning of abstract and effector-specific representations of motor sequences.
NeuroImage, 2006

Application of evolutionary computation for efficient reinforcement learning.
Appl. Artif. Intell., 2006

Hierarchical Chunking during Learning of Visuomotor Sequences.
Proceedings of the International Joint Conference on Neural Networks, 2006

2005
Evolution of recurrent neural controllers using an extended parallel genetic algorithm.
Robotics Auton. Syst., 2005

Robust Reinforcement Learning.
Neural Comput., 2005

The Cyber Rodent Project: Exploration of Adaptive Mechanisms for Self-Preservation and Self-Reproduction.
Adapt. Behav., 2005

Evolution of Neural Architecture Fitting Environmental Dynamics.
Adapt. Behav., 2005

Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion.
Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Biologically inspired embodied evolution of survival.
Proceedings of the IEEE Congress on Evolutionary Computation, 2005

2004
Reinforcement learning with via-point representation.
Neural Networks, 2004

Hierarchical Bayesian estimation for MEG inverse problem.
NeuroImage, 2004

Responding to Modalities with Different Latencies.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Multi-agent reinforcement learning: using macro actions to learn a mating task.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Switching Particle Filters for Efficient Real-time Visual Tracking.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Chunking Phenomenon in Complex Sequential Skill Learning in Humans.
Proceedings of the Neural Information Processing, 11th International Conference, 2004

2003
Meta-learning in Reinforcement Learning.
Neural Networks, 2003

Inter-module credit assignment in modular reinforcement learning.
Neural Networks, 2003

Different Cortico-Basal Ganglia Loops Specialize in Reward Prediction at Different Time Scales.
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Estimating Internal Variables and Paramters of a Learning Agent by a Particle Filter.
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Evolution of meta-parameters in reinforcement learning algorithm.
Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

An Evolutionary Approach to Automatic Construction of the Structure in Hierarchical Reinforcement Learning.
Proceedings of the Genetic and Evolutionary Computation, 2003

Evolving recurrent neural controllers for sequential tasks: a parallel implementation.
Proceedings of the IEEE Congress on Evolutionary Computation, CEC 2003, 8, 2003

2002
Introduction for 2002 Special Issue: Computational Models of Neuromodulation.
Neural Networks, 2002

Metalearning and neuromodulation.
Neural Networks, 2002

Multiple Model-Based Reinforcement Learning.
Neural Comput., 2002

2001
Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning.
Robotics Auton. Syst., 2001

Statistical characteristics of climbing fiber spikes necessary for efficient cerebellar learning.
Biol. Cybern., 2001

Multiple Forward Model Architecture for Sequence Processing.
Proceedings of the Sequence Learning - Paradigms, Algorithms, and Applications, 2001

2000
Reinforcement Learning in Continuous Time and Space.
Neural Comput., 2000

Multi-Agent Reinforcement Learning: An Approach Based on the Other Agent's Internal Model.
Proceedings of the 4th International Conference on Multi-Agent Systems, 2000

1999
What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
Neural Networks, 1999

1998
Near Saddle-Node Bifurcation Behavior as Dynamics in Working Memory for Goal-Directed Behavior.
Neural Comput., 1998

Hierarchical reinforcement learning for motion learning: learning 'stand-up' trajectories.
Adv. Robotics, 1998

Reinforcement learning of dynamic motor sequence: learning to stand up.
Proceedings of the Proceedings 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, 1998

A Model of the Electrophysiological Properties of the Inferior Olive Neurons.
Proceedings of the Fifth International Conference on Neural Information Processing, 1998

Hierarchical Reinforcement Learning of Low-Dimensional Subgoals and High-Dimensional Trajectories.
Proceedings of the Fifth International Conference on Neural Information Processing, 1998

A Sequence Learning Architecture Based on Cortico-Basal Ganglionic Loops and Reinforcement Learning.
Proceedings of the Fifth International Conference on Neural Information Processing, 1998

1996
Efficient Nonlinear Control with Actor-Tutor Architecture.
Proceedings of the Advances in Neural Information Processing Systems 9, 1996

1995
Dynamics of Attention as Near Saddle-Node Bifurcation Behavior.
Proceedings of the Advances in Neural Information Processing Systems 8, 1995

Temporal Difference Learning in Continuous Time and Space.
Proceedings of the Advances in Neural Information Processing Systems 8, 1995

1994
Dimension Reduction of Biological Neuron Models by Artificial Neural Networks.
Neural Comput., 1994

A Novel Reinforcement Model of Birdsong Vocalization Learning.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994

1993
A Hodgkin-Huxley Type Neuron Model That Learns Slow Non-Spike Oscillations.
Proceedings of the Advances in Neural Information Processing Systems 6, 1993

1992
Maaping Between Neural and Physical Activities of the Lobster Gastric Mill.
Proceedings of the Advances in Neural Information Processing Systems 5, [NIPS Conference, Denver, Colorado, USA, November 30, 1992

1991
Neural network model of temporal pattern memory.
Systems and Computers in Japan, 1991

Adaptive Synchronization of Neural and Physical Oscillators.
Proceedings of the Advances in Neural Information Processing Systems 4, 1991

1990
Memorizing hierarchical temporal patterns in analog neuron networks.
Proceedings of the IJCNN 1990, 1990

1989
Adaptive neural oscillator using continuous-time back-propagation learning.
Neural Networks, 1989


  Loading...