Amir-massoud Farahmand

Affiliations:

Vector Institute, Toronto, ON, Canada
University of Toronto, ON, Canada
Mitsubishi Electric Research Laboratories (MERL) (former)

According to our database¹, Amir-massoud Farahmand authored at least 73 papers between 2004 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

Press Start to Charge: Videogaming the Online Centralized Charging Scheduling Problem.

[DOI]

Alireza Ghahtarani

,

Martin Cousineau

,

Amir-massoud Farahmand

,

Jorge E. Mendoza

CoRR, January, 2026

2025

Majority of the Bests: Improving Best-of-N via Bootstrapping.

[DOI]

,

,

,

Amir-massoud Farahmand

,

Amir Khasahmadi

CoRR, November, 2025

Relative Entropy Pathwise Policy Optimization.

[DOI]

,

Axel Brunnbauer

,

,

,

,

,

,

Amir-massoud Farahmand

,

Igor Gilitschenski

CoRR, July, 2025

Calibrated Value-Aware Model Learning with Stochastic Environment Models.

[DOI]

,

Anastasiia Pedan

,

,

,

Igor Gilitschenski

,

Amir-massoud Farahmand

CoRR, May, 2025

Deflated Dynamics Value Iteration.

[DOI]

,

,

,

Amir-massoud Farahmand

Trans. Mach. Learn. Res., 2025

Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients.

[DOI]

,

Allan Douglas Jepson

,

Amir-massoud Farahmand

Trans. Mach. Learn. Res., 2025

Calibrated Value-Aware Model Learning with Probabilistic Environment Models.

[DOI]

,

Anastasiia Pedan

,

,

,

Igor Gilitschenski

,

Amir-massoud Farahmand

Proceedings of the Forty-second International Conference on Machine Learning, 2025

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling.

[DOI]

,

,

Amir-massoud Farahmand

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Categorical Distributional Reinforcement Learning with Kullback-Leibler Divergence: Convergence and Asymptotics.

[DOI]

,

,

,

Murat A. Erdogdu

,

Amir-massoud Farahmand

Proceedings of the Forty-second International Conference on Machine Learning, 2025

MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL.

[DOI]

,

,

,

Amir-massoud Farahmand

,

Igor Gilitschenski

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

A Truncated Newton Method for Optimal Transport.

[DOI]

,

Amir-massoud Farahmand

,

Allan Douglas Jepson

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Dissecting Deep RL with High Update Ratios: Combatting Value Overestimation and Divergence.

[DOI]

,

,

Igor Gilitschenski

,

Amir-massoud Farahmand

,

CoRR, 2024

When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning.

[DOI]

,

,

Igor Gilitschenski

,

Amir-massoud Farahmand

RLJ, 2024

Dissecting Deep RL with High Update Ratios: Combatting Value Divergence.

[DOI]

,

,

Igor Gilitschenski

,

Amir-massoud Farahmand

,

RLJ, 2024

PID Accelerated Temporal Difference Algorithms.

[DOI]

,

,

Amir-massoud Farahmand

RLJ, 2024

Maximum Entropy Model Correction in Reinforcement Learning.

[DOI]

,

,

Mohammad Ghavamzadeh

,

Amir-massoud Farahmand

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Improving Adversarial Transferability via Model Alignment.

[DOI]

,

Amir-massoud Farahmand

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods.

[DOI]

,

,

Amir-massoud Farahmand

Trans. Mach. Learn. Res., 2023

λ-AC: Learning latent decision-aware models for reinforcement learning in continuous state-spaces.

[DOI]

,

,

,

Igor Gilitschenski

,

Amir-massoud Farahmand

CoRR, 2023

Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning.

[DOI]

,

Murat A. Erdogdu

,

Amir-massoud Farahmand

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

Understanding and mitigating the limitations of prioritized experience replay.

[DOI]

,

,

Amir-massoud Farahmand

,

,

,

,

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Operator Splitting Value Iteration.

[DOI]

,

,

Mohammad Ghavamzadeh

,

Amir-massoud Farahmand

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Value Gradient weighted Model-Based Reinforcement Learning.

[DOI]

,

,

,

Amir-massoud Farahmand

Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Object-Oriented Dynamics for Planning from Text.

[DOI]

,

Ashutosh Adhikari

,

Amir-massoud Farahmand

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Deep Reinforcement Learning for Online Control of Stochastic Partial Differential Equations.

[DOI]

,

Faraz Khoshbakhtian

,

Farnam Mansouri

,

Amir-massoud Farahmand

CoRR, 2021

PID Accelerated Value Iteration Algorithm.

[DOI]

Amir Massoud Farahmand

,

Mohammad Ghavamzadeh

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

The act of remembering: a study in partially observable reinforcement learning.

[DOI]

Rodrigo Toro Icarte

,

Richard Anthony Valenzano

,

Toryn Q. Klassen

,

Phillip J. K. Christoffersen

,

Amir-massoud Farahmand

,

Sheila A. McIlraith

CoRR, 2020

Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities.

[DOI]

,

,

,

Amir-massoud Farahmand

,

CoRR, 2020

Adversarial Robustness through Regularization: A Second-Order Approach.

[DOI]

,

,

Amir-massoud Farahmand

CoRR, 2020

Policy-Aware Model Learning for Policy Gradient Methods.

[DOI]

,

Mohammad Ghavamzadeh

,

Amir-massoud Farahmand

CoRR, 2020

An implicit function learning approach for parametric modal regression.

[DOI]

,

,

Amir-massoud Farahmand

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Frequency-based Search-control in Dyna.

[DOI]

,

,

Amir-massoud Farahmand

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Value Function in Frequency Domain and the Characteristic Value Iteration Algorithm.

[DOI]

Amir-massoud Farahmand

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Improving Skin Condition Classification with a Visual Symptom Checker Trained Using Reinforcement Learning.

[DOI]

,

Amir-massoud Farahmand

,

,

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Hill Climbing on Value Estimates for Search-control in Dyna.

[DOI]

,

,

Amir-massoud Farahmand

,

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Dimensionality Reduction for Representing the Knowledge of Probabilistic Models.

[DOI]

,

,

Amir-massoud Farahmand

,

,

Richard S. Zemel

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Improving Skin Condition Classification with a Question Answering Model.

[DOI]

,

Amir-massoud Farahmand

,

CoRR, 2018

Iterative Value-Aware Model Learning.

[DOI]

Amir-massoud Farahmand

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control.

[DOI]

,

Amir-massoud Farahmand

,

,

,

,

Daniel Nikovski

Proceedings of the 35th International Conference on Machine Learning, 2018

2017

Attentional Network for Visual Object Detection.

[DOI]

,

,

,

Amir-massoud Farahmand

CoRR, 2017

Learning to regulate rolling ball motion.

[DOI]

,

Daniel Nikovski

,

William Yerazunis

,

Amir-massoud Farahmand

Proceedings of the 2017 IEEE Symposium Series on Computational Intelligence, 2017

Random Projection Filter Bank for Time Series Data.

[DOI]

Amir-massoud Farahmand

,

Sepideh Pourazarm

,

Daniel Nikovski

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Deep reinforcement learning for partial differential equation control.

[DOI]

Amir-massoud Farahmand

,

,

Daniel Nikolaev Nikovski

Proceedings of the 2017 American Control Conference, 2017

Value-Aware Loss Function for Model-based Reinforcement Learning.

[DOI]

Amir Massoud Farahmand

,

,

Daniel Nikovski

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016

Regularized Policy Iteration with Nonparametric Function Spaces.

[DOI]

Amir-massoud Farahmand

,

Mohammad Ghavamzadeh

,

Csaba Szepesvári

,

J. Mach. Learn. Res., 2016

Learning to control partial differential equations: Regularized Fitted Q-Iteration approach.

[DOI]

Amir-massoud Farahmand

,

,

,

Daniel Nikovski

Proceedings of the 55th IEEE Conference on Decision and Control, 2016

Learning-based modular indirect adaptive control for a class of nonlinear systems.

[DOI]

Mouhacine Benosman

,

Amir-massoud Farahmand

,

Proceedings of the 2016 American Control Conference, 2016

Truncated Approximate Dynamic Programming with Task-Dependent Terminal Value.

[DOI]

Amir-massoud Farahmand

,

Daniel Nikolaev Nikovski

,

,

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Classification-Based Approximate Policy Iteration.

[DOI]

Amir-massoud Farahmand

,

,

André da Motta Salles Barreto

,

Mohammad Ghavamzadeh

IEEE Trans. Autom. Control., 2015

Reports of the AAAI 2014 Conference Workshops.

[DOI]

AI Mag., 2015

Approximate MaxEnt Inverse Optimal Control and Its Application for Mental Simulation of Human Interactions.

[DOI]

,

Amir-massoud Farahmand

,

,

James Andrew Bagnell

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Classification-based Approximate Policy Iteration: Experiments and Extended Discussions.

[DOI]

Amir-massoud Farahmand

,

,

André da Motta Salles Barreto

,

Mohammad Ghavamzadeh

CoRR, 2014

Sample-based approximate regularization.

[DOI]

,

Amir-massoud Farahmand

,

Proceedings of the 31st International Conference on Machine Learning, 2014

2013

Learning from Limited Demonstrations.

[DOI]

,

Amir-massoud Farahmand

,

,

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Bellman Error Based Feature Generation using Random Projections on Sparse Spaces.

[DOI]

Mahdi Milani Fard

,

,

Amir-massoud Farahmand

,

,

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012

Value Pursuit Iteration.

[DOI]

Amir Massoud Farahmand

,

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011

Model selection in reinforcement learning.

[DOI]

Amir Massoud Farahmand

,

Csaba Szepesvári

Mach. Learn., 2011

Action-Gap Phenomenon in Reinforcement Learning.

[DOI]

Amir Massoud Farahmand

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

2010

Interaction of Culture-Based Learning and Cooperative Co-Evolution and its Application to Automatic Behavior-Based System Design.

[DOI]

Amir Massoud Farahmand

,

Majid Nili Ahmadabadi

,

,

Babak Nadjar Araabi

IEEE Trans. Evol. Comput., 2010

Error Propagation for Approximate Policy and Value Iteration.

[DOI]

Amir Massoud Farahmand

,

,

Csaba Szepesvári

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Robust Jacobian estimation for uncalibrated visual servoing.

[DOI]

,

Amir Massoud Farahmand

,

Martin Jägersand

Proceedings of the IEEE International Conference on Robotics and Automation, 2010

2009

Model-based and model-free reinforcement learning for visual servoing.

[DOI]

Amir Massoud Farahmand

,

,

Martin Jägersand

,

Csaba Szepesvári

Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

Towards Learning Robotic Reaching and Pointing: An Uncalibrated Visual Servoing Approach.

[DOI]

,

Amir Massoud Farahmand

,

Martin Jägersand

Proceedings of the Sixth Canadian Conference on Computer and Robot Vision, 2009

Regularized Fitted Q-Iteration for planning in continuous-space Markovian decision problems.

[DOI]

Amir Massoud Farahmand

,

Mohammad Ghavamzadeh

,

Csaba Szepesvári

,

Proceedings of the American Control Conference, 2009

2008

Regularized Policy Iteration.

[DOI]

Amir Massoud Farahmand

,

Mohammad Ghavamzadeh

,

Csaba Szepesvári

,

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Regularized Fitted Q-Iteration: Application to Planning.

[DOI]

Amir Massoud Farahmand

,

Mohammad Ghavamzadeh

,

Csaba Szepesvári

,

Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

2007

Global visual-motor estimation for uncalibrated visual servoing.

[DOI]

Amir Massoud Farahmand

,

,

Martin Jägersand

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Manifold-adaptive dimension estimation.

[DOI]

Amir Massoud Farahmand

,

Csaba Szepesvári

,

Jean-Yves Audibert

Proceedings of the Twenty-Fourth International Conference on Machine Learning, 2007

2006

Channel Assignment using Chaotic Simulated Annealing Enhanced Hopfield Neural Network.

[DOI]

Amir Massoud Farahmand

,

Mohammad Javad Yazdanpanah

Proceedings of the International Joint Conference on Neural Networks, 2006

Learning to Coordinate Behaviors in Soft Behavior-Based Systems Using Reinforcement Learning.

[DOI]

Mohammad G. Azar

,

Majid Nili Ahmadabadi

,

Amir Massoud Farahmand

,

Babak Nadjar Araabi

Proceedings of the International Joint Conference on Neural Networks, 2006

Hybrid Behavior Co-evolution and Structure Learning in Behavior-based Systems.

[DOI]

Amir Massoud Farahmand

,

Majid Nili Ahmadabadi

,

,

Babak Nadjar Araabi

Proceedings of the IEEE International Conference on Evolutionary Computation, 2006

2005

Locally Optimal Takagi-Sugeno Fuzzy Controllers.

[DOI]

Amir Massoud Farahmand

,

Mohammad Javad Yazdanpanah

Proceedings of the 44th IEEE Conference on Decision and Control and 8th European Control Conference, 2005

2004

Behavior hierarchy learning in a behavior-based system using reinforcement learning.

[DOI]

Amir Massoud Farahmand

,

Majid Nili Ahmadabadi

,

Babak Nadjar Araabi

Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004
