J. Andrew Bagnell

Affiliations:
  • Carnegie Mellon University, Pittsburgh, USA


According to our database1, J. Andrew Bagnell authored at least 156 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Hybrid Inverse Reinforcement Learning.
CoRR, 2024

2023
Inverse Reinforcement Learning without Reinforcement Learning.
CoRR, 2023

The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms.
Proceedings of the International Conference on Machine Learning, 2023

Inverse Reinforcement Learning without Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Hybrid RL: Using both offline and online data can make RL efficient.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Game-Theoretic Algorithms for Conditional Moment Matching.
CoRR, 2022

Minimax Optimal Online Imitation Learning via Replay Estimation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sequence Model Imitation Learning with Unobserved Contexts.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On the Effectiveness of Iterative Learning Control.
Proceedings of the Learning for Dynamics and Control Conference, 2022

Causal Imitation Learning under Temporally Correlated Noise.
Proceedings of the International Conference on Machine Learning, 2022

2021
A Critique of Strictly Batch Imitation Learning.
CoRR, 2021

Of Moments and Matching: Trade-offs and Treatments in Imitation Learning.
CoRR, 2021

Feedback in Imitation Learning: The Three Regimes of Covariate Shift.
CoRR, 2021

Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap.
Proceedings of the 38th International Conference on Machine Learning, 2021

CMAX++ : Leveraging Experience in Planning and Execution using Inaccurate Models.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Exploration in Action Space.
CoRR, 2020

Planning and Execution using Inaccurate Models with Provable Guarantees.
Proceedings of the Robotics: Science and Systems XVI, 2020

Tron: A Fast Solver for Trajectory Optimization with Non-Smooth Cost Functions.
Proceedings of the 59th IEEE Conference on Decision and Control, 2020

2019
Provably Efficient Imitation Learning from Observation Alone.
Proceedings of the 36th International Conference on Machine Learning, 2019

Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Learning Anytime Predictions in Neural Networks via Adaptive Loss Balancing.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
A convex polynomial model for planar sliding mechanics: theory, application, and experimental validation.
Int. J. Robotics Res., 2018

Shared autonomy via hindsight optimization for teleoperation and teaming.
Int. J. Robotics Res., 2018

An Algorithmic Perspective on Imitation Learning.
Found. Trends Robotics, 2018

Dual Policy Iteration.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Truncated horizon Policy Search: Combining Reinforcement Learning & Imitation Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Policy Gradient Methods.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

A Probabilistic Planning Framework for Planar Grasping Under Uncertainty.
IEEE Robotics Autom. Lett., 2017

Log-DenseNet: How to Sparsify a DenseNet.
CoRR, 2017

Ignoring Distractors in the Absence of Labels: Optimal Linear Projection to Remove False Positives During Anomaly Detection.
CoRR, 2017

Anytime Neural Networks via Joint Optimization of Auxiliary Losses.
CoRR, 2017

Autonomy infused teleoperation with application to brain computer interface controlled manipulation.
Auton. Robots, 2017

A Fast Stochastic Contact Model for Planar Pushing and Grasping: Theory and Experimental Validation.
Proceedings of the Robotics: Science and Systems XIII, 2017

Predictive-State Decoders: Encoding the Future into Recurrent Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction.
Proceedings of the 34th International Conference on Machine Learning, 2017

Gradient Boosting on Stochastic Data Streams.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
Robot Learning.
Proceedings of the Springer Handbook of Robotics, 2016

Robust Monocular Flight in Cluttered Outdoor Environments.
CoRR, 2016

Learning to Smooth with Bidirectional Predictive State Inference Machines.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Efficient Feature Group Sequencing for Anytime Linear Prediction.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Improved Learning of Dynamics Models for Control.
Proceedings of the International Symposium on Experimental Robotics, 2016

Learning Transferable Policies for Monocular Reactive MAV Control.
Proceedings of the International Symposium on Experimental Robotics, 2016

Introspective perception: Learning to predict failures in vision systems.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Inference Machines for Nonparametric Filter Learning.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Online Bellman Residual and Temporal Difference Algorithms with Predictive Error Guarantees.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

A convex polynomial force-motion model for planar sliding: Identification and application.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Learning to Filter with Predictive State Inference Machines.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Minimizing User Cost for Shared Autonomy.
Proceedings of the Eleventh ACM/IEEE International Conference on Human Robot Interation, 2016

A Discriminative Framework for Anomaly Detection in Large Videos.
Proceedings of the Computer Vision - ECCV 2016, 2016

Online Instrumental Variable Regression with Applications to Online Linear System Identification.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
CHIMP, the CMU Highly Intelligent Mobile Platform.
J. Field Robotics, 2015

Online Bellman Residual Algorithms with Predictive Error Guarantees.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Autonomy Infused Teleoperation with Application to BCI Manipulation.
Proceedings of the Robotics: Science and Systems XI, Sapienza University of Rome, 2015

Shared Autonomy via Hindsight Optimization.
Proceedings of the Robotics: Science and Systems XI, Sapienza University of Rome, 2015

Theoretical Limits of Speed and Resolution for Kinodynamic Planning in a Poisson Forest.
Proceedings of the Robotics: Science and Systems XI, Sapienza University of Rome, 2015

Visual chunking: A list prediction framework for region-based object detection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Movement primitives via optimization.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Predicting Multiple Structured Visual Interpretations.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Vision and Learning for Deliberative Monocular Cluttered Flight.
Proceedings of the Field and Service Robotics, 2015

Solving Games with Functional Regret Estimation.
Proceedings of the Computer Poker and Imperfect Information, 2015

A Unified View of Large-Scale Zero-Sum Equilibrium Computation.
Proceedings of the Computer Poker and Imperfect Information, 2015

Improving Multi-Step Prediction of Learned Time Series Models.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Approximate MaxEnt Inverse Optimal Control and Its Application for Mental Simulation of Human Interactions.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Submodular Surrogates for Value of Information.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Learning to Manipulate Unknown Objects in Clutter by Reinforcement.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Reinforcement and Imitation Learning via Interactive No-Regret Learning.
CoRR, 2014

Human-inspired force compliant grasping primitives.
Auton. Robots, 2014

Perceiving, learning, and exploiting object affordances for autonomous pile manipulation.
Auton. Robots, 2014

Pose Machines: Articulated Pose Estimation via Inference Machines.
Proceedings of the Computer Vision - ECCV 2014, 2014

Near Optimal Bayesian Active Learning for Decision Making.
Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014

Efficient Optimization for Autonomous Robotic Manipulation of Natural Objects.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
The Principle of Maximum Causal Entropy for Estimating Interacting Processes.
IEEE Trans. Inf. Theory, 2013

CHOMP: Covariant Hamiltonian optimization for motion planning.
Int. J. Robotics Res., 2013

Reinforcement learning in robotics: A survey.
Int. J. Robotics Res., 2013

Knapsack Constrained Contextual Submodular List Prediction with Application to Multi-document Summarization.
CoRR, 2013

SpeedMachines: Anytime Structured Prediction.
CoRR, 2013

Learning monocular reactive UAV control in cluttered natural environments.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Efficient temporal consistency for streaming video scene analysis.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Interactive segmentation, tracking, and kinematic modeling of unknown 3D articulated objects.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Clearing a pile of unknown objects using interactive perception.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Efficient touch based localization through submodularity.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Efficient 3-D scene analysis from streaming data.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Learning Policies for Contextual Submodular Prediction.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
SpeedBoost: Anytime Prediction with Uniform Near-Optimality.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Predicting Contextual Sequences via Submodular Function Maximization
CoRR, 2012

Robust Object Grasping using Force Compliant Motion Primitives.
Proceedings of the Robotics: Science and Systems VIII, 2012

Contextual Sequence Prediction with Application to Control Library Optimization.
Proceedings of the Robotics: Science and Systems VIII, 2012

Efficient high dimensional maximum entropy modeling via symmetric partition functions.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Probabilistic pointing target prediction via inverse optimal control.
Proceedings of the 17th International Conference on Intelligent User Interfaces, 2012

Learning Autonomous Driving Styles and Maneuvers from Expert Demonstration.
Proceedings of the Experimental Robotics, 2012

An integrated system for autonomous robotics manipulation.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Reinforcement Planning: RL for optimal planners.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Active learning from demonstration for robust autonomous navigation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Agnostic System Identification for Model-Based Reinforcement Learning.
Proceedings of the 29th International Conference on Machine Learning, 2012

Co-inference for Multi-modal Scene Analysis.
Proceedings of the Computer Vision - ECCV 2012, 2012

Activity Forecasting.
Proceedings of the Computer Vision - ECCV 2012, 2012

Detecting Interesting Events Using Unsupervised Density Ratio Estimation.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Efficient Optimization of Control Libraries.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Anytime online novelty and change detection for mobile robots.
J. Field Robotics, 2011

Optimization and learning for rough terrain legged locomotion.
Int. J. Robotics Res., 2011

Stability Conditions for Online Learnability
CoRR, 2011

3-D scene analysis via sequenced predictions over points and regions.
Proceedings of the IEEE International Conference on Robotics and Automation, 2011

Segmentation-based online change detection for mobile robots.
Proceedings of the IEEE International Conference on Robotics and Automation, 2011

Computational Rationalization: The Inverse Equilibrium Problem.
Proceedings of the 28th International Conference on Machine Learning, 2011

Generalized Boosting Algorithms for Convex Optimization.
Proceedings of the 28th International Conference on Machine Learning, 2011

Learning message-passing inference machines for structured prediction.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Policy Gradient Methods.
Proceedings of the Encyclopedia of Machine Learning, 2010

Learning for Autonomous Navigation.
IEEE Robotics Autom. Mag., 2010

Efficient Reductions for Imitation Learning.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Learning from Demonstration for Autonomous Navigation in Complex Unstructured Terrain.
Int. J. Robotics Res., 2010

Space-carving Kernels for Accurate Rough Terrain Estimation.
Int. J. Robotics Res., 2010

No-Regret Reductions for Imitation Learning and Structured Prediction
CoRR, 2010

An optimization approach to rough terrain locomotion.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Anytime online novelty detection for vehicle safeguarding.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Modeling Interaction via the Principle of Maximum Causal Entropy.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Boosted Backpropagation Learning for Training Deep Modular Networks.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Stacked Hierarchical Labeling.
Proceedings of the Computer Vision - ECCV 2010, 2010

Maximum Causal Entropy Correlated Equilibria for Markov Games.
Proceedings of the Interactive Decision Theory and Game Theory, 2010

2009
Inverse Optimal Heuristic Control for Imitation Learning.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Learning to search: Functional gradient techniques for imitation learning.
Auton. Robots, 2009

Convex Coding.
Proceedings of the UAI 2009, 2009

Accurate rough terrain estimation with space-carving kernels.
Proceedings of the Robotics: Science and Systems V, University of Washington, Seattle, USA, June 28, 2009

Perceptual Interpretation for Autonomous Navigation through Dynamic Imitation Learning.
Proceedings of the Robotics Research - The 14th International Symposium, 2009

Planning-based prediction for pedestrians.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

CHOMP: Gradient optimization techniques for efficient motion planning.
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

GATMO: A Generalized Approach to Tracking Movable Objects.
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

Bandit-Based Online Candidate Selection for Adjustable Autonomy.
Proceedings of the Field and Service Robotics, Results of the 7th International Conference, 2009

Applied Imitation Learning for Autonomous Navigation in Complex Natural Terrain.
Proceedings of the Field and Service Robotics, Results of the 7th International Conference, 2009

Contextual classification with functional Max-Margin Markov Networks.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Human Behavior Modeling with Maximum Entropy Inverse Optimal Control.
Proceedings of the Human Behavior Modeling, 2009

2008
Autonomous driving in urban environments: Boss and the Urban Challenge.
J. Field Robotics, 2008

Editorial: Special Issue on Machine Learning in Robotics.
Int. J. Robotics Res., 2008

High Performance Outdoor Navigation from Overhead Data using Imitation Learning.
Proceedings of the Robotics: Science and Systems IV, 2008

Differentiable Sparse Coding.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Adaptive workspace biasing for sampling-based planners.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Navigate like a cabbie: probabilistic reasoning from observed context-aware behavior.
Proceedings of the UbiComp 2008: Ubiquitous Computing, 10th International Conference, 2008

Fast Planning for Dynamic Preferences.
Proceedings of the Eighteenth International Conference on Automated Planning and Scheduling, 2008

Maximum Entropy Inverse Reinforcement Learning.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
(Approximate) Subgradient Methods for Structured Prediction.
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Learning Selectively Conditioned Forest Structures with Applications to DBNs and Classification.
Proceedings of the UAI 2007, 2007

Kernel Conjugate Gradient for Fast Kernel Machines.
Proceedings of the IJCAI 2007, 2007

Vegetation Detection for Driving in Complex Environments.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Imitation learning for locomotion and manipulation.
Proceedings of the 2007 7th IEEE-RAS International Conference on Humanoid Robots, November 29th, 2007

2006
Improving robot navigation through self-supervised online learning.
J. Field Robotics, 2006

Improving Robot Navigation Through Self-Supervised Online Learning.
Proceedings of the Robotics: Science and Systems II, 2006

Boosting Structured Prediction for Imitation Learning.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Experimental Analysis of Overhead Data Processing To Support Long Range Navigation.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Maximum margin planning.
Proceedings of the Machine Learning, 2006

2005
On Local Rewards and Scaling Distributed Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Learning Opportunity Costs in Multi-Robot Market Based Planners.
Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Robust Supervised Learning.
Proceedings of the Proceedings, 2005

2003
Policy Search by Dynamic Programming.
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Covariant Policy Search.
Proceedings of the IJCAI-03, 2003

2002
Learning with Scope, with Application to Information Extraction and Classification.
Proceedings of the UAI '02, 2002

2001
Autonomous Helicopter Control using Reinforcement Learning Policy Search Methods.
Proceedings of the 2001 IEEE International Conference on Robotics and Automation, 2001


  Loading...