George Tucker

According to our database¹, George Tucker authored at least 53 papers between 2013 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Gemma: Open Models Based on Gemini Research and Technology.

[BibT_eX]

[DOI]

CoRR, 2024

Guided Evolution with Binary Discriminators for ML Program Search.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Gemini: A Family of Highly Capable Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2023

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios.

[BibT_eX]

[DOI]

IROS, 2023

Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Oracle Inequalities for Model Selection in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Model Selection in Batch Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Offline Policy Selection under Uncertainty.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization.

[BibT_eX]

[DOI]

CoRR, 2021

Coupled Gradient Estimators for Discrete Latent Variables.

[BibT_eX]

[DOI]

Zhe Dong

Andriy Mnih

George Tucker

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Benchmarks for Deep Off-Policy Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

RL Unplugged: Benchmarks for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Sergio Gómez Colmenarejo

CoRR, 2020

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems.

[BibT_eX]

[DOI]

CoRR, 2020

D4RL: Datasets for Deep Data-Driven Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Conservative Q-Learning for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DisARM: An Antithetic Gradient Estimator for Binary Latent Variables.

[BibT_eX]

[DOI]

Zhe Dong

Andriy Mnih

George Tucker

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Meta-Learning without Memorization.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Model Based Reinforcement Learning for Atari.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Behavior Regularized Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Yifan Wu

George Tucker

Ofir Nachum

CoRR, 2019

Reinforcement Learning Driven Heuristic Optimization.

[BibT_eX]

[DOI]

CoRR, 2019

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction.

[BibT_eX]

[DOI]

CoRR, 2019

Model-Based Reinforcement Learning for Atari.

[BibT_eX]

[DOI]

CoRR, 2019

Learning to Walk Via Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XV, 2019

Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Energy-Inspired Models: Learning with Sampler-Induced Distributions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Variational Bounds of Mutual Information.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Guided evolutionary strategies: augmenting random search with surrogate gradients.

[BibT_eX]

[DOI]

Jascha Sohl-Dickstein

Proceedings of the 36th International Conference on Machine Learning, 2019

The Laplacian in RL: Learning Representations with Efficient Approximations.

[BibT_eX]

[DOI]

Yifan Wu

George Tucker

Ofir Nachum

Proceedings of the 7th International Conference on Learning Representations, 2019

Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Understanding Posterior Collapse in Generative Latent Variable Models.

[BibT_eX]

[DOI]

Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Revisiting Auxiliary Latent Variables in Generative Models.

[BibT_eX]

[DOI]

Proceedings of the Deep Generative Models for Highly Structured Data, 2019

2018

Soft Actor-Critic Algorithms and Applications.

[BibT_eX]

[DOI]

CoRR, 2018

Guided evolutionary strategies: escaping the curse of dimensionality in random search.

[BibT_eX]

[DOI]

Niru Maheswaranathan

Luke Metz

George Tucker

Jascha Sohl-Dickstein

CoRR, 2018

Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Smoothed Action Value Functions for Learning Gaussian Policies.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

The Mirage of Action-Dependent Baselines in Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling.

[BibT_eX]

[DOI]

Carlos Riquelme

George Tucker

Jasper Snoek

Proceedings of the 6th International Conference on Learning Representations, 2018

Learning Hard Alignments with Variational Inference.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

An online sequence-to-sequence model for noisy speech recognition.

[BibT_eX]

[DOI]

CoRR, 2017

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.

[BibT_eX]

[DOI]

Jascha Sohl-Dickstein

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Filtering Variational Objectives.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.

[BibT_eX]

[DOI]

George Tucker

Andriy Mnih

Chris J. Maddison

Jascha Sohl-Dickstein

Proceedings of the 5th International Conference on Learning Representations, 2017

Regularizing Neural Networks by Penalizing Confident Output Distributions.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Particle Value Functions.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

Compacting Neural Network Classifiers via Dropout Training.

[BibT_eX]

[DOI]

Yotaro Kubo

George Tucker

Simon Wiesler

CoRR, 2016

Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting.

[BibT_eX]

[DOI]

Ming Sun

Anirudh Raju

George Tucker

Sankaran Panchapagesan

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Model Compression Applied to Small-Footprint Keyword Spotting.

[BibT_eX]

[DOI]

George Tucker

Minhua Wu

Ming Sun

Sankaran Panchapagesan

Gengshen Fu

Shiv Vitaladevuni

Proceedings of the Interspeech 2016, 2016

2014

Network topology and parameter estimation: from experimental design methods to gene regulatory network kinetics using a community based approach.

[BibT_eX]

[DOI]

BMC Syst. Biol., 2014

2013

A sampling framework for incorporating quantitative mass spectrometry data in protein interaction analysis.

[BibT_eX]

[DOI]

George Tucker

Po-Ru Loh

Bonnie Berger

BMC Bioinform., 2013

George Tucker

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...