Bo Dai

Affiliations:

Google Brain, USA
Georgia Institute of Technology, Atlanta, GA, USA (PhD)
Chinese Academy of Science, Institute of Automation, NLPR/LIAMA, Beijing, China (former)

According to our database¹, Bo Dai authored at least 124 papers between 2010 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Beyond Expectations: Learning with Stochastic Dominance Made Practical.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

DF2: Distribution-Free Decision-Focused Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Probabilistic Adaptation of Text-to-Video Models.

[BibT_eX]

[DOI]

CoRR, 2023

AdaPlanner: Adaptive Planning from Feedback with Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Universal Policies via Text-Guided Video Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2023

AdaPlanner: Adaptive Planning from Feedback with Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Ordering-based Conditions for Global Convergence of Policy Gradient Methods.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Universal Policies via Text-Guided Video Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Stochastic Gradient Succeeds for Bandits.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Score-based Continuous-time Discrete Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Any-scale Balanced Samplers for Discrete Space.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Spectral Decomposition Representation for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Latent Variable Representation for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding.

[BibT_eX]

[DOI]

Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Discrete Langevin Samplers via Wasserstein Gradient Flow.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Learning to Optimize with Stochastic Dominance Constraints.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Learning to Optimize with Stochastic Dominance Constraints.

[BibT_eX]

[DOI]

CoRR, 2022

Discrete Langevin Sampler via Wasserstein Gradient Flow.

[BibT_eX]

[DOI]

CoRR, 2022

SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition.

[BibT_eX]

[DOI]

CoRR, 2022

On the Effect of Log-Barrier Regularization in Decentralized Softmax Gradient Play in Multiagent Systems.

[BibT_eX]

[DOI]

CoRR, 2022

Can Small Heads Help? Understanding and Improving Multi-Task Generalization.

[BibT_eX]

[DOI]

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

A free lunch from the noise: Provable and practical exploration for representation learning.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2022

On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Role of Baselines in Policy Gradient Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Oracle Inequalities for Model Selection in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Making Linear MDPs Practical via Contrastive Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Model Selection in Batch Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Understanding and Leveraging Overparameterization in Recursive Value Estimation.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Neural Stochastic Dual Dynamic Programming.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

SMARTAVE: Structured Multimodal Transformer for Product Attribute Value Extraction.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Offline Policy Selection under Uncertainty.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

The Curse of Passive Data Collection in Batch Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-optimal Demonstrations.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data.

[BibT_eX]

[DOI]

CoRR, 2021

Towards understanding retrosynthesis by energy-based models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Nearly Horizon-Free Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Combiner: Full Attention Transformer with Sparse Computation Cost.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Understanding the Effect of Stochasticity in Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On the Optimality of Batch Policy Optimization Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Leveraging Non-uniformity in First-order Non-convex Optimization.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Overcoming Catastrophic Forgetting by Bayesian Generative Regularization.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Learning to Defend by Learning to Attack.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020

Small Towers Make Big Differences.

[BibT_eX]

[DOI]

CoRR, 2020

Energy-based View of Retrosynthesis.

[BibT_eX]

[DOI]

CoRR, 2020

Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach.

[BibT_eX]

[DOI]

CoRR, 2020

Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations.

[BibT_eX]

[DOI]

CoRR, 2020

Differentiable Top-k Operator with Optimal Transport.

[BibT_eX]

[DOI]

CoRR, 2020

Reinforcement Learning via Fenchel-Rockafellar Duality.

[BibT_eX]

[DOI]

Ofir Nachum

Bo Dai

CoRR, 2020

Off-Policy Imitation Learning from Observations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Off-Policy Evaluation via the Regularized Lagrangian.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Differentiable Top-k with Optimal Transport.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Escaping the Gravitational Pull of Softmax.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

CoinDICE: Off-Policy Confidence Interval Estimation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Energy-Based Processes for Exchangeable Data.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Batch Stationary Distribution Estimation.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Scalable Deep Generative Modeling for Sparse Graphs.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

GenDICE: Generalized Offline Estimation of Stationary Values.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Learning to Plan in High Dimensions via Neural Exploration-Exploitation Trees.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

AlgaeDICE: Policy Gradient from Arbitrary Experience.

[BibT_eX]

[DOI]

CoRR, 2019

Overcoming Catastrophic Forgetting by Generative Regularization.

[BibT_eX]

[DOI]

CoRR, 2019

Learning to Plan via Neural Exploration-Exploitation Trees.

[BibT_eX]

[DOI]

Binghong Chen

Bo Dai

Le Song

CoRR, 2019

Meta Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Energy-Inspired Models: Learning with Sampler-Induced Distributions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Exponential Family Estimation via Adversarial Dynamics Embedding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Retrosynthesis Prediction with Conditional Graph Logic Network.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Revisiting Auxiliary Latent Variables in Generative Models.

[BibT_eX]

[DOI]

Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Learning to Defense by Learning to Attack.

[BibT_eX]

[DOI]

Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Kernel Exponential Family Estimation via Doubly Dual Embedding.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018

Learning over functions, distributions and dynamics via stochastic optimization.

[BibT_eX]

[DOI]

Bo Dai

PhD thesis, 2018

Bayesian Meta-network Architecture Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Learning to Defense by Learning to Attack.

[BibT_eX]

[DOI]

CoRR, 2018

Learning Deep Hidden Nonlinear Dynamics from Aggregate Data.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Predictive Approximate Bayesian Computation via Saddle Points.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Cooperative neural networks (CoNN): Exploiting prior independence structure for improved classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning towards Minimum Hyperspherical Energy.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Coupled Variational Bayes via Optimization Embedding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Structured Inference for Recurrent Hidden Semi-markov Model.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Towards Black-box Iterative Machine Teaching.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Steady-States of Iterative Algorithms over Graphs.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Syntax-Directed Variational Autoencoder for Structured Data.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Boosting the Actor with Dual Critic.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Decoupled Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-scale Nystrom Method.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017

Smoothed Dual Embedding Control.

[BibT_eX]

[DOI]

CoRR, 2017

Deep Hyperspherical Learning.

[BibT_eX]

[DOI]

CoRR, 2017

Towards Black-box Iterative Machine Teaching.

[BibT_eX]

[DOI]

CoRR, 2017

Iterative Machine Teaching.

[BibT_eX]

[DOI]

CoRR, 2017

Deep Hyperspherical Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning from semantically dependent multi-tasks.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Iterative Machine Teaching.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Stochastic Generative Hashing.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Recurrent Hidden Semi-Markov Model.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Learning from Conditional Distributions via Dual Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016

A Context-Aware Framework for Reducing Bandwidth Usage of Mobile Video Chats.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2016

Learning from Conditional Distributions via Dual Kernel Embeddings.

[BibT_eX]

[DOI]

CoRR, 2016

Discriminative Embeddings of Latent Variable Models for Structured Data.

[BibT_eX]

[DOI]

Hanjun Dai

Bo Dai

Le Song

Proceedings of the 33nd International Conference on Machine Learning, 2016

Provable Bayesian Inference via Particle Mirror Descent.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015

Scalable Bayesian Inference via Particle Mirror Descent.

[BibT_eX]

[DOI]

CoRR, 2015

2014

Information-Theoretic Semi-Supervised Metric Learning via Entropy Regularization.

[BibT_eX]

[DOI]

Neural Comput., 2014

Scalable Kernel Methods via Doubly Stochastic Gradients.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Nonparametric Estimation of Multi-View Latent Variable Models.

[BibT_eX]

[DOI]

Le Song

Animashree Anandkumar

Bo Dai

Bo Xie

Proceedings of the 31th International Conference on Machine Learning, 2014

Transductive Learning with Multi-class Volume Approximation.

[BibT_eX]

[DOI]

Gang Niu

Bo Dai

Marthinus Christoffel du Plessis

Masashi Sugiyama

Proceedings of the 31th International Conference on Machine Learning, 2014

2013

Maximum volume clustering: a new discriminative clustering approach.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2013

Robust Low Rank Kernel Embeddings of Multivariate Distributions.

[BibT_eX]

[DOI]

Le Song

Bo Dai

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Squared-loss Mutual Information Regularization: A Novel Information-theoretic Approach to Semi-supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Machine Learning, 2013

2011

Maximum Volume Clustering.

[BibT_eX]