Yuandong Tian

Zechun Liu

Beidi Chen

CoRR, January, 2026

2025

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models.

[BibT_eX]

[DOI]

CoRR, December, 2025

The Path Not Taken: RLVR Provably Learns Off the Principals.

[BibT_eX]

[DOI]

CoRR, November, 2025

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

MobileLLM-R1: Exploring the Limits of Sub-Billion Language Model Reasoners with Open Training Recipes.

[BibT_eX]

[DOI]

Yangyang Shi

CoRR, September, 2025

Emergence of Superposition: Unveiling the Training Dynamics of Chain of Continuous Thought.

[BibT_eX]

[DOI]

CoRR, September, 2025

Provable Scaling Laws of Feature Emergence from Learning Dynamics of Grokking.

[BibT_eX]

[DOI]

CoRR, September, 2025

Positional Encoding via Token-Aware Phase Attention.

[BibT_eX]

[DOI]

CoRR, September, 2025

Inpainting-Guided Policy Optimization for Diffusion Large Language Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

Language Self-Play For Data-Free Training.

[BibT_eX]

[DOI]

CoRR, September, 2025

Deep Think with Confidence.

[BibT_eX]

[DOI]

CoRR, August, 2025

ParamΔ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost.

[BibT_eX]

[DOI]

CoRR, April, 2025

GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection.

[BibT_eX]

[DOI]

CoRR, April, 2025

SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks.

[BibT_eX]

[DOI]

CoRR, March, 2025

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions.

[BibT_eX]

[DOI]

CoRR, February, 2025

Spectral Journey: How Transformers Predict the Shortest Path.

[BibT_eX]

[DOI]

CoRR, February, 2025

LLM Pretraining with Continuous Concepts.

[BibT_eX]

[DOI]

CoRR, February, 2025

SHARP: Accelerating Language Model Inference by SHaring Adjacent layers with Recovery Parameters.

[BibT_eX]

[DOI]

CoRR, February, 2025

GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity?

[BibT_eX]

[DOI]

CoRR, February, 2025

ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization.

[BibT_eX]

[DOI]

Tijmen Blankevoort

CoRR, February, 2025

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback.

[BibT_eX]

[DOI]

CoRR, January, 2025

Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition.

[BibT_eX]

[DOI]

CoRR, January, 2025

Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

AdvPrefix: An Objective for Nuanced LLM Jailbreaks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

ParetoQ: Improving Scaling Laws in Extremely Low-bit LLM Quantization.

[BibT_eX]

[DOI]

Tijmen Blankevoort

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Agent-as-a-Judge: Evaluate Agents with Agents.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

GSM-∞: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length?

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking.

[BibT_eX]

[DOI]

Paria Rashidinejad

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SpinQuant: LLM Quantization with Learned Rotations.

[BibT_eX]

[DOI]

Tijmen Blankevoort

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards General-Purpose Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MagicPIG: LSH Sampling for Efficient LLM Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ParamΔ for Direct Mixing: Post-Train Large Language Model At Zero Cost.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

You Only Use Reactive Attention Slice When Retrieving From Long Context.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.

[BibT_eX]

[DOI]

Proceedings of the Conference on Parsimony and Learning, 2025

2024

Training Large Language Models to Reason in a Continuous Latent Space.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning.

[BibT_eX]

[DOI]

CoRR, 2024

Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets.

[BibT_eX]

[DOI]

CoRR, 2024

The Perfect Blend: Redefining RLHF with Mixture of Judges.

[BibT_eX]

[DOI]

CoRR, 2024

You Only Use Reactive Attention Slice For Long Context Retrieval.

[BibT_eX]

[DOI]

CoRR, 2024

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients.

[BibT_eX]

[DOI]

CoRR, 2024

Holding the Line: A Study of Writers' Attitudes on Co-creativity with AI.

[BibT_eX]

[DOI]

CoRR, 2024

TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding.

[BibT_eX]

[DOI]

CoRR, 2024

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping.

[BibT_eX]

[DOI]

CoRR, 2024

Diffusion World Model.

[BibT_eX]

[DOI]

CoRR, 2024

Image Classifier Based Generative Method for Planar Antenna Design.

[BibT_eX]

[DOI]

CoRR, 2024

Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

On the Surprising Effectiveness of Attention Transfer for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

TravelPlanner: A Benchmark for Real-World Planning with Language Agents.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases.

[BibT_eX]

[DOI]

Liangzhen Lai

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Contrastive Predict-and-Search for Mixed Integer Linear Programs.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

GenCO: Generating Diverse Designs with Combinatorial Constraints.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

LoCoCo: Dropping In Convolutions for Long Context Compression.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

RLCD: Reinforcement Learning from Contrastive Distillation for LM Alignment.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Efficient Streaming Language Models with Attention Sinks.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP and Attention.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

H-GAP: Humanoid Control with a Generalist Planner.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning Personalized Alignment for Evaluating Open-ended Text Generation.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

To the Globe (TTG): Towards Language-Driven Guaranteed Travel Planning.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

2023

Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

End-to-end Story Plot Generator.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Personalized Story Evaluation.

[BibT_eX]

[DOI]

CoRR, 2023

GenCO: Generating Diverse Solutions to Design Problems with Combinatorial Nature.

[BibT_eX]

[DOI]

CoRR, 2023

RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment.

[BibT_eX]

[DOI]

CoRR, 2023

Extending Context Window of Large Language Models via Positional Interpolation.

[BibT_eX]

[DOI]

CoRR, 2023

H<sub>2</sub>O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

A Cookbook of Self-Supervised Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Modeling Scattering Coefficients using Self-Attentive Complex Polynomials with Image-based Representation.

[BibT_eX]

[DOI]

CoRR, 2023

Klotski: Efficient and Safe Network Migration of Large Production Datacenters.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGCOMM 2023 Conference, 2023

DyFormer : A Scalable Dynamic Graph Transformer with Provable Benefits on Generalization Ability.

[BibT_eX]

[DOI]

Chun-cheng Jason Chen

Mehrdad Mahdavi

Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models.

[BibT_eX]

[DOI]

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time.

[BibT_eX]

[DOI]

Anshumali Shrivastava

Proceedings of the International Conference on Machine Learning, 2023

Learning Compiler Pass Orders using Coreset and Normalized Value Prediction.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Searching Large Neighborhoods for Integer Linear Programs with Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

SurCo: Learning Linear SURrogates for COmbinatorial Nonlinear Optimization Problems.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Understanding the Role of Nonlinearity in Training Dynamics of Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Efficient Planning in a Compact Latent Action Space.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

AutoCAT: Reinforcement Learning for Automated Exploration of Cache-Timing Attacks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

Local Branching Relaxation Heuristics for Integer Linear Programs.

[BibT_eX]

[DOI]

Proceedings of the Integration of Constraint Programming, Artificial Intelligence, and Operations Research, 2023

DOC: Improving Long Story Coherence With Detailed Outline Control.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Sample-Efficient Neural Architecture Search by Learning Actions for Monte Carlo Tree Search.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational Data.

[BibT_eX]

[DOI]

CoRR, 2022

AutoCAT: Reinforcement Learning for Automated Exploration of Cache Timing-Channel Attacks.

[BibT_eX]

[DOI]

CoRR, 2022

Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems.

[BibT_eX]

[DOI]

CoRR, 2022

Deep Contrastive Learning is Provably (almost) Principal Component Analysis.

[BibT_eX]

[DOI]

CoRR, 2022

DreamShard: Generalizable Embedding Table Placement for Recommender Systems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Understanding Deep Contrastive Learning via Coordinate-wise Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AutoShard: Automated Embedding Table Sharding for Recommender Systems.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Denoised MDPs: Learning World Models Better Than the World Itself.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Multi-objective Optimization by Learning Space Partition.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Understanding Dimensional Collapse in Contrastive Self-supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

NASViT: Neural Architecture Search for Efficient Vision Transformers with Gradient Conflict aware Supernet Training.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Re3: Generating Longer Stories With Recursive Reprompting and Revision.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

On the Importance of Asymmetry for Siamese Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022

Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Learning Bounded Context-Free-Grammar via LSTM and the Transformer: Difference and the Explanations.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Q-gym: An Equality Saturation Framework for DNN Inference Exploiting Weight Repetition.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021

Planning in Learned Latent Action Spaces for Generalizable Legged Locomotion.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2021

Dynamic Graph Representation Learning via Graph Transformer Networks.

[BibT_eX]

[DOI]

Chun-cheng Jason Chen

CoRR, 2021

Towards Demystifying Representation Learning with Non-contrastive Self-supervision.

[BibT_eX]

[DOI]

CoRR, 2021

Multi-objective Optimization by Learning Space Partitions.

[BibT_eX]

[DOI]

CoRR, 2021

Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games.

[BibT_eX]

[DOI]

CoRR, 2021

Network planning with deep reinforcement learning.

[BibT_eX]

[DOI]

Hang Zhu

Varun Gupta

Satyajeet Singh Ahuja

Ying Zhang

Xin Jin

Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021., 2021

NovelD: A Simple yet Effective Exploration Criterion.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MADE: Exploration via Maximizing Deviation from Explored Regions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Space Partitions for Path Planning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Latent Execution for Neural Program Synthesis Beyond Domain-Specific Languages.

[BibT_eX]

[DOI]

Xinyun Chen

Dawn Song

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Few-Shot Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Understanding self-supervised learning dynamics without contrastive pairs.

[BibT_eX]

[DOI]

Xinlei Chen

Surya Ganguli

Proceedings of the 38th International Conference on Machine Learning, 2021

Learn-to-Share: A Hardware-friendly Transfer Learning Framework Exploiting Computation and Parameter Sharing.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

FP-NAS: Fast Probabilistic Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Understanding Robustness in Teacher-Student Setting: A New Perspective.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020

Enhancing Model Parallelism in Neural Architecture Search for Multidevice System.

[BibT_eX]

[DOI]

IEEE Micro, 2020

BeBold: Exploration Beyond the Boundary of Explored Regions.

[BibT_eX]

[DOI]

CoRR, 2020

Multi-Agent Collaboration via Reward Attribution Decomposition.

[BibT_eX]

[DOI]

CoRR, 2020

Understanding Self-supervised Learning with Dual Deep Networks.

[BibT_eX]

[DOI]

CoRR, 2020

Real-world Video Adaptation with Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Joint Policy Search for Multi-agent Collaboration with Imperfect Information.

[BibT_eX]

[DOI]

Qucheng Gong

Tina Jiang

CoRR, 2020

FBNetV3: Joint Architecture-Recipe Search using Neural Acquisition Function.

[BibT_eX]

[DOI]

CoRR, 2020

Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search.

[BibT_eX]

[DOI]

Linnan Wang

Rodrigo Fonseca

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Joint Policy Search for Multi-agent Collaboration with Imperfect Information.

[BibT_eX]

[DOI]

Qucheng Gong

Yu Jiang

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction.

[BibT_eX]

[DOI]

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Student Specialization in Deep Rectified Networks With Finite Width and Input Dimension.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Deep Symbolic Superoptimization Without Human Knowledge.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Architecture Search Using Deep Neural Networks and Monte Carlo Tree Search.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Over-parameterization as a Catalyst for Better Generalization of Deep ReLU network.

[BibT_eX]

[DOI]

CoRR, 2019

A Neural-based Program Decompiler.

[BibT_eX]

[DOI]

CoRR, 2019

Sample-Efficient Neural Architecture Search by Learning Action Space.

[BibT_eX]

[DOI]

CoRR, 2019

Luck Matters: Understanding Training Dynamics of Deep ReLU Networks.

[BibT_eX]

[DOI]

CoRR, 2019

AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search.

[BibT_eX]

[DOI]

CoRR, 2019

One ticket to win them all: generalizing lottery ticket initializations across datasets and optimizers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Hierarchical Decision Making by Generating and Following Natural Language Instructions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Coda: An End-to-End Neural Program Decompiler.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning to Perform Local Rewriting for Combinatorial Optimization.

[BibT_eX]

[DOI]

Xinyun Chen

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

ELF OpenGo: an analysis and open reimplementation of AlphaZero.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

M^3RL: Mind-aware Multi-agent Management Reinforcement Learning.

[BibT_eX]

[DOI]

Tianmin Shu

Proceedings of the 7th International Conference on Learning Representations, 2019

Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Bayesian Relational Memory for Semantic Visual Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Guest Editorial Special Issue on Deep/Reinforcement Learning and Games.

[BibT_eX]

[DOI]

IEEE Trans. Games, 2018

3D Interpreter Networks for Viewer-Centered Wireframe Modeling.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2018

Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search.

[BibT_eX]

[DOI]

CoRR, 2018

Learning to Progressively Plan.

[BibT_eX]

[DOI]

Xinyun Chen

CoRR, 2018

Learning and Planning with a Semantic Model.

[BibT_eX]

[DOI]

CoRR, 2018

A theoretical framework for deep locally connected ReLU network.

[BibT_eX]

[DOI]

CoRR, 2018

Algorithmic Framework for Model-based Reinforcement Learning with Theoretical Guarantees.

[BibT_eX]

[DOI]

CoRR, 2018

Channel-Recurrent Autoencoding for Image Modeling.

[BibT_eX]

[DOI]

Wenling Shang

Kihyuk Sohn

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Building Generalizable Agents with a Realistic and Rich 3D Environment.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

When is a Convolutional Filter Easy to Learn?

[BibT_eX]

[DOI]

Simon S. Du

Jason D. Lee

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

CoDraw: Visual Dialog for Collaborative Drawing.

[BibT_eX]

[DOI]

CoRR, 2017

Channel-Recurrent Variational Autoencoders.

[BibT_eX]

[DOI]

CoRR, 2017

ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Training Agent for First-Person Shooter Game with Actor-Critic Curriculum Learning.

[BibT_eX]

[DOI]

Yuxin Wu

Proceedings of the 5th International Conference on Learning Representations, 2017

Symmetry-Breaking Convergence Analysis of Certain Two-layered Neural Networks with ReLU nonlinearity.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Semantic Amodal Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Better Computer Go Player with Neural Network and Long-term Prediction.

[BibT_eX]

[DOI]

Yan Zhu

Proceedings of the 4th International Conference on Learning Representations, 2016

Single Image 3D Interpreter Network.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

2015

Theory and Practice of Hierarchical Data-driven Descent for Optimal Deformation Estimation.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2015

Simple Baseline for Visual Question Answering.

[BibT_eX]

[DOI]

CoRR, 2015

Scale-invariant learning and convolutional networks.

[BibT_eX]

[DOI]

CoRR, 2015

2013

Theory and Practice of Globally Optimal Deformation Estimation.

[BibT_eX]

[DOI]

PhD thesis, 2013

Hierarchical Data-Driven Descent for Efficient Optimal Deformation Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

Integrating Perceptual Learning with External World Knowledge in a Simulated Student.

[BibT_eX]

[DOI]

Proceedings of the Artificial Intelligence in Education - 16th International Conference, 2013

2012

Globally Optimal Estimation of Nonrigid Image Distortion.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2012

A Combined Theory of Defocused Illumination and Global Light Transport.

[BibT_eX]

[DOI]

Mohit Gupta

Li Zhang

Int. J. Comput. Vis., 2012

Learning from crowds in the presence of schools of thought.

[BibT_eX]

[DOI]

Jun Zhu

Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Exploring the Spatial Hierarchy of Mixture Models for Human Pose Estimation.

[BibT_eX]

[DOI]

C. Lawrence Zitnick

Proceedings of the Computer Vision - ECCV 2012, 2012

Depth from optical turbulence.

[BibT_eX]

[DOI]

Alan Van Nevel

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Rectification and 3D reconstruction of curved document images.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Local isomorphism to solve the pre-image problem in kernel methods.

[BibT_eX]

[DOI]

Dong Huang

Fernando De la Torre

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

A globally optimal data-driven approach for image distortion estimation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

Seeing through water: Image restoration using model-based tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

(De) focusing on global light transport for active scene recovery.

[BibT_eX]

[DOI]

Mohit Gupta