Wei Chen

CoRR, 2022

Does Momentum Change the Implicit Regularization on Separable Data?

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Availability Attacks Create Shortcuts.

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

SE(3) Equivariant Graph Neural Networks with Complete Local Frames.

Proceedings of the International Conference on Machine Learning, 2022

Gradient Information Matters in Policy Optimization by Back-propagating through Model.

Proceedings of the Tenth International Conference on Learning Representations, 2022

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Two Coupled Rejection Metrics Can Tell Adversarial Examples Apart.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Certified Robustness to Word Substitution Ranking Attack for Neural Ranking Models.

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021

Interpreting the Basis Path Set in Neural Networks.

J. Syst. Sci. Complex., 2021

Indiscriminate Poisoning Attacks Are Shortcuts.

CoRR, 2021

Equivariant vector field network for many-body system modeling.

CoRR, 2021

Optimizing Information-theoretical Generalization Bounds via Anisotropic Noise in SGLD.

CoRR, 2021

Momentum Doesn't Change the Implicit Bias.

CoRR, 2021

Causally Invariant Predictor with Shift-Robustness.

CoRR, 2021

Regularized OFU: an Efficient UCB Estimator for Non-linear Contextual Bandit.

CoRR, 2021

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive Prior.

CoRR, 2021

Machine-Learning Non-Conservative Dynamics for New-Physics Detection.

CoRR, 2021

Adversarial Training with Rectified Rejection.

CoRR, 2021

Combinatorial Pure Exploration with Bottleneck Reward Function and its Extension to General Reward Functions.

Yuko Kuroki

CoRR, 2021

Towards Accelerating Training of Batch Normalization: A Manifold Perspective.

CoRR, 2021

Path-BN: Towards effective batch normalization in the Path Space for ReLU networks.

Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Optimizing Information-theoretical Generalization Bound via Anisotropic Noise of SGLD.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Recovering Latent Causal Factor for Generalization to Distributional Shifts.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Causal Semantic Representation for Out-of-Distribution Prediction.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

R-Drop: Regularized Dropout for Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Large Scale Private Learning via Low-rank Reparametrization.

Proceedings of the 38th International Conference on Machine Learning, 2021

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks.

Proceedings of the 38th International Conference on Machine Learning, 2021

Do not Let Privacy Overbill Utility: Gradient Embedding Perturbation for Private Learning.

Proceedings of the 9th International Conference on Learning Representations, 2021

How Does Data Augmentation Affect Privacy in Machine Learning?

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Combinatorial Pure Exploration with Full-Bandit or Partial Linear Feedback.

Yuko Kuroki

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Convergence of Distributed Stochastic Variance Reduced Methods Without Sampling Extra Data.

IEEE Trans. Signal Process., 2020

Target transfer Q-learning and its convergence analysis.

Neurocomputing, 2020

Identifying Invariant Texture Violation for Robust Deepfake Detection.

Xinwei Sun

Botong Wu

CoRR, 2020

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks.

Bohan Wang

Qi Meng

CoRR, 2020

Latent Causal Invariant Model.

CoRR, 2020

Learning Causal Semantic Representation for Out-of-Distribution Prediction.

CoRR, 2020

Membership Inference with Privately Augmented Data Endorses the Benign while Suppresses the Adversary.

CoRR, 2020

Dynamic of Stochastic Gradient Descent with State-Dependent Noise.

CoRR, 2020

Combinatorial Pure Exploration with Partial or Full-Bandit Linear Feedback.

Yuko Kuroki

CoRR, 2020

Combinatorial Semi-Bandit in the Non-Stationary Environment.

CoRR, 2020

Gradient Perturbation is Underrated for Differentially Private Convex Optimization.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Reinforcement Learning with Dynamic Boltzmann Softmax Updates.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

I4R: Promoting Deep Reinforcement Learning by the Indicator for Expressive Representations.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

(Locally) Differentially Private Combinatorial Semi-Bandits.

Proceedings of the 37th International Conference on Machine Learning, 2020

Combinatorial Pure Exploration for Dueling Bandit.

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Convergence analysis of distributed stochastic gradient descent with shuffling.

Neurocomputing, 2019

OptQuant: Distributed training of neural networks with optimized quantization mechanisms.

Neurocomputing, 2019

Training Over-parameterized Deep ResNet Is almost as Easy as Training a Two-layer Network.

CoRR, 2019

Reinforcement Learning with Dynamic Boltzmann Softmax Updates.

CoRR, 2019

Positively Scale-Invariant Flatness of ReLU Neural Networks.

CoRR, 2019

BN-invariant Sharpness Regularizes the Training Model to Better Generalization.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space.

Proceedings of the 7th International Conference on Learning Representations, 2019

Capacity Control of ReLU Neural Networks by Basis-Path Norm.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Target Transfer Q-Learning and Its Convergence Analysis.

CoRR, 2018

Train Feedfoward Neural Network with Layer-wise Adaptive Rate via Approximating Back-matching Propagation.

Huishuai Zhang

CoRR, 2018

Optimizing Neural Networks in the Equivalent Class Space.

CoRR, 2018

On the Local Hessian in Back-propagation.

Huishuai Zhang

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Differential Equations for Modeling Asynchronous Algorithms.

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Towards Binary-Valued Gates for Robust LSTM Training.

Proceedings of the 35th International Conference on Machine Learning, 2018

Slim-DP: A Multi-Agent System for Communication-Efficient Distributed Deep Learning.

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

2017

Slim-DP: A Light Communication Data Parallelism for DNN.

CoRR, 2017

Distributed Machine Learning: Foundations, Trends, and Practices.

Taifeng Wang

Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Ensemble-Compression: A New Method for Parallel Training of Deep Neural Networks.

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

Finite sample analysis of the GTD Policy Evaluation Algorithms in Markov Setting.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

LightGBM: A Highly Efficient Gradient Boosting Decision Tree.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Efficient Inexact Proximal Gradient Algorithm for Nonconvex Problems.

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Asynchronous Stochastic Gradient Descent with Delay Compensation.

Proceedings of the 34th International Conference on Machine Learning, 2017

Dual Supervised Learning.

Proceedings of the 34th International Conference on Machine Learning, 2017

Dynamic Group Behavior Analysis and Its Application in Network Abnormal Behavior Detection.

Proceedings of the Communications and Networking, 2017

Generalization Error Bounds for Optimization Algorithms via Stability.

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Asynchronous Stochastic Proximal Optimization Algorithms with Variance Reduction.

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Asynchronous Stochastic Gradient Descent with Delay Compensation for Distributed Deep Learning.

CoRR, 2016

A Communication-Efficient Parallel Algorithm for Decision Tree.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Asynchronous Accelerated Stochastic Gradient Descent.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

On the Depth of Deep Neural Networks: A Theoretical View.

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Large Margin Deep Neural Networks: Theory and Algorithms.

CoRR, 2015

Mechanism Learning with Mechanism Induced Data.

Tao Qin

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Generalization Analysis for Game-Theoretic Machine Learning.

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Sponsored Search Auctions: Recent Advances and Future Directions.

Tao Qin

ACM Trans. Intell. Syst. Technol., 2014

Generalization Analysis for Game-Theoretic Machine Learning.

CoRR, 2014

Sampling dilemma: towards effective data sampling for click prediction in sponsored search.

Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Generalized second price auction with probabilistic broad match.

Proceedings of the ACM Conference on Economics and Computation, 2014

Agent Behavior Prediction and Its Generalization Analysis.

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Online learning for auction mechanism in bandit setting.

Decis. Support Syst., 2013

A Theoretical Analysis of NDCG Type Ranking Measures.

CoRR, 2013

A Game-Theoretic Machine Learning Approach for Revenue Maximization in Sponsored Search.

Proceedings of the IJCAI 2013, 2013

2012

Convergence Analysis for Weighted Joint Strategy Fictitious Play in Generalized Second Price Auction.

Lei Yao

Proceedings of the Internet and Network Economics - 8th International Workshop, 2012

2010

Two-Layer Generalization Analysis for Ranking Using Rademacher Average.
