Wei Chen

Orcid: 0000-0002-7438-5180

Affiliations:
  • Chinese Academy of Sciences, Institute Of Computing Technology, Beijing, China (since 2022)
  • Microsoft Research Asia, Machine Learning Group, Beijing, China (2011-2020)
  • Chinese Academy of Sciences, Academy of Mathematics and System Science, Beijing, China (PhD)
  • University of Science and Technology of China, Hefei, Anhui, China


According to our database1, Wei Chen authored at least 111 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
On the Convergence of Adam under Non-uniform Smoothness: Separability from SGDM and Beyond.
CoRR, 2024

Listwise Generative Retrieval Models via a Sequential Learning Process.
CoRR, 2024

A Unified Causal View of Instruction Tuning.
CoRR, 2024

Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Incorporating NODE with pre-trained neural differential operator for learning dynamics.
Neurocomputing, April, 2023

CAME: Competitively Learning a Mixture-of-Experts Model for First-stage Retrieval.
CoRR, 2023

On the Robustness of Generative Retrieval Models: An Out-of-Distribution Perspective.
CoRR, 2023

Power-law Dynamic arising from machine learning.
CoRR, 2023

When and Why Momentum Accelerates SGD: An Empirical Study.
CoRR, 2023

Topic-oriented Adversarial Attacks against Black-box Neural Ranking Models.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Closing the gap between the upper bound and lower bound of Adam's iteration complexity.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Collaborative Pure Exploration in Kernel Bandit.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Convergence of AdaGrad for Non-convex Objectives: Simple Proofs and Relaxed Assumptions.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Black-box Adversarial Attacks against Dense Retrieval Models: A Multi-view Contrastive Learning Method.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Inducing Causal Structure for Abstractive Text Summarization.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Continual Learning for Generative Retrieval over Dynamic Corpora.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
Stabilize deep ResNet with a sharp scaling factor τ.
Mach. Learn., 2022

Constructing the Basis Path Set by Eliminating the Path Dependency.
J. Syst. Sci. Complex., 2022

Provable Adaptivity in Adam.
CoRR, 2022

Normalized/Clipped SGD with Perturbation for Differentially Private Non-Convex Optimization.
CoRR, 2022

Neural Operator with Regularity Structure for Modeling Dynamics Driven by SPDEs.
CoRR, 2022

Branching Reinforcement Learning.
CoRR, 2022

Does Momentum Change the Implicit Regularization on Separable Data?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Availability Attacks Create Shortcuts.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

SE(3) Equivariant Graph Neural Networks with Complete Local Frames.
Proceedings of the International Conference on Machine Learning, 2022

Gradient Information Matters in Policy Optimization by Back-propagating through Model.
Proceedings of the Tenth International Conference on Learning Representations, 2022

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Two Coupled Rejection Metrics Can Tell Adversarial Examples Apart.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Certified Robustness to Word Substitution Ranking Attack for Neural Ranking Models.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Interpreting the Basis Path Set in Neural Networks.
J. Syst. Sci. Complex., 2021

Indiscriminate Poisoning Attacks Are Shortcuts.
CoRR, 2021

Equivariant vector field network for many-body system modeling.
CoRR, 2021

Optimizing Information-theoretical Generalization Bounds via Anisotropic Noise in SGLD.
CoRR, 2021

Momentum Doesn't Change the Implicit Bias.
CoRR, 2021

Causally Invariant Predictor with Shift-Robustness.
CoRR, 2021

Regularized OFU: an Efficient UCB Estimator forNon-linear Contextual Bandit.
CoRR, 2021

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive Prior.
CoRR, 2021

Machine-Learning Non-Conservative Dynamics for New-Physics Detection.
CoRR, 2021

Adversarial Training with Rectified Rejection.
CoRR, 2021

Combinatorial Pure Exploration with Bottleneck Reward Function and its Extension to General Reward Functions.
CoRR, 2021

Towards Accelerating Training of Batch Normalization: A Manifold Perspective.
CoRR, 2021

Path-BN: Towards effective batch normalization in the Path Space for ReLU networks.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Optimizing Information-theoretical Generalization Bound via Anisotropic Noise of SGLD.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Recovering Latent Causal Factor for Generalization to Distributional Shifts.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Causal Semantic Representation for Out-of-Distribution Prediction.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

R-Drop: Regularized Dropout for Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Large Scale Private Learning via Low-rank Reparametrization.
Proceedings of the 38th International Conference on Machine Learning, 2021

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks.
Proceedings of the 38th International Conference on Machine Learning, 2021

Do not Let Privacy Overbill Utility: Gradient Embedding Perturbation for Private Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

How Does Data Augmentation Affect Privacy in Machine Learning?
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Combinatorial Pure Exploration with Full-Bandit or Partial Linear Feedback.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Convergence of Distributed Stochastic Variance Reduced Methods Without Sampling Extra Data.
IEEE Trans. Signal Process., 2020

Target transfer Q-learning and its convergence analysis.
Neurocomputing, 2020

Identifying Invariant Texture Violation for Robust Deepfake Detection.
CoRR, 2020

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks.
CoRR, 2020

Latent Causal Invariant Model.
CoRR, 2020

Learning Causal Semantic Representation for Out-of-Distribution Prediction.
CoRR, 2020

Membership Inference with Privately Augmented Data Endorses the Benign while Suppresses the Adversary.
CoRR, 2020

Dynamic of Stochastic Gradient Descent with State-Dependent Noise.
CoRR, 2020

Combinatorial Pure Exploration with Partial or Full-Bandit Linear Feedback.
CoRR, 2020

Combinatorial Semi-Bandit in the Non-Stationary Environment.
CoRR, 2020

Gradient Perturbation is Underrated for Differentially Private Convex Optimization.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Reinforcement Learning with Dynamic Boltzmann Softmax Updates.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

I4R: Promoting Deep Reinforcement Learning by the Indicator for Expressive Representations.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

(Locally) Differentially Private Combinatorial Semi-Bandits.
Proceedings of the 37th International Conference on Machine Learning, 2020

Combinatorial Pure Exploration for Dueling Bandit.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Convergence analysis of distributed stochastic gradient descent with shuffling.
Neurocomputing, 2019

OptQuant: Distributed training of neural networks with optimized quantization mechanisms.
Neurocomputing, 2019

Training Over-parameterized Deep ResNet Is almost as Easy as Training a Two-layer Network.
CoRR, 2019

Reinforcement Learning with Dynamic Boltzmann Softmax Updates.
CoRR, 2019

Positively Scale-Invariant Flatness of ReLU Neural Networks.
CoRR, 2019

BN-invariant Sharpness Regularizes the Training Model to Better Generalization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space.
Proceedings of the 7th International Conference on Learning Representations, 2019

Capacity Control of ReLU Neural Networks by Basis-Path Norm.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Target Transfer Q-Learning and Its Convergence Analysis.
CoRR, 2018

Train Feedfoward Neural Network with Layer-wise Adaptive Rate via Approximating Back-matching Propagation.
CoRR, 2018

Optimizing Neural Networks in the Equivalent Class Space.
CoRR, 2018

On the Local Hessian in Back-propagation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Differential Equations for Modeling Asynchronous Algorithms.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Towards Binary-Valued Gates for Robust LSTM Training.
Proceedings of the 35th International Conference on Machine Learning, 2018

Slim-DP: A Multi-Agent System for Communication-Efficient Distributed Deep Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

2017
Slim-DP: A Light Communication Data Parallelism for DNN.
CoRR, 2017

Distributed Machine Learning: Foundations, Trends, and Practices.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Ensemble-Compression: A New Method for Parallel Training of Deep Neural Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

Finite sample analysis of the GTD Policy Evaluation Algorithms in Markov Setting.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

LightGBM: A Highly Efficient Gradient Boosting Decision Tree.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Efficient Inexact Proximal Gradient Algorithm for Nonconvex Problems.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Asynchronous Stochastic Gradient Descent with Delay Compensation.
Proceedings of the 34th International Conference on Machine Learning, 2017

Dual Supervised Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Dynamic Group Behavior Analysis and Its Application in Network Abnormal Behavior Detection.
Proceedings of the Communications and Networking, 2017

Generalization Error Bounds for Optimization Algorithms via Stability.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Asynchronous Stochastic Proximal Optimization Algorithms with Variance Reduction.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Asynchronous Stochastic Gradient Descent with Delay Compensation for Distributed Deep Learning.
CoRR, 2016

A Communication-Efficient Parallel Algorithm for Decision Tree.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Asynchronous Accelerated Stochastic Gradient Descent.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

On the Depth of Deep Neural Networks: A Theoretical View.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Large Margin Deep Neural Networks: Theory and Algorithms.
CoRR, 2015

Mechanism Learning with Mechanism Induced Data.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Generalization Analysis for Game-Theoretic Machine Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Sponsored Search Auctions: Recent Advances and Future Directions.
ACM Trans. Intell. Syst. Technol., 2014

Generalization Analysis for Game-Theoretic Machine Learning.
CoRR, 2014

Sampling dilemma: towards effective data sampling for click prediction in sponsored search.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Generalized second price auction with probabilistic broad match.
Proceedings of the ACM Conference on Economics and Computation, 2014

Agent Behavior Prediction and Its Generalization Analysis.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Online learning for auction mechanism in bandit setting.
Decis. Support Syst., 2013

A Theoretical Analysis of NDCG Type Ranking Measures
CoRR, 2013

A Game-Theoretic Machine Learning Approach for Revenue Maximization in Sponsored Search.
Proceedings of the IJCAI 2013, 2013

2012
Convergence Analysis for Weighted Joint Strategy Fictitious Play in Generalized Second Price Auction.
Proceedings of the Internet and Network Economics - 8th International Workshop, 2012

2010
Two-Layer Generalization Analysis for Ranking Using Rademacher Average.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

2009
Ranking Measures and Loss Functions in Learning to Rank.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009


  Loading...