We stand with Ukraine

We stand with Ukraine

Zhihua Zhang

Orcid: 0000-0003-3165-5213

Affiliations:

Peking University, Center for Statistical Science, School of Mathematical Sciences, China

According to our database¹, Zhihua Zhang authored at least 108 papers between 2003 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2024

Fedpower: privacy-preserving distributed eigenspace estimation.

[BibT_eX]

[DOI]

,

,

,

,

Mach. Learn., December, 2024

Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Federated Control in Markov Decision Processes.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Near Minimax-Optimal Distributional Temporal Difference Algorithms and The Freedman Inequality in Hilbert Spaces.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Statistical Efficiency of Distributional Temporal Difference Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

Towards explicit superlinear convergence rate for SR1.

[BibT_eX]

[DOI]

,

,

,

Math. Program., May, 2023

Estimation and Inference in Distributional Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Stochastic Distributed Optimization under Average Second-order Similarity: Algorithms and Analysis.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Complete Asymptotic Analysis for Projected Stochastic Approximation and Debiased Variants.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 59th Annual Allerton Conference on Communication, 2023

A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning.

[BibT_eX]

[DOI]

,

,

,

,

Michael I. Jordan

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Explicit Convergence Rates of Greedy and Random Quasi-Newton Methods.

[BibT_eX]

[DOI]

,

,

J. Mach. Learn. Res., 2022

On the Convergence of Policy in Unregularized Policy Mirror Descent.

[BibT_eX]

[DOI]

,

CoRR, 2022

Global Convergence Analysis of Deep Linear Networks with A One-neuron Layer.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

On the landscape of one-hidden-layer sparse networks and beyond.

[BibT_eX]

[DOI]

,

,

Artif. Intell., 2022

Semi-infinitely Constrained Markov Decision Processes.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Personalized Federated Learning towards Communication Efficiency, Robustness and Fairness.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Asymptotic Behaviors of Projected Stochastic Approximation: A Jump Diffusion Perspective.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Non-local Convergence Analysis of Deep Linear Networks.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Machine Learning, 2022

Statistical Estimation and Online Inference via Local SGD.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022

Federated Reinforcement Learning with Environment Heterogeneity.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Accelerated Proximal Subsampled Newton Method.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Neural Networks Learn. Syst., 2021

Approximate Newton Methods.

[BibT_eX]

[DOI]

,

,

J. Mach. Learn. Res., 2021

Polyak-Ruppert Averaged Q-Leaning is Statistically Efficient.

[BibT_eX]

[DOI]

,

,

,

Michael I. Jordan

CoRR, 2021

Greedy and Random Broyden's Methods with Explicit Superlinear Convergence Rates in Nonlinear Equations.

[BibT_eX]

[DOI]

,

,

CoRR, 2021

Statistical Estimation and Inference via Local SGD in Federated Learning.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2021

Near Optimal Stochastic Algorithms for Finite-Sum Unbalanced Convex-Concave Minimax Optimization.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2021

Directional Convergence Analysis under Spherically Symmetric Distribution.

[BibT_eX]

[DOI]

,

CoRR, 2021

Meta-Regularization: An Approach to Adaptive Choice of the Learning Rate in Gradient Descent.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2021

Privacy-Preserving Distributed SVD via Federated Power.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2021

Delayed Projection Techniques for Linearly Constrained Problems: Convergence Rates, Acceleration, and Applications.

[BibT_eX]

[DOI]

,

CoRR, 2021

Greedy and Random Quasi-Newton Methods with Faster Explicit Superlinear Convergence.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Faster Directional Convergence of Linear Neural Networks under Spherically Symmetric Data.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Communication-Efficient Distributed SVD via Local Power Iterations.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Multi-split Reversible Transformers Can Enhance Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020

Nesterov's Acceleration for Approximate Newton.

[BibT_eX]

[DOI]

,

,

J. Mach. Learn. Res., 2020

Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Landscape of Sparse Linear Network: A Brief Investigation.

[BibT_eX]

[DOI]

,

,

CoRR, 2020

Optimal Quantization for Batch Normalization in Neural Network Deployments and Beyond.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2020

Lower Complexity Bounds for Finite-Sum Convex-Concave Minimax Optimization Problems.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

On the Convergence of FedAvg on Non-IID Data.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Active Learning Approaches to Enhancing Neural Machine Translation: An Empirical Study.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Efficient Spectrum-Revealing CUR Matrix Decomposition.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Do Subsampled Newton Methods Work for High-Dimensional Data?

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Fast stochastic second-order method logarithmic in condition number.

[BibT_eX]

[DOI]

,

,

,

Pattern Recognit., 2019

Robust Frequent Directions with Application in Online Learning.

[BibT_eX]

[DOI]

,

,

,

,

J. Mach. Learn. Res., 2019

Fast Generalized Matrix Regression with Applications in Machine Learning.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2019

Communication Efficient Decentralized Training with Multiple Local Updates.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2019

A Stochastic Proximal Point Algorithm for Saddle-Point Problems.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2019

A General Analysis Framework of Lower Complexity Bounds for Finite-Sum Optimization.

[BibT_eX]

[DOI]

,

,

CoRR, 2019

Towards Better Generalization: BP-SVRG in Training Deep Neural Networks.

[BibT_eX]

[DOI]

,

,

CoRR, 2019

A Unified Framework for Regularized Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

CoRR, 2019

A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Lipschitz Generative Adversarial Nets.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Interpolatron: Interpolation or Extrapolation Schemes to Accelerate Optimization for Deep Neural Networks.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2018

Sketched Follow-The-Regularized-Leader for Online Factorization Machine.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

2017

Fast Fisher discriminant analysis with randomized algorithms.

[BibT_eX]

[DOI]

,

,

,

Pattern Recognit., 2017

Nesterov's Acceleration For Approximate Newton.

[BibT_eX]

[DOI]

,

CoRR, 2017

A Unifying Framework for Convergence Analysis of Approximate Newton Methods.

[BibT_eX]

[DOI]

,

,

CoRR, 2017

Online Learning Via Regularized Frequent Directions.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2017

Approximate Newton Methods and Their Local Convergence.

[BibT_eX]

[DOI]

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

Communication Lower Bounds for Distributed Convex Optimization: Partition Data on Features.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Towards More Efficient SPSD Matrix Approximation and CUR Matrix Decomposition.

[BibT_eX]

[DOI]

,

,

J. Mach. Learn. Res., 2016

SPSD Matrix Approximation vis Column Selection: Theories, Algorithms, and Extensions.

[BibT_eX]

[DOI]

,

,

J. Mach. Learn. Res., 2016

Tighter bound of Sketched Generalized Matrix Approximation.

[BibT_eX]

[DOI]

,

,

CoRR, 2016

Revisiting Sub-sampled Newton Methods.

[BibT_eX]

[DOI]

,

,

CoRR, 2016

Variance-Reduced Second-Order Methods.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2016

Quasi-Newton Hamiltonian Monte Carlo.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Frequent Direction Algorithms for Approximate Matrix Multiplication with Applications in CCA.

[BibT_eX]

[DOI]

,

,

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015

Improved Analyses of the Randomized Power Method and Block Lanczos Method.

[BibT_eX]

[DOI]

,

,

CoRR, 2015

Towards More Efficient Nystrom Approximation and CUR Matrix Decomposition.

[BibT_eX]

[DOI]

,

,

CoRR, 2015

Distributed Multi-Armed Bandits: Regret vs. Communication.

[BibT_eX]

[DOI]

,

,

CoRR, 2015

A Parallel algorithm for $\mathcal{X}$-Armed bandits.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2015

Support Matrix Machines.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 32nd International Conference on Machine Learning, 2015

2014

Group Orbit Optimization: A Unified Approach to Data Normalization.

[BibT_eX]

[DOI]

,

,

CoRR, 2014

Adjusting Leverage Scores by Row Weighting: A Practical Approach to Coherent Matrix Completion.

[BibT_eX]

[DOI]

,

,

CoRR, 2014

The Modified Nystrom Method: Theories, Algorithms, and Extension.

[BibT_eX]

[DOI]

,

,

CoRR, 2014

Multicategory large margin classification methods: Hinge losses vs. coherence functions.

[BibT_eX]

[DOI]

,

,

,

,

Artif. Intell., 2014

2013

A Scalable Approach to Column-Based Low-Rank Matrix Approximation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IJCAI 2013, 2013

2012

EP-GIG Priors and Applications in Bayesian Sparse Learning.

[BibT_eX]

[DOI]

,

,

,

Michael I. Jordan

J. Mach. Learn. Res., 2012

An Autoregressive Approach to Nonparametric Hierarchical Dependent Modeling.

[BibT_eX]

[DOI]

,

,

Edward Y. Chang

Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Coherence functions with applications in large-margin classification methods.

[BibT_eX]

[DOI]

,

,

,

Michael I. Jordan

J. Mach. Learn. Res., 2012

Sublinear Algorithms for Penalized Logistic Regression in Massive Datasets.

[BibT_eX]

[DOI]

,

,

Edward Y. Chang

,

,

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

2011

Bayesian Generalized Kernel Mixed Models.

[BibT_eX]

[DOI]

,

,

Michael I. Jordan

J. Mach. Learn. Res., 2011

Generalized Latent Factor Models for Social Network Analysis.

[BibT_eX]

[DOI]

,

,

Proceedings of the IJCAI 2011, 2011

2010

A regularization framework for multiclass classification: A deterministic annealing approach.

[BibT_eX]

[DOI]

,

,

,

,

Frederick H. Lochovsky

Pattern Recognit., 2010

Regularized Discriminant Analysis, Ridge Regression and Beyond.

[BibT_eX]

[DOI]

,

,

,

Michael I. Jordan

J. Mach. Learn. Res., 2010

Bayesian Generalized Kernel Models.

[BibT_eX]

[DOI]

,

,

,

Michael I. Jordan

Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Matrix-Variate Dirichlet Process Mixture Models.

[BibT_eX]

[DOI]

,

,

Michael I. Jordan

Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

2009

Coherence Functions for Multicategory Margin-based Classification Methods.

[BibT_eX]

[DOI]

,

Michael I. Jordan

,

,

Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Latent Variable Models for Dimensionality Reduction.

[BibT_eX]

[DOI]

,

Michael I. Jordan

Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Latent Wishart Processes for Relational Kernel Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

A Flexible and Efficient Algorithm for Regularized Fisher Discriminant Analysis.

[BibT_eX]

[DOI]

,

,

Michael I. Jordan

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Probabilistic Relational PCA.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

2008

Posterior Consistency of the Silverman g-prior in Bayesian Model Choice.

[BibT_eX]

[DOI]

,

Michael I. Jordan

,

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

2007

Semiparametric Regression Using Student t Processes.

[BibT_eX]

[DOI]

,

,

Edward Y. Chang

IEEE Trans. Neural Networks, 2007

Surrogate maximization/minimization algorithms and extensions.

[BibT_eX]

[DOI]

,

,

Mach. Learn., 2007

2006

Model-based transductive learning of the kernel matrix.

[BibT_eX]

[DOI]

,

,

Mach. Learn., 2006

Bayesian Multicategory Support Vector Machines.

[BibT_eX]

[DOI]

,

Michael I. Jordan

Proceedings of the UAI '06, 2006

Adaptive non-linear clustering in data streams.

[BibT_eX]

[DOI]

,

,

Edward Y. Chang

Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

2005

Kronecker Factorization for Speeding up Kernel Machines.

[BibT_eX]

[DOI]

,

,

Edward Y. Chang

Proceedings of the 2005 SIAM International Conference on Data Mining, 2005

Learning with non-metric proximity matrices.

[BibT_eX]

[DOI]

,

Edward Y. Chang

,

Proceedings of the 13th ACM International Conference on Multimedia, 2005

A Bernoulli Relational Model for Nonlinear Embedding.

[BibT_eX]

[DOI]

,

,

,

Frederick H. Lochovsky

Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Annealed Discriminant Analysis.

[BibT_eX]

[DOI]

,

,

Frederick H. Lochovsky

Proceedings of the Machine Learning: ECML 2005, 2005

2004

Bayesian inference for transductive learning of kernel matrix using the Tanner-Wong data augmentation algorithm.

[BibT_eX]

[DOI]

,

,

Proceedings of the Machine Learning, 2004

Surrogate maximization/minimization algorithms for AdaBoost and the logistic regression model.

[BibT_eX]

[DOI]

,

,

Proceedings of the Machine Learning, 2004

Bayesian Inference on Principal Component Analysis Using Reversible Jump Markov Chain Monte Carlo.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

2003

Parametric Distance Metric Learning with Label Information.

[BibT_eX]

[DOI]

,

,

Proceedings of the IJCAI-03, 2003

Loading...