Kun Yuan

Orcid: 0000-0001-8394-8187

Affiliations:
  • Peking University, Center for Machine Learning Research, Beijing, China
  • Alibaba Group, Decision Intelligence Lab, Bellevue, WA, USA (2019 - 2022)
  • University of California Los Angeles, Department of Electrical and Computer Engineering, Los Angeles, CA, USA (PhD 2019)
  • University of Science and Technology of China, Department of Automation, Hefei, China (until 2014)


According to our database1, Kun Yuan authored at least 99 papers between 2004 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
BEVHeight++: Toward Robust Visual Centric 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2025

TAH-QUANT: Effective Activation Quantization in Pipeline Parallelism over Slow Network.
CoRR, June, 2025

Optimal Complexity in Byzantine-Robust Distributed Stochastic Optimization with Data Heterogeneity.
CoRR, March, 2025

A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models.
CoRR, February, 2025

CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models.
CoRR, February, 2025

Understanding the Influence of Digraphs on Decentralized Optimization: Effective Metrics, Lower Bound, and Optimal Algorithm.
SIAM J. Optim., 2025

Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
S$^\text{3}$Attention: Improving Long Sequence Attention With Smoothed Skeleton Sketching.
IEEE J. Sel. Top. Signal Process., September, 2024

An enhanced gradient-tracking bound for distributed online stochastic convex optimization.
Signal Process., April, 2024

Gradient Normalization with(out) Clipping Ensures Convergence of Nonconvex SGD under Heavy-Tailed Noise with Improved Results.
CoRR, 2024

Subspace Optimization for Large Language Models with Convergence Guarantees.
CoRR, 2024

S<sup>3</sup>Attention: Improving Long Sequence Attention with Smoothed Skeleton Sketching.
CoRR, 2024

On the Trade-off between Flatness and Optimization in Distributed Learning.
CoRR, 2024

Decentralized Bilevel Optimization over Graphs: Loopless Algorithmic Update and Transient Iteration Complexity.
CoRR, 2024

SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Distributed Bilevel Optimization with Communication Compression.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Momentum Benefits Non-iid Federated Learning Simply and Provably.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Asynchronous Diffusion Learning with Agent Subsampling and Local Updates.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Removing Data Heterogeneity Influence Enhances Network Topology Dependence of Decentralized SGD.
J. Mach. Learn. Res., 2023

Model-free Test Time Adaptation for Out-Of-Distribution Detection.
CoRR, 2023

RandCom: Random Communication Skipping Method for Decentralized Stochastic Optimization.
CoRR, 2023

BEVHeight++: Toward Robust Visual Centric 3D Object Detection.
CoRR, 2023

Momentum Benefits Non-IID Federated Learning Simply and Provably.
CoRR, 2023

Lower Bounds and Accelerated Algorithms in Distributed Stochastic Optimization with Communication Compression.
CoRR, 2023

Unbiased Compression Saves Communication in Distributed Optimization: When and How Much?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation.
Proceedings of the International Conference on Machine Learning, 2023

DSGD-CECA: Decentralized SGD with Communication-Optimal Exact Consensus Algorithm.
Proceedings of the International Conference on Machine Learning, 2023

BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Achieving Linear Speedup with Network-Independent Learning Rates in Decentralized Stochastic Optimization.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

On the Performance of Gradient Tracking with Local Updates.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

2022
A Unified and Refined Convergence Analysis for Non-Convex Decentralized Learning.
IEEE Trans. Signal Process., 2022

Optimal Complexity in Non-Convex Decentralized Learning over Time-Varying Networks.
CoRR, 2022

Heavy-Tail Phenomenon in Decentralized SGD.
CoRR, 2022

Revisiting Optimal Convergence Rate for Smooth and Non-convex Stochastic Decentralized Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Communication-Efficient Topologies for Decentralized Learning with $O(1)$ Consensus Rate.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Lower Bounds and Nearly Optimal Algorithms in Distributed Learning with Communication Compression.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Effective Model Sparsification by Scheduled Grow-and-Prune Methods.
Proceedings of the Tenth International Conference on Learning Representations, 2022

A Byzantine-Resilient Dual Subgradient Method for Vertical Federated Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

CHEX: CHannel EXploration for CNN Model Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Multiagent Fully Decentralized Value Function Learning With Linear Convergence Rates.
IEEE Trans. Autom. Control., 2021

Decentralized Proximal Gradient Algorithms With Linear Convergence Rates.
IEEE Trans. Autom. Control., 2021

BlueFog: Make Decentralized Algorithms Practical for Optimization and Deep Learning.
CoRR, 2021

Decentralized Composite Optimization with Compression.
CoRR, 2021

Removing Data Heterogeneity Influence Enhances Network Topology Dependence of Decentralized SGD.
CoRR, 2021

On the Comparison between Cyclic Sampling and Random Reshuffling.
CoRR, 2021

Exponential Graph is Provably Efficient for Decentralized Deep Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

An Improved Analysis and Rates for Variance Reduction under Without-replacement Sampling Orders.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Accelerating Gossip SGD with Periodic Global Averaging.
Proceedings of the 38th International Conference on Machine Learning, 2021

DecentLaM: Decentralized Momentum SGD for Large-batch Deep Training.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Can Primal Methods Outperform Primal-Dual Methods in Decentralized Dynamic Optimization?
IEEE Trans. Signal Process., 2020

On the Influence of Bias-Correction on Distributed Stochastic Optimization.
IEEE Trans. Signal Process., 2020

Variance-Reduced Stochastic Learning Under Random Reshuffling.
IEEE Trans. Signal Process., 2020

Walkman: A Communication-Efficient Random-Walk Algorithm for Decentralized Optimization.
IEEE Trans. Signal Process., 2020

A Proximal Diffusion Strategy for Multiagent Optimization With Sparse Affine Constraints.
IEEE Trans. Autom. Control., 2020

2019
Exact Diffusion for Distributed Optimization and Learning - Part II: Convergence Analysis.
IEEE Trans. Signal Process., 2019

Exact Diffusion for Distributed Optimization and Learning - Part I: Algorithm Development.
IEEE Trans. Signal Process., 2019

Variance-Reduced Stochastic Learning by Networked Agents Under Random Reshuffling.
IEEE Trans. Signal Process., 2019

Stochastic Learning Under Random Reshuffling With Constant Step-Sizes.
IEEE Trans. Signal Process., 2019

Supervised Learning Under Distributed Features.
IEEE Trans. Signal Process., 2019

Dynamic Average Diffusion With Randomized Coordinate Updates.
IEEE Trans. Signal Inf. Process. over Networks, 2019

ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems and GANs.
CoRR, 2019

A Linearly Convergent Proximal Gradient Algorithm for Decentralized Optimization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

COVER: A Cluster-based Variance Reduced Method for Online Learning.
Proceedings of the IEEE International Conference on Acoustics, 2019

Distributed Value-Function Learning with Linear Convergence Rates.
Proceedings of the 17th European Control Conference, 2019

On the Performance of Exact Diffusion over Adaptive Networks.
Proceedings of the 58th IEEE Conference on Decision and Control, 2019

Decentralized Dynamic ADMM with Quantized and Censored Communications.
Proceedings of the 53rd Asilomar Conference on Signals, Systems, and Computers, 2019

On the Comparison between Primal and Primal-dual Methods in Decentralized Dynamic Optimization.
Proceedings of the 53rd Asilomar Conference on Signals, Systems, and Computers, 2019

2018
Decentralized Consensus Optimization With Asynchrony and Delays.
IEEE Trans. Signal Inf. Process. over Networks, 2018

Multi-Agent Fully Decentralized Off-Policy Learning with Linear Convergence Rates.
CoRR, 2018

Learning Under Distributed Features.
CoRR, 2018

A Communication-Efficient Random-Walk Algorithm for Decentralized Optimization.
CoRR, 2018

Stochastic Learning under Random Reshuffling.
CoRR, 2018

Convergence of Variance-Reduced Learning Under Random Reshuffling.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Efficient Variance-Reduced Learning Over Multi-Agent Networks.
Proceedings of the 26th European Signal Processing Conference, 2018

An Exponentially Convergent Algorithm for Learning Under Distributed Features.
Proceedings of the 2018 IEEE Data Science Workshop, 2018

Dual Coupled Diffusion for Distributed Optimization with Affine Constraints.
Proceedings of the 57th IEEE Conference on Decision and Control, 2018

2017
Efficient Variance-Reduced Learning for Fully Decentralized On-Device Intelligence.
CoRR, 2017

Convergence of Variance-Reduced Stochastic Learning under Random Reshuffling.
CoRR, 2017

On the performance of random reshuffling in stochastic learning.
Proceedings of the 2017 Information Theory and Applications Workshop, 2017

Exact diffusion strategy for optimization by networked agents.
Proceedings of the 25th European Signal Processing Conference, 2017

Decentralized exact coupled optimization.
Proceedings of the 55th Annual Allerton Conference on Communication, 2017

2016
On the Convergence of Decentralized Gradient Descent.
SIAM J. Optim., 2016

On the Influence of Momentum Acceleration on Online Learning.
J. Mach. Learn. Res., 2016

Cooperative tracking for nonlinear multi-agent systems with hybrid time-delayed protocol.
Neurocomputing, 2016

Stochastic gradient descent with finite samples sizes.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Online dual coordinate ascent learning.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Communication-Efficient Decentralized Event Monitoring in Wireless Sensor Networks.
IEEE Trans. Parallel Distributed Syst., 2015

A decentralised linear programming approach to energy-efficient event detection.
Int. J. Sens. Networks, 2015

2014
On the Linear Convergence of the ADMM in Decentralized Consensus Optimization.
IEEE Trans. Signal Process., 2014

Partial synchronization of the distributed parameter system with time delay via fuzzy control.
IMA J. Math. Control. Inf., 2014

2013
Linearly convergent decentralized consensus optimization with the alternating direction method of multipliers.
Proceedings of the IEEE International Conference on Acoustics, 2013

A linearized bregman algorithm for decentralized basis pursuit.
Proceedings of the 21st European Signal Processing Conference, 2013

2012
Synchronization of Coupled Networks with Mixed Delays by Intermittent Control.
J. Appl. Math., 2012

2008
Robust Stabilization of the Distributed Parameter System With Time Delay via Fuzzy Control.
IEEE Trans. Fuzzy Syst., 2008

2006
Robust Stability of Switched Cohen-Grossberg Neural Networks With Mixed Time-Varying Delays.
IEEE Trans. Syst. Man Cybern. Part B, 2006

Global Asymptotical Stability of Recurrent Neural Networks With Multiple Discrete Delays and Distributed Delays.
IEEE Trans. Neural Networks, 2006

Exponential stability and periodic solutions of fuzzy cellular neural networks with time-varying delays.
Neurocomputing, 2006

2005
An analysis of global asymptotic stability of delayed Cohen-Grossberg neural networks via nonsmooth analysis.
IEEE Trans. Circuits Syst. I Regul. Pap., 2005

2004
Global Exponential Stability of Cohen-Grossberg Neural Networks with Multiple Time-Varying Delays.
Proceedings of the Advances in Neural Networks, 2004


  Loading...