Kaiwen Zhou

Orcid: 0000-0002-3088-8085

Affiliations:
  • Huawei Noah's Ark Lab, Shenzhen, China
  • Chinese University of Hong Kong, Department of Computer Science and Engineering, Hong Kong (PhD 2022)


According to our database1, Kaiwen Zhou authored at least 56 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
EE-MCP: Self-Evolving MCP-GUI Agents via Automated Environment Generation and Experience Learning.
CoRR, April, 2026

A Unified Perspective on Adversarial Membership Manipulation in Vision Models.
CoRR, April, 2026

HATS: Hardness-Aware Trajectory Synthesis for GUI Agents.
CoRR, March, 2026

A²Flow: Automating Agentic Workflow Generation via Self-Adaptive Abstraction Operators.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Boosting Cross-problem Generalization in Diffusion-Based Neural Combinatorial Solver via Inference Time Adaptation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
HiconAgent: History Context-aware Policy Optimization for GUI Agents.
CoRR, December, 2025

Consistency Flow Model Achieves One-step Denoising Error Correction Codes.
CoRR, December, 2025

A<sup>2</sup>Flow: Automating Agentic Workflow Generation via Self-Adaptive Abstraction Operators.
CoRR, November, 2025

More than A Point: Capturing Uncertainty with Adaptive Affordance Heatmaps for Spatial Grounding in Robotic Tasks.
CoRR, October, 2025

Uncertainty-Aware GUI Agent: Adaptive Perception through Component Recommendation and Human-in-the-Loop Refinement.
CoRR, August, 2025

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills.
CoRR, June, 2025

GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents.
CoRR, May, 2025

VideoAgent2: Enhancing the LLM-Based Agent System for Long-Form Video Understanding by Uncertainty-Aware CoT.
CoRR, April, 2025

Generative Models in Decision Making: A Survey.
CoRR, February, 2025

Boosting Generalization in Diffusion-Based Neural Combinatorial Solver via Energy-guided Sampling.
CoRR, February, 2025

FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers.
CoRR, January, 2025

MESH - Understanding Videos Like Human: Measuring Hallucinations in Large Video Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Spa-Bench: a comprehensive Benchmark for Smartphone Agent Evaluation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

FALCON: Resolving Visual Redundancy and Fragmentation in High-Resolution Multimodal Large Language Models via Visual Registers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Less is More: Empowering GUI Agent with Context-Aware Simplification.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Efficient private SCO for heavy-tailed data via averaged clipping.
Mach. Learn., December, 2024

Beyond Pixels: Text Enhances Generalization in Real-World Image Restoration.
CoRR, 2024

HORSE: Hierarchical Representation for Large-Scale Neural Subset Selection.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Enhancing Neural Subset Selection: Integrating Background Information into Set Representations.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Enhancing Evolving Domain Generalization through Dynamic Latent Representations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Positional Information Matters for Invariant In-Context Learning: A Case Study of Simple Function Classes.
CoRR, 2023

Towards Understanding Feature Learning in Out-of-Distribution Generalization.
CoRR, 2023

Understanding and Improving Feature Learning for Out-of-Distribution Generalization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Does Invariant Graph Learning via Environment Augmentation Learn Invariance?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pareto Invariant Risk Minimization: Towards Mitigating the Optimization Dilemma in Out-of-Distribution Generalization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Novel Extrapolation Technique to Accelerate WMMSE.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Efficient Private SCO for Heavy-Tailed Data via Clipping.
CoRR, 2022

Pareto Invariant Risk Minimization.
CoRR, 2022

An Adaptive Incremental Gradient Method With Support for Non-Euclidean Norms.
CoRR, 2022

On the Finite-Time Complexity and Practical Computation of Approximate Stationarity Concepts of Lipschitz Functions.
Proceedings of the International Conference on Machine Learning, 2022

Fast and Reliable Evaluation of Adversarial Robustness with Minimum-Margin Attack.
Proceedings of the International Conference on Machine Learning, 2022

Practical Schemes for Finding Near-Stationary Points of Convex Finite-Sums.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Accelerating Perturbed Stochastic Iterates in Asynchronous Lock-Free Optimization.
CoRR, 2021

Local Reweighting for Adversarial Training.
CoRR, 2021

2020
VR-SGD: A Simple Stochastic Variance Reduction Method for Machine Learning.
IEEE Trans. Knowl. Data Eng., 2020

Edit Distance Embedding using Convolutional Neural Networks.
CoRR, 2020

Amortized Nesterov's Momentum: A Robust Momentum and Its Application to Deep Learning.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Convolutional Embedding for Edit Distance.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Boosting First-Order Methods by Shifting Objective: New Schemes with Faster Worst-Case Rates.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Tight Convergence Rate of Gradient Descent for Eigenvalue Computation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019
Hyper-Sphere Quantization: Communication-Efficient SGD for Federated Learning.
CoRR, 2019

Direct Acceleration of SAGA using Sampled Negative Momentum.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Norm-Range Partition: A Univiseral Catalyst for LSH based Maximum Inner Product Search (MIPS).
CoRR, 2018

Direct Acceleration of SAGA using Sampled Negative Momentum.
CoRR, 2018

VR-SGD: A Simple Stochastic Variance Reduction Method for Machine Learning.
CoRR, 2018

A Simple Stochastic Variance Reduced Algorithm with Fast Convergence Rates.
Proceedings of the 35th International Conference on Machine Learning, 2018

Guaranteed Sufficient Decrease for Stochastic Variance Reduced Gradient Optimization.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

ASVRG: Accelerated Proximal SVRG.
Proceedings of The 10th Asian Conference on Machine Learning, 2018


  Loading...