Yao Shu

Orcid: 0000-0003-2721-751X

According to our database1, Yao Shu authored at least 77 papers between 2005 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization.
CoRR, May, 2026

Revisiting Zeroth-Order Hessian Approximation: A Single-Step Policy Optimization Lens.
CoRR, May, 2026

AR1-ZO: Topology-Aware Rank-1 Zeroth-Order Queries for High-Rank LoRA Fine-Tuning.
CoRR, May, 2026

Compander-Aligned Query Geometry for Quantized Zeroth-Order Optimization.
CoRR, May, 2026

Why Zeroth-Order Adaptation May Forget Less: A Randomized Shaping Theory.
CoRR, May, 2026

Reference-Sampled Boltzmann Projection for KL-Regularized RLVR: Target-Matched Weighted SFT, Finite One-Shot Gaps, and Policy Mirror Descent.
CoRR, May, 2026

Can We Change the Stroke Size for Easier Diffusion?
CoRR, March, 2026

Multinoulli Extension: A Lossless Continuous Relaxation for Partition-Constrained Subset Selection.
CoRR, March, 2026

Model-based Offline RL via Robust Value-Aware Model Learning with Implicitly Differentiable Adaptive Weighting.
CoRR, March, 2026

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation.
CoRR, March, 2026

MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks.
CoRR, March, 2026

LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models.
CoRR, March, 2026

Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation.
CoRR, March, 2026

Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling.
CoRR, February, 2026

1S-DAug: One-Shot Data Augmentation for Robust Few-Shot Generalization.
CoRR, February, 2026

Controllable Concept Bottleneck Models.
CoRR, January, 2026

Optimization and Robustness-Informed Membership Inference Attacks for LLMs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Scheduling Your LLM Reinforcement Learning with Reasoning Trees.
CoRR, October, 2025

FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline.
CoRR, October, 2025

Self-Reflective Generation at Test Time.
CoRR, October, 2025

FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits.
CoRR, September, 2025

T-POP: Test-Time Personalization with Online Preference Feedback.
CoRR, September, 2025

Test-Time Policy Adaptation for Enhanced Multi-Turn Interactions with LLMs.
CoRR, September, 2025

RIMO: An Easy-to-Evaluate, Hard-to-Solve Olympiad Benchmark for Advanced Mathematical Reasoning.
CoRR, September, 2025

Implicit Reasoning in Large Language Models: A Comprehensive Survey.
CoRR, September, 2025

Thinking with Nothinking Calibration: A New In-Context Learning Paradigm in Reasoning Large Language Models.
CoRR, August, 2025

ReDit: Reward Dithering for Improved LLM Policy Optimization.
CoRR, June, 2025

Zeroth-Order Optimization is Secretly Single-Step Policy Optimization.
CoRR, June, 2025

On Path to Multimodal Historical Reasoning: HistBench and HistAgent.
CoRR, May, 2025

PAFT: Prompt-Agnostic Fine-Tuning.
CoRR, February, 2025

Meta-Prompt Optimization for LLM-Based Sequential Decision Making.
CoRR, February, 2025

Characterizing Logs in Vulnerability Reports: In-Depth Analysis and Security Implications.
Proceedings of the IEEE International Conference on Software Analysis, 2025

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

WMarkGPT: Watermarked Image Understanding via Multimodal Large Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Refining Adaptive Zeroth-Order Optimization at Ease.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Multinoulli Extension: A Lossless Yet Effective Probabilistic Framework for Subset Selection over Partition Constraints.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

PAFT: Prompt-Agnostic Fine-Tuning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Flexora: Flexible Low-Rank Adaptation for Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

FSL-Rectifier: Rectify Outliers in Few-Shot Learning via Test-Time Augmentation.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Flexora: Flexible Low Rank Adaptation for Large Language Models.
CoRR, 2024

Data-Centric AI in the Age of Large Language Models.
CoRR, 2024

Localized Zeroth-Order Prompt Optimization.
CoRR, 2024

OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations.
CoRR, 2024

Small-Scale Pedestrian Detection Using Fusion Network and Probabilistic Loss.
IEEE Access, 2024

Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Localized Zeroth-Order Prompt Optimization.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Robustifying and Boosting Training-Free Neural Architecture Search.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Position Paper: Data-Centric AI in the Age of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers.
CoRR, 2023

Federated Zeroth-Order Optimization using Trajectory-Informed Surrogate Gradients.
CoRR, 2023

Exploiting Correlated Auxiliary Feedback in Parameterized Bandits.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Quantum Bayesian Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Zeroth-Order Optimization with Trajectory-Informed Derivative Estimation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Federated Neural Bandits.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Federated Neural Bandit.
CoRR, 2022

Neural ensemble search via Bayesian sampling.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Unifying and Boosting Gradient-Based Training-Free Neural Architecture Search.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sample-Then-Optimize Batch Neural Thompson Sampling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DAVINZ: Data Valuation using Deep Neural Networks at Initialization.
Proceedings of the International Conference on Machine Learning, 2022

NASI: Label- and Data-agnostic Neural Architecture Search at Initialization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Going Beyond Neural Architecture Search with Sampling-based Neural Ensemble Search.
CoRR, 2021

Dynamic Routing Networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

2020
Multimodal information fusion based human movement recognition.
Multim. Tools Appl., 2020

Tight Lower Complexity Bounds for Strongly Convex Finite-Sum Optimization.
CoRR, 2020

Understanding Architectures Learnt by Cell-based Neural Architecture Search.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Wireless non-invasive motion tracking of functional behavior.
Pervasive Mob. Comput., 2019

Research and development of off-line services for the 3D automatic printing machine based on cloud manufacturing.
J. Ambient Intell. Humaniz. Comput., 2019

Parameterized representation and solution method of the lightweight 3D model virtual assembly constraint.
J. Ambient Intell. Humaniz. Comput., 2019

ISBNet: Instance-aware Selective Branching Network.
CoRR, 2019

Efficient Memory Management for GPU-based Deep Learning Systems.
CoRR, 2019

2018
Research on Human Motion Recognition Based on Wi-Fi and Inertial Sensor Signal Fusion.
Proceedings of the 2018 IEEE SmartWorld, 2018

WiTT: Modeling and the evaluation of table tennis actions based on WIFI signals.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

2017
Understanding Deep Representations through Random Weights.
CoRR, 2017

2005
An OO-based Design Model of Software Agent.
Proceedings of the Sixth International Conference on Parallel and Distributed Computing, 2005


  Loading...