Yao Shu

Zilin Zhu

CoRR, May, 2026

Why Zeroth-Order Adaptation May Forget Less: A Randomized Shaping Theory.

[BibT_eX]

[DOI]

Jian Mu

CoRR, May, 2026

Reference-Sampled Boltzmann Projection for KL-Regularized RLVR: Target-Matched Weighted SFT, Finite One-Shot Gaps, and Policy Mirror Descent.

[BibT_eX]

[DOI]

CoRR, May, 2026

Can We Change the Stroke Size for Easier Diffusion?

[BibT_eX]

[DOI]

CoRR, March, 2026

Multinoulli Extension: A Lossless Continuous Relaxation for Partition-Constrained Subset Selection.

[BibT_eX]

[DOI]

CoRR, March, 2026

Model-based Offline RL via Robust Value-Aware Model Learning with Implicitly Differentiable Adaptive Weighting.

[BibT_eX]

[DOI]

CoRR, March, 2026

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation.

[BibT_eX]

[DOI]

CoRR, March, 2026

MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks.

[BibT_eX]

[DOI]

CoRR, March, 2026

LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models.

[BibT_eX]

[DOI]

CoRR, March, 2026

Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation.

[BibT_eX]

[DOI]

CoRR, March, 2026

Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling.

[BibT_eX]

[DOI]

CoRR, February, 2026

1S-DAug: One-Shot Data Augmentation for Robust Few-Shot Generalization.

[BibT_eX]

[DOI]

CoRR, February, 2026

Controllable Concept Bottleneck Models.

[BibT_eX]

[DOI]

CoRR, January, 2026

Optimization and Robustness-Informed Membership Inference Attacks for LLMs.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Scheduling Your LLM Reinforcement Learning with Reasoning Trees.

[BibT_eX]

[DOI]

CoRR, October, 2025

FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline.

[BibT_eX]

[DOI]

CoRR, October, 2025

Self-Reflective Generation at Test Time.

[BibT_eX]

[DOI]

CoRR, October, 2025

FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits.

[BibT_eX]

[DOI]

CoRR, September, 2025

T-POP: Test-Time Personalization with Online Preference Feedback.

[BibT_eX]

[DOI]

CoRR, September, 2025

Test-Time Policy Adaptation for Enhanced Multi-Turn Interactions with LLMs.

[BibT_eX]

[DOI]

CoRR, September, 2025

RIMO: An Easy-to-Evaluate, Hard-to-Solve Olympiad Benchmark for Advanced Mathematical Reasoning.

[BibT_eX]

[DOI]

Ziye Chen

Chengwei Qin

CoRR, September, 2025

Implicit Reasoning in Large Language Models: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, September, 2025

Thinking with Nothinking Calibration: A New In-Context Learning Paradigm in Reasoning Large Language Models.

[BibT_eX]

[DOI]

CoRR, August, 2025

ReDit: Reward Dithering for Improved LLM Policy Optimization.

[BibT_eX]

[DOI]

CoRR, June, 2025

Zeroth-Order Optimization is Secretly Single-Step Policy Optimization.

[BibT_eX]

[DOI]

CoRR, June, 2025

On Path to Multimodal Historical Reasoning: HistBench and HistAgent.

[BibT_eX]

[DOI]

CoRR, May, 2025

PAFT: Prompt-Agnostic Fine-Tuning.

[BibT_eX]

[DOI]

CoRR, February, 2025

Meta-Prompt Optimization for LLM-Based Sequential Decision Making.

[BibT_eX]

[DOI]

CoRR, February, 2025

Characterizing Logs in Vulnerability Reports: In-Depth Analysis and Security Implications.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Software Analysis, 2025

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

WMarkGPT: Watermarked Image Understanding via Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models.

[BibT_eX]

[DOI]

Wenyang Hu

See-Kiong Ng

Fei Richard Yu

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Refining Adaptive Zeroth-Order Optimization at Ease.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Multinoulli Extension: A Lossless Yet Effective Probabilistic Framework for Subset Selection over Partition Constraints.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

PAFT: Prompt-Agnostic Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Flexora: Flexible Low-Rank Adaptation for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

FSL-Rectifier: Rectify Outliers in Few-Shot Learning via Test-Time Augmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Flexora: Flexible Low Rank Adaptation for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Data-Centric AI in the Age of Large Language Models.

[BibT_eX]

[DOI]

Gregory Kang Ruey Lau

Hieu Dao

Lucas Agussurja

Rachael Hwee Ling Sim

CoRR, 2024

Localized Zeroth-Order Prompt Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations.

[BibT_eX]

[DOI]

CoRR, 2024

Small-Scale Pedestrian Detection Using Fusion Network and Probabilistic Loss.

[BibT_eX]

[DOI]

IEEE Access, 2024

Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Localized Zeroth-Order Prompt Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Robustifying and Boosting Training-Free Neural Architecture Search.

[BibT_eX]

[DOI]

Zhenfeng He

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Position Paper: Data-Centric AI in the Age of Large Language Models.

[BibT_eX]

[DOI]

Gregory Kang Ruey Lau

Hieu Dao

Lucas Agussurja

Rachael Hwee Ling Sim

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023

Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

Federated Zeroth-Order Optimization using Trajectory-Informed Surrogate Gradients.

[BibT_eX]

[DOI]

Xiaoqiang Lin

CoRR, 2023

Exploiting Correlated Auxiliary Feedback in Parameterized Bandits.

[BibT_eX]

[DOI]

Arun Verma

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Quantum Bayesian Optimization.

[BibT_eX]

[DOI]

Gregory Kang Ruey Lau

Arun Verma

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Zeroth-Order Optimization with Trajectory-Informed Derivative Estimation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Federated Neural Bandits.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Federated Neural Bandit.

[BibT_eX]

[DOI]

CoRR, 2022

Neural ensemble search via Bayesian sampling.

[BibT_eX]

[DOI]

Yizhou Chen

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Unifying and Boosting Gradient-Based Training-Free Neural Architecture Search.

[BibT_eX]

[DOI]

Zhaoxuan Wu

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sample-Then-Optimize Batch Neural Thompson Sampling.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DAVINZ: Data Valuation using Deep Neural Networks at Initialization.

[BibT_eX]

[DOI]

Zhaoxuan Wu

Proceedings of the International Conference on Machine Learning, 2022

NASI: Label- and Data-agnostic Neural Architecture Search at Initialization.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Going Beyond Neural Architecture Search with Sampling-based Neural Ensemble Search.

[BibT_eX]

[DOI]

Yizhou Chen

CoRR, 2021

Dynamic Routing Networks.

[BibT_eX]

[DOI]

Shaofeng Cai

Wei Wang

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

2020

Multimodal information fusion based human movement recognition.

[BibT_eX]

[DOI]

Heng Zhang

Multim. Tools Appl., 2020

Tight Lower Complexity Bounds for Strongly Convex Finite-Sum Optimization.

[BibT_eX]

[DOI]

Min Zhang

Kun He

CoRR, 2020

Understanding Architectures Learnt by Cell-based Neural Architecture Search.

[BibT_eX]

[DOI]

Wei Wang

Shaofeng Cai

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Wireless non-invasive motion tracking of functional behavior.

[BibT_eX]

[DOI]

Pervasive Mob. Comput., 2019

Research and development of off-line services for the 3D automatic printing machine based on cloud manufacturing.

[BibT_eX]

[DOI]

J. Ambient Intell. Humaniz. Comput., 2019

Parameterized representation and solution method of the lightweight 3D model virtual assembly constraint.

[BibT_eX]

[DOI]

J. Ambient Intell. Humaniz. Comput., 2019

ISBNet: Instance-aware Selective Branching Network.

[BibT_eX]

[DOI]

CoRR, 2019

Efficient Memory Management for GPU-based Deep Learning Systems.

[BibT_eX]

[DOI]

CoRR, 2019

2018

Research on Human Motion Recognition Based on Wi-Fi and Inertial Sensor Signal Fusion.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE SmartWorld, 2018

WiTT: Modeling and the evaluation of table tennis actions based on WIFI signals.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Pattern Recognition, 2018

2017

Understanding Deep Representations through Random Weights.

[BibT_eX]

[DOI]

CoRR, 2017

2005

An OO-based Design Model of Software Agent.

[BibT_eX]

[DOI]

Jianxing Li

XinJun Mao