Li Shen

IEEE Trans. Pattern Anal. Mach. Intell., April, 2026

Universally Empowering Zeroth-Order Optimization via Adaptive Layer-wise Sampling.

[BibT_eX]

[DOI]

CoRR, April, 2026

Rethinking the Personalized Relaxed Initialization in the Federated Learning: Consistency and Generalization.

[BibT_eX]

[DOI]

CoRR, April, 2026

Teleportation: Defense Against Stealing Attacks of Data-Driven Healthcare APIs.

[BibT_eX]

[DOI]

IEEE Trans. Artif. Intell., March, 2026

Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2026

LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing.

[BibT_eX]

[DOI]

CoRR, March, 2026

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments.

[BibT_eX]

[DOI]

CoRR, March, 2026

Release the Potential of Memory Buffer in Continual Learning: A Dynamic System Perspective.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., February, 2026

Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., February, 2026

Adaptive Batch Size Time Evolving Stochastic Gradient Descent for Federated Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., February, 2026

Stability and Generalization of Push-Sum Based Decentralized Optimization over Directed Graphs.

[BibT_eX]

[DOI]

CoRR, February, 2026

Reason-IAD: Knowledge-Guided Dynamic Latent Reasoning for Explainable Industrial Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, February, 2026

Sparse Layer Sharpness-Aware Minimization for Efficient Fine-Tuning.

[BibT_eX]

[DOI]

CoRR, February, 2026

Who Deserves the Reward? SHARP: Shapley Credit-based Optimization for Multi-Agent System.

[BibT_eX]

[DOI]

CoRR, February, 2026

Surgery: Mitigating Harmful Fine-Tuning for Large Language Models via Attention Sink.

[BibT_eX]

[DOI]

CoRR, February, 2026

Mitigating Safety Tax via Distribution-Grounded Refinement in Large Reasoning Models.

[BibT_eX]

[DOI]

CoRR, February, 2026

Task-Distributionally Robust Data-Free Meta-Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., January, 2026

Language-based Trial and Error Falls Behind in the Era of Experience.

[BibT_eX]

[DOI]

CoRR, January, 2026

Understanding Model Merging: A Unified Generalization Framework for Heterogeneous Experts.

[BibT_eX]

[DOI]

CoRR, January, 2026

Advancing Adaptive Multi-Stage Video Anomaly Reasoning: A Benchmark Dataset and Method.

[BibT_eX]

[DOI]

CoRR, January, 2026

Decentralized Partial Model Personalization With Guaranteed Nonconvex Convergence.

[BibT_eX]

[DOI]

IEEE Trans. Netw., 2026

EEformer: Early Exiting for Transformer With Global-Local Exits and Progressive Fine-Tuning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2026

Subspace based Federated Unlearning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2026

Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2026

FreeStyle: Free lunch for text-guided style transfer using diffusion models.

[BibT_eX]

[DOI]

Pattern Recognit., 2026

Towards understanding memory buffer based continual learning.

[BibT_eX]

[DOI]

Neural Networks, 2026

Curiosity-driven cooperation for long-tailed multi-label learning.

[BibT_eX]

[DOI]

Neural Networks, 2026

Prodigal: Backdoor defense for federated learning beyond robust aggregation.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2026

Boosting backdoor attack with a learnable poisoning sample selection strategy.

[BibT_eX]

[DOI]

Neurocomputing, 2026

Improving zero-shot translation with the navigation ability-enhanced language tags.

[BibT_eX]

[DOI]

Eng. Appl. Artif. Intell., 2026

Prompt tuning with preference ranking for few-shot pre-trained decision transformer.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2026

CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

Continual Diffuser (CoD): Mastering Continual Offline RL With Experience Rehearsal.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., December, 2025

CoFormer: Collaborating With Heterogeneous Edge Devices for Scalable Transformer Inference.

[BibT_eX]

[DOI]

IEEE Trans. Computers, December, 2025

Toward the Flatter Landscape and Better Generalization in Federated Learning Under Client-Level Differential Privacy.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2025

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning.

[BibT_eX]

[DOI]

CoRR, December, 2025

Aligning Text-to-Image Diffusion Models With Constrained Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

DREAM: A Dual Variational Framework for Unsupervised Graph Domain Adaptation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing.

[BibT_eX]

[DOI]

CoRR, November, 2025

Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering.

[BibT_eX]

[DOI]

CoRR, November, 2025

Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis for Large Reasoning Models.

[BibT_eX]

[DOI]

CoRR, November, 2025

Sequential Federated Learning in Hierarchical Architecture on Non-IID Datasets.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., October, 2025

Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., October, 2025

Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler.

[BibT_eX]

[DOI]

CoRR, October, 2025

ScaleNet: Scaling up Pretrained Neural Networks with Incremental Parameters.

[BibT_eX]

[DOI]

CoRR, October, 2025

Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation.

[BibT_eX]

[DOI]

CoRR, October, 2025

Rewiring Experts on the Fly:Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Pharmacist: Safety Alignment Data Curation for Large Language Models against Harmful Fine-tuning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training.

[BibT_eX]

[DOI]

CoRR, October, 2025

AdaptiveFL: Communication-Adaptive Federated Learning Under Dynamic Bandwidth.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., September, 2025

Toward Understanding the Generalizability of Delayed Stochastic Gradient Descent.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., September, 2025

A Multi-Language Object-Oriented Programming Benchmark for Large Language Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios.

[BibT_eX]

[DOI]

CoRR, September, 2025

EchoBench: Benchmarking Sycophancy in Medical Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

Consistent Estimation of Numerical Distributions under Local Differential Privacy by Wavelet Expansion.

[BibT_eX]

[DOI]

CoRR, September, 2025

Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, September, 2025

Model Unmerging: Making Your Models Unmergeable for Secure Model Sharing.

[BibT_eX]

[DOI]

CoRR, September, 2025

Asymmetrically Decentralized Federated Learning.

[BibT_eX]

[DOI]

IEEE Trans. Computers, August, 2025

Constraint Boundary Wandering Framework: Enhancing Constrained Optimization With Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

On Nonconvex SGD Under Unbounded Noise With Weak Gradient Lipschitz and Delayed Stochastic Gradient.

[BibT_eX]

[DOI]

Tao Sun

Xinwang Liu

IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

Building accurate translation-tailored large language models with language-aware instruction tuning.

[BibT_eX]

[DOI]

Frontiers Inf. Technol. Electron. Eng., August, 2025

Data-Adaptive Weight-Ensembling for Multi-task Model Fusion.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., August, 2025

ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., August, 2025

Diffusion Language Models Know the Answer Before Decoding.

[BibT_eX]

[DOI]

CoRR, August, 2025

LOST: Low-rank and Sparse Pre-training for Large Language Models.

[BibT_eX]

[DOI]

CoRR, August, 2025

Graph Convolutional Mixture-of-Experts Learner Network for Long-Tailed Domain Generalization.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., July, 2025

DFedADMM: Dual Constraint Controlled Model Inconsistency for Decentralize Federated Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2025

Hyper-modal Imputation Diffusion Embedding with Dual-Distillation for Federated Multimodal Knowledge Graph Completion.

[BibT_eX]

[DOI]

CoRR, June, 2025

GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching.

[BibT_eX]

[DOI]

CoRR, June, 2025

AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs.

[BibT_eX]

[DOI]

CoRR, June, 2025

MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

DGL-GAN: discriminator-guided GAN compression.

[BibT_eX]

[DOI]

Vis. Comput., May, 2025

Revisiting Flatness-Aware Optimization in Continual Learning With Orthogonal Gradient Projection.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

LightSAM: Parameter-Agnostic Sharpness-Aware Minimization.

[BibT_eX]

[DOI]

CoRR, May, 2025

Decision Flow Policy Optimization.

[BibT_eX]

[DOI]

CoRR, May, 2025

Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval.

[BibT_eX]

[DOI]

CoRR, May, 2025

Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging.

[BibT_eX]

[DOI]

CoRR, May, 2025

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought.

[BibT_eX]

[DOI]

CoRR, May, 2025

MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval.

[BibT_eX]

[DOI]

CoRR, May, 2025

R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search.

[BibT_eX]

[DOI]

CoRR, May, 2025

R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO.

[BibT_eX]

[DOI]

CoRR, May, 2025

Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities.

[BibT_eX]

[DOI]

CoRR, May, 2025

Federated Learning With Only Positive Labels by Exploring Label Correlations.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., April, 2025

AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization.

[BibT_eX]

[DOI]

CoRR, April, 2025

Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection.

[BibT_eX]

[DOI]

CoRR, April, 2025

Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, April, 2025

A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

On Efficient Training of Large-Scale Deep Learning Models.

[BibT_eX]

[DOI]

ACM Comput. Surv., March, 2025

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models.

[BibT_eX]

[DOI]

CoRR, February, 2025

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam.

[BibT_eX]

[DOI]

CoRR, February, 2025

On Theoretical Limits of Learning with Label Differential Privacy.

[BibT_eX]

[DOI]

CoRR, February, 2025

Zero Token-Driven Deep Thinking in LLMs: Unlocking the Full Potential of Existing Parameters via Cyclic Refinement.

[BibT_eX]

[DOI]

CoRR, February, 2025

SeWA: Selective Weight Average via Probabilistic Masking.

[BibT_eX]

[DOI]

CoRR, February, 2025

HRP: High-Rank Preheating for Superior LoRA Initialization.

[BibT_eX]

[DOI]

CoRR, February, 2025

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging.

[BibT_eX]

[DOI]

CoRR, February, 2025

Leveraging Reasoning with Guidelines to Elicit and Utilize Knowledge for Enhancing Safety Alignment.

[BibT_eX]

[DOI]

CoRR, February, 2025

Winning Prize Comes from Losing Tickets: Improve Invariant Learning by Exploring Variant Parameters for Out-of-Distribution Generalization.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., January, 2025

TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs.

[BibT_eX]

[DOI]

CoRR, January, 2025

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning.

[BibT_eX]

[DOI]

CoRR, January, 2025

Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging.

[BibT_eX]

[DOI]

CoRR, January, 2025

Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent.

[BibT_eX]

[DOI]

CoRR, January, 2025

QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

Are Large Language Models Really Robust to Word-Level Perturbations?

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2025

A Pyramid Fusion MLP for Dense Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2025

ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2025

Targeted Vaccine: Safety Alignment for Large Language Models Against Harmful Fine-Tuning via Layer-Wise Perturbation.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2025

DFedGFM: Pursuing global consistency for Decentralized Federated Learning via global flatness and global momentum.

[BibT_eX]

[DOI]

Neural Networks, 2025

Communication-efficient distributed learning with Local Immediate Error Compensation.

[BibT_eX]

[DOI]

Neural Networks, 2025

Learning from models beyond fine-tuning.

[BibT_eX]

[DOI]

Nat. Mac. Intell., 2025

FusionBench: A Unified Library and Comprehensive Benchmark for Deep Model Fusion.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2025

Code-switching finetuning: Bridging multilingual pretrained language models for enhanced cross-lingual performance.

[BibT_eX]

[DOI]

Eng. Appl. Artif. Intell., 2025

Graph decision transformer for offline reinforcement learning.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2025

Enhancing column generation by reinforcement learning-based hyper-heuristic for vehicle routing and scheduling problems.

[BibT_eX]

[DOI]

Kuan Xu

Lindong Liu

Comput. Ind. Eng., 2025

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

R1-ShareVL: Incentivizing Reasoning Capabilities of Multimodal Large Language Models via Share-GRPO.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Merging on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Dynamic Analysis and Adaptive Discriminator for Fake News Detection.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Hypernetwork Aggregation for Decentralized Personalized Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Decision Mixer: Integrating Long-term and Local Dependencies via Dynamic Token Selection for Decision-Making.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Contextual Bandits for Unbounded Context Distributions.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

GraphCL: Graph-based Clustering for Semi-Supervised Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Safety Reasoning with Guidelines.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Retrieval-Augmented Perception: High-resolution Image Perception Meets Visual RAG.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Targeted Low-rank Refinement: Enhancing Sparse Language Models with Precision.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Multinoulli Extension: A Lossless Yet Effective Probabilistic Framework for Subset Selection over Partition Constraints.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Enhancing Learning with Label Differential Privacy by Vector Approximation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Open-Vocabulary Customization from CLIP via Data-Free Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dynamic Neural Fortresses: An Adaptive Shield for Model Extraction Defense.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Understanding the Stability-based Generalization of Personalized Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PEARL: Towards Permutation-Resilient LLMs.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Robust Knowledge Editing via Explicit Reasoning Chains for Distractor-Resilient Multi-Hop QA.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Investigating the Role of Weight Decay in Enhancing Nonconvex SGD.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Edit Once, Update Everywhere: A Simple Framework for Cross-Lingual Knowledge Synchronization in LLMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., December, 2024

Master-Slave Deep Architecture for Top-K Multiarmed Bandits With Nonlinear Bandit Feedback and Diversity Constraints.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., December, 2024

FedGAMMA: Federated Learning With Global Sharpness-Aware Minimization.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., December, 2024

Continual Learning From a Stream of APIs.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

On Transforming Reinforcement Learning With Transformers: The Development Trajectory.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Neural-aware Decoupling Fusion based Personalized Federated Learning for Intelligent Sensing.

[BibT_eX]

[DOI]

ACM Trans. Sens. Networks, November, 2024

SPORT: A Subgraph Perspective on Graph Classification with Label Noise.

[BibT_eX]

[DOI]

ACM Trans. Knowl. Discov. Data, November, 2024

Meta-Learning Without Data via Unconditional Diffusion Models.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., November, 2024

A Unified Analysis of AdaGrad With Weighted Aggregation and Momentum Acceleration.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., October, 2024

Quantum Imitation Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., October, 2024

Efficient Federated Learning With Enhanced Privacy via Lottery Ticket Pruning in Edge Computing.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., October, 2024

Retain and Adapt: Online Sequential EEG Classification With Subject Shift.

[BibT_eX]

[DOI]

IEEE Trans. Artif. Intell., September, 2024

Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System.

[BibT_eX]

[DOI]

ACM Trans. Knowl. Discov. Data, July, 2024

Generalized Embedding Machines for Recommender Systems.

[BibT_eX]

[DOI]

Mach. Intell. Res., June, 2024

Messages are Never Propagated Alone: Collaborative Hypergraph Neural Network for Time-Series Forecasting.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Local AdaGrad-type algorithm for stochastic convex-concave optimization.

[BibT_eX]

[DOI]

Mach. Learn., April, 2024

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks.

[BibT_eX]

[DOI]

Neural Networks, January, 2024

Joint Admission Control and Resource Allocation of Virtual Network Embedding via Hierarchical Deep Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Serv. Comput., 2024

Revisiting Discrete Soft Actor-Critic.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Visual Prompt Based Personalized Federated Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

SGDA: Towards 3-D Universal Pulmonary Nodule Detection via Slice Grouped Domain Attention.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Biol. Bioinform., 2024

Dynamic PDGAN: discriminator-boosted knowledge distillation for StyleGANs.

[BibT_eX]

[DOI]

J. Electronic Imaging, 2024

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search.

[BibT_eX]

[DOI]

CoRR, 2024

Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring the Generalization Capabilities of AID-based Bi-level Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

A Unified Analysis for Finite Weight Averaging.

[BibT_eX]

[DOI]

CoRR, 2024

AGLP: A Graph Learning Perspective for Semi-supervised Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2024

DiM: <i>f</i>-Divergence Minimization Guided Sharpness-Aware Optimization for Semi-supervised Medical Image Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

Aligning Few-Step Diffusion Models with Dense Reward Difference Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Continual Task Learning through Adaptive Policy Self-Composition.

[BibT_eX]

[DOI]

CoRR, 2024

Stability and Generalization for Distributed SGDA.

[BibT_eX]

[DOI]

CoRR, 2024

Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Communication Learning in Multi-Agent Systems from Graph Modeling Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Constraint-aware Learning for Resource Allocation in NFV-enabled Networks.

[BibT_eX]

[DOI]

CoRR, 2024

Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces.

[BibT_eX]

[DOI]

CoRR, 2024

SurgeryV2: Bridging the Gap Between Model Merging and Multi-Task Learning with Deep Representation Surgery.

[BibT_eX]

[DOI]

CoRR, 2024

Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Targeted Vaccine: Safety Alignment for Large Language Models against Harmful Fine-Tuning via Layer-wise Perturbation.

[BibT_eX]

[DOI]

CoRR, 2024

Boosting the Performance of Decentralized Federated Learning via Catalyst Acceleration.

[BibT_eX]

[DOI]

CoRR, 2024

OledFL: Unleashing the Potential of Decentralized Federated Learning via Opposite Lookahead Enhancement.

[BibT_eX]

[DOI]

CoRR, 2024

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion.

[BibT_eX]

[DOI]

CoRR, 2024

USCD: Improving Code Generation of LLMs by Uncertainty-Aware Selective Contrastive Decoding.

[BibT_eX]

[DOI]

CoRR, 2024

Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal.

[BibT_eX]

[DOI]

CoRR, 2024

Convergent Differential Privacy Analysis for General Federated Learning: the f-DP Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2024

Byzantine-resilient Federated Learning Employing Normalized Gradients on Non-IID Datasets.

[BibT_eX]

[DOI]

CoRR, 2024

(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Efficient Pareto Set Approximation via Mixture of Experts Based Model Fusion.

[BibT_eX]

[DOI]

CoRR, 2024

FusionBench: A Comprehensive Benchmark of Deep Model Fusion.

[BibT_eX]

[DOI]

CoRR, 2024

AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

Learning with User-Level Local Differential Privacy.

[BibT_eX]

[DOI]

CoRR, 2024

Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo.

[BibT_eX]

[DOI]

CoRR, 2024

Continuous Spiking Graph Neural Networks.

[BibT_eX]

[DOI]

Nan Yin

Mengzhu Wang

Hitesh Laxmichand Patel

Baopu Li

Bin Gu

Huan Xiong

CoRR, 2024

A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data.

[BibT_eX]

[DOI]

CoRR, 2024

Building Accurate Translation-Tailored LLMs with Language Aware Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, 2024

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping.

[BibT_eX]

[DOI]

CoRR, 2024

Solving Continual Offline Reinforcement Learning with Decision Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

Decomposed Prompt Decision Transformer for Efficient Unseen Task Generalization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

A Huber Loss Minimization Approach to Mean Estimation under User-level Differential Privacy.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

A-FedPD: Aligning Dual-Drift is All Federated Primal-Dual Learning Needs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

PrimKD: Primary Modality Guided Multimodal Fusion for RGB-D Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MuEP: A Multimodal Benchmark for Embodied Planning with Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Representation Surgery for Multi-Task Model Merging.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Generalization Analysis of Stochastic Weight Averaging with General Sampling.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Merging Multi-Task Models via Weight-Ensembling Mixture of Experts.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Q-value Regularized Transformer for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

DREAM: Dual Structured Exploration with Mixup for Open-set Graph Domain Adaption.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

AdaMerging: Adaptive Model Merging for Multi-Task Learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Unified and General Framework for Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Parameter-Efficient Multi-Task Model Fusion with Partial Linearization.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning Multi-Agent Communication from Graph Modeling Perspective.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Improving Non-Transferable Representation Learning by Harnessing Content and Style.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Training A Secure Model Against Data-Free Model Extraction.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Sheared Backpropagation for Fine-Tuning Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Free: Faster and Better Data-Free Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Decentralized Directed Collaboration for Personalized Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Your Transferability Barrier is Fragile: Free-Lunch for Transferring the Non-Transferable Learning.

[BibT_eX]

[DOI]

Ziming Hong

Tongliang Liu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Knowledge Distillation for Autoregressive Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Neural Network Approximation for Pessimistic Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

OMG: Towards Effective Graph Classification Against Label Noise.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., December, 2023

Distributionally Robust Memory Evolution With Generalized Divergence for Continual Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Efficient Federated Learning Via Local Adaptive Amended Optimizer With Linear Speedup.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Hierarchical Detailed Intermediate Supervision for Image-to-Image Translation.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., December, 2023

Prescribed Safety Performance Imitation Learning From a Single Expert Dataset.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Don't Be So Dense: Sparse-to-Sparse GAN Training Without Sacrificing Performance.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., October, 2023

Task-Adaptive Feature Disentanglement and Hallucination for Few-Shot Classification.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., August, 2023

Differentiable Neural Architecture Search for Extremely Lightweight Image Super-Resolution.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., June, 2023

Curriculum-Based Asymmetric Multi-Task Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Reducing bi-level feature redundancy for unsupervised domain adaptation.

[BibT_eX]

[DOI]

Pattern Recognit., May, 2023

Efficient-Adam: Communication-Efficient Distributed Adam.

[BibT_eX]

[DOI]

IEEE Trans. Signal Process., 2023

Dynamic Contrastive Distillation for Image-Text Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Fusion of Global and Local Knowledge for Personalized Federated Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

FedDAG: Federated DAG Structure Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion.

[BibT_eX]

[DOI]

CoRR, 2023

Rethinking SIGN Training: Provable Nonconvex Acceleration without First- and Second-Order Gradient Lipschitz.

[BibT_eX]

[DOI]

CoRR, 2023

Learn From Model Beyond Fine-Tuning: A Survey.

[BibT_eX]

[DOI]

CoRR, 2023

Asymmetrically Decentralized Federated Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Which mode is better for federated learning? Centralized or Decentralized.

[BibT_eX]

[DOI]

CoRR, 2023

Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models.

[BibT_eX]

[DOI]

CoRR, 2023

Unlikelihood Tuning on Negative Samples Amazingly Improves Zero-Shot Translation.

[BibT_eX]

[DOI]

CoRR, 2023

FedLALR: Client-Specific Adaptive Learning Rates Achieve Linear Speedup for Non-IID Data.

[BibT_eX]

[DOI]

CoRR, 2023

MerA: Merging Pretrained Adapters For Few-Shot Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Understanding the Generalizability of Delayed Stochastic Gradient Descent.

[BibT_eX]

[DOI]

CoRR, 2023

DFedADMM: Dual Constraints Controlled Model Inconsistency for Decentralized Federated Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Towards More Suitable Personalization in Federated Learning via Decentralized Partial Model Training.

[BibT_eX]

[DOI]

CoRR, 2023

Prompt-Tuning Decision Transformer with Preference Ranking.

[BibT_eX]

[DOI]

CoRR, 2023

Towards the Flatter Landscape and Better Generalization in Federated Learning under Client-level Differential Privacy.

[BibT_eX]

[DOI]

CoRR, 2023

On Efficient Training of Large-Scale Deep Learning Models: A Literature Review.

[BibT_eX]

[DOI]

CoRR, 2023

Graph Decision Transformer.

[BibT_eX]

[DOI]

CoRR, 2023

SGDA: Towards 3D Universal Pulmonary Nodule Detection via Slice Grouped Domain Attention.

[BibT_eX]

[DOI]

CoRR, 2023

OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System.

[BibT_eX]

[DOI]

CoRR, 2023

Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE.

[BibT_eX]

[DOI]

CoRR, 2023

Enhance Local Consistency in Federated Learning: A Multi-Step Inertial Momentum Approach.

[BibT_eX]

[DOI]

CoRR, 2023

SaFormer: A Conditional Sequence Modeling Approach to Offline Safe Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Enhancing Adversarial Training via Reweighting Optimization Trajectory.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

An Efficient Dataset Condensation Plugin and Its Application to Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Defending against Data-Free Model Extraction by Distributionally Robust Defensive Training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Understanding How Consistency Works in Federated Learning via Stage-wise Relaxed Initialization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Stable Backdoor Purification through Feature Shift Tuning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FlatMatch: Bridging Labeled Data and Unlabeled Data with Cross-Sharpness for Semi-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Federated Learning with Manifold Regularization and Normalized Update Reaggregation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dynamic Sparsity Is Channel-Level Sparsity Learner.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LGViT: Dynamic Early Exiting for Accelerating Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Off-policy Imitation Learning from Visual Inputs.

[BibT_eX]

[DOI]

Zhihao Cheng

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

CoCo: A Coupled Contrastive Framework for Unsupervised Domain Adaptive Graph Classification.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Improving the Model Consistency of Decentralized Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Are Large Kernels Better Teachers than Transformers for ConvNets?

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Learning to Learn from APIs: Black-Box Data-Free Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Towards One-shot Neural Combinatorial Solvers: Theoretical and Empirical Notes on the Cardinality-Constrained Case.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization Accuracy.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Harnessing Out-Of-Distribution Examples via Augmenting Content and Style.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Enhancing Fine-Tuning based Backdoor Defense with Sharpness-Aware Minimization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Global Balanced Experts for Federated Long-Tailed Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Data Augmented Flatness-aware Gradient Projection for Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Zero-shot Sharpness-Aware Quantization for Pre-trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Towards Making the Most of ChatGPT for Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Merging Experts into One: Improving Computational Efficiency of Mixture of Experts.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MetaMix: Towards Corruption-Robust Continual Learning with Temporally Self-Adaptive Data Transformation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Make Landscape Flatter in Differentially Private Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Robust Generalization Against Photon-Limited Corruptions via Worst-Case Sharpness Minimization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Architecture, Dataset and Model-Scale Agnostic Data-free Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Conference on Lifelong Learning Agents, 2023

Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Evaluating Model-Free Reinforcement Learning toward Safety-Critical Tasks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

AdaTask: A Task-Aware Adaptive Learning Rate Approach to Multi-Task Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

FedABC: Targeting Fair Competition in Personalized Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Offline Quantum Reinforcement Learning in a Conservative Manner.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

AlphaGAN: Fully Differentiable Architecture Search for Generative Adversarial Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Informative pairs mining based adaptive metric learning for adversarial domain adaptation.

[BibT_eX]

[DOI]

Neural Networks, 2022

Towards harnessing feature embedding for robust learning with noisy labels.

[BibT_eX]

[DOI]

Mach. Learn., 2022

Stochastic Client Selection for Federated Learning With Volatile Clients.

[BibT_eX]

[DOI]

IEEE Internet Things J., 2022

On Transforming Reinforcement Learning by Transformer: The Development Trajectory.

[BibT_eX]

[DOI]

CoRR, 2022

Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE.

[BibT_eX]

[DOI]

CoRR, 2022

Strength-Adaptive Adversarial Training.

[BibT_eX]

[DOI]

CoRR, 2022

SafeRL-Kit: Evaluating Efficient Reinforcement Learning Methods for Safe Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, 2022

Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation.

[BibT_eX]

[DOI]

CoRR, 2022

Robust Unlearnable Examples: Protecting Data Against Adversarial Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Achieving Personalized Federated Learning with Sparse Local Models.

[BibT_eX]

[DOI]

CoRR, 2022

Meta-learning without data via Wasserstein distributionally-robust model fusion.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Enhancing Top-N Item Recommendations by Peer Collaboration.

[BibT_eX]

[DOI]

Yang Sun

Fajie Yuan

Min Yang

Alexandros Karatzoglou

Decebal Constantin Mocanu

Xiaoyan Zhao

Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DEAL: An Unsupervised Domain Adaptive Framework for Graph-level Classification.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Penalized Proximal Policy Optimization for Safe Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Robust Weight Perturbation for Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Understanding Robust Overfitting of Adversarial Training and Beyond.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Improving Task-free Continual Learning by Distributionally Robust Memory Evolution.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Deep Neural Network Fusion via Graph Matching with Applications to Model Ensemble and Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

DisPFL: Towards Communication-Efficient Personalized Federated Learning via Decentralized Sparse Training.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training.

[BibT_eX]

[DOI]

Zhangyang Wang

Mykola Pechenizkiy

Proceedings of the Tenth International Conference on Learning Representations, 2022

Robust Unlearnable Examples: Protecting Data Privacy Against Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Meta-Learning with Less Forgetting on Large-Scale Non-Stationary Task Distributions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Learn and Remember Super Long Multi-Domain Task Sequence.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On the Complementarity between Pre-Training and Random-Initialization for Resource-Rich Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021

Quantized Adam with Error Feedback.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2021

UniFaceGAN: A Unified Framework for Temporally Consistent Facial Video Editing.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Knowledge Distillation With Multi-Objective Divergence Learning.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2021

DGL-GAN: Discriminator Guided Learning for GAN Compression.

[BibT_eX]

[DOI]

CoRR, 2021

Spatial-Temporal-Fusion BNN: Variational Bayesian Feature Layer.

[BibT_eX]

[DOI]

CoRR, 2021

Federated Causal Discovery.

[BibT_eX]

[DOI]

CoRR, 2021

End-to-End Adaptive Monte Carlo Denoising and Super-Resolution.

[BibT_eX]

[DOI]

CoRR, 2021

Local AdaGrad-Type Algorithm for Stochastic Convex-Concave Minimax Problems.

[BibT_eX]

[DOI]

CoRR, 2021

Sparse Training via Boosting Pruning Plasticity with Neuroregeneration.

[BibT_eX]

[DOI]

Decebal Constantin Mocanu

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DAG-GAN: Causal Structure Learning with Generative Adversarial Nets.

[BibT_eX]

[DOI]

Yinghua Gao

Shu-Tao Xia

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

MAP Inference Via ℓ <sub>2</sub>-Sphere Linear Program Reformulation.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2020

Adaptive Compact Attention For Few-shot Video-to-video Translation.

[BibT_eX]

[DOI]

CoRR, 2020

Task-agnostic Temporally Consistent Facial Video Editing.

[BibT_eX]

[DOI]

CoRR, 2020

Generalized Embedding Machines for Recommender Systems.

[BibT_eX]

[DOI]

CoRR, 2020

A Block Decomposition Algorithm for Sparse Optimization.

[BibT_eX]

[DOI]

Ganzhao Yuan

Wei-Shi Zheng

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning.

[BibT_eX]

[DOI]

Niranjan Balasubramanian

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

MAP Inference via L2-Sphere Linear Program Reformulation.

[BibT_eX]

[DOI]

CoRR, 2019

Discrete Trust-aware Matrix Factorization for Fast Recommendation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Decomposition Algorithm for the Sparse Generalized Eigenvalue Problem.

[BibT_eX]

[DOI]

Ganzhao Yuan