Li Shen
Orcid: 0000-0001-5659-3464Affiliations:
- Sun Yat-sen University Shenzhen Campus, School of Cyber Science and Technology, Shenzhen, China
- JD Explore Academy, Beijing, China (2021 - 2024)
- Tencent, Shenzhen, China (2017 - 2021)
- South China University of Technology, Guangzhou, China (PhD 2017)
According to our database1,
Li Shen
authored at least 299 papers
between 2017 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025
Constraint Boundary Wandering Framework: Enhancing Constrained Optimization With Deep Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025
On Nonconvex SGD Under Unbounded Noise With Weak Gradient Lipschitz and Delayed Stochastic Gradient.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025
Int. J. Comput. Vis., August, 2025
Int. J. Comput. Vis., August, 2025
Graph Convolutional Mixture-of-Experts Learner Network for Long-Tailed Domain Generalization.
IEEE Trans. Circuits Syst. Video Technol., July, 2025
DFedADMM: Dual Constraint Controlled Model Inconsistency for Decentralize Federated Learning.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2025
Hyper-modal Imputation Diffusion Embedding with Dual-Distillation for Federated Multimodal Knowledge Graph Completion.
CoRR, June, 2025
CoRR, June, 2025
CoRR, June, 2025
CoRR, June, 2025
MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models.
CoRR, June, 2025
CoRR, June, 2025
CoRR, June, 2025
Revisiting Flatness-Aware Optimization in Continual Learning With Orthogonal Gradient Projection.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer.
CoRR, May, 2025
CoRR, May, 2025
Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging.
CoRR, May, 2025
Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought.
CoRR, May, 2025
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval.
CoRR, May, 2025
CoRR, May, 2025
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO.
CoRR, May, 2025
CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning.
CoRR, May, 2025
CoRR, May, 2025
Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities.
CoRR, May, 2025
IEEE Trans. Neural Networks Learn. Syst., April, 2025
CoRR, April, 2025
Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection.
CoRR, April, 2025
Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning.
CoRR, April, 2025
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025
ACM Comput. Surv., March, 2025
CoRR, March, 2025
CoRR, February, 2025
CoRR, February, 2025
CoRR, February, 2025
CoRR, February, 2025
Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging.
CoRR, February, 2025
Leveraging Reasoning with Guidelines to Elicit and Utilize Knowledge for Enhancing Safety Alignment.
CoRR, February, 2025
Winning Prize Comes from Losing Tickets: Improve Invariant Learning by Exploring Variant Parameters for Out-of-Distribution Generalization.
Int. J. Comput. Vis., January, 2025
TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs.
CoRR, January, 2025
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation.
CoRR, January, 2025
CoRR, January, 2025
Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging.
CoRR, January, 2025
CoRR, January, 2025
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning.
Trans. Mach. Learn. Res., 2025
Trans. Mach. Learn. Res., 2025
IEEE Trans. Image Process., 2025
DFedGFM: Pursuing global consistency for Decentralized Federated Learning via global flatness and global momentum.
Neural Networks, 2025
Communication-efficient distributed learning with Local Immediate Error Compensation.
Neural Networks, 2025
Code-switching finetuning: Bridging multilingual pretrained language models for enhanced cross-lingual performance.
Eng. Appl. Artif. Intell., 2025
Enhancing column generation by reinforcement learning-based hyper-heuristic for vehicle routing and scheduling problems.
Comput. Ind. Eng., 2025
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Edit Once, Update Everywhere: A Simple Framework for Cross-Lingual Knowledge Synchronization in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?
ACM Trans. Multim. Comput. Commun. Appl., December, 2024
Master-Slave Deep Architecture for Top-K Multiarmed Bandits With Nonlinear Bandit Feedback and Diversity Constraints.
IEEE Trans. Neural Networks Learn. Syst., December, 2024
IEEE Trans. Neural Networks Learn. Syst., December, 2024
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
On Transforming Reinforcement Learning With Transformers: The Development Trajectory.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
Neural-aware Decoupling Fusion based Personalized Federated Learning for Intelligent Sensing.
ACM Trans. Sens. Networks, November, 2024
ACM Trans. Knowl. Discov. Data, November, 2024
IEEE Trans. Circuits Syst. Video Technol., November, 2024
IEEE Trans. Neural Networks Learn. Syst., October, 2024
Efficient Federated Learning With Enhanced Privacy via Lottery Ticket Pruning in Edge Computing.
IEEE Trans. Mob. Comput., October, 2024
IEEE Trans. Artif. Intell., September, 2024
ACM Trans. Knowl. Discov. Data, July, 2024
Mach. Intell. Res., June, 2024
Messages are Never Propagated Alone: Collaborative Hypergraph Neural Network for Time-Series Forecasting.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024
Mach. Learn., April, 2024
AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks.
Neural Networks, January, 2024
Joint Admission Control and Resource Allocation of Virtual Network Embedding via Hierarchical Deep Reinforcement Learning.
IEEE Trans. Serv. Comput., 2024
Adv. Math. Commun., 2024
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search.
CoRR, 2024
CoRR, 2024
Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs.
CoRR, 2024
CoRR, 2024
DiM: <i>f</i>-Divergence Minimization Guided Sharpness-Aware Optimization for Semi-supervised Medical Image Segmentation.
CoRR, 2024
Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging.
CoRR, 2024
CoRR, 2024
SurgeryV2: Bridging the Gap Between Model Merging and Multi-Task Learning with Deep Representation Surgery.
CoRR, 2024
Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace.
CoRR, 2024
Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models.
CoRR, 2024
Targeted Vaccine: Safety Alignment for Large Language Models against Harmful Fine-Tuning via Layer-wise Perturbation.
CoRR, 2024
Boosting the Performance of Decentralized Federated Learning via Catalyst Acceleration.
CoRR, 2024
OledFL: Unleashing the Potential of Decentralized Federated Learning via Opposite Lookahead Enhancement.
CoRR, 2024
DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion.
CoRR, 2024
USCD: Improving Code Generation of LLMs by Uncertainty-Aware Selective Contrastive Decoding.
CoRR, 2024
Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal.
CoRR, 2024
Convergent Differential Privacy Analysis for General Federated Learning: the f-DP Perspective.
CoRR, 2024
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models.
CoRR, 2024
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models.
CoRR, 2024
CoRR, 2024
Byzantine-resilient Federated Learning Employing Normalized Gradients on Non-IID Datasets.
CoRR, 2024
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities.
CoRR, 2024
(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork.
CoRR, 2024
Towards Efficient Pareto Set Approximation via Mixture of Experts Based Model Fusion.
CoRR, 2024
CoRR, 2024
CoRR, 2024
A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
A Huber Loss Minimization Approach to Mean Estimation under User-level Differential Privacy.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Your Transferability Barrier is Fragile: Free-Lunch for Transferring the Non-Transferable Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Knowl. Data Eng., December, 2023
Distributionally Robust Memory Evolution With Generalized Divergence for Continual Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023
Efficient Federated Learning Via Local Adaptive Amended Optimizer With Linear Speedup.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023
IEICE Trans. Inf. Syst., December, 2023
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023
Int. J. Comput. Vis., October, 2023
IEEE Trans. Circuits Syst. Video Technol., August, 2023
Differentiable Neural Architecture Search for Extremely Lightweight Image Super-Resolution.
IEEE Trans. Circuits Syst. Video Technol., June, 2023
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
Pattern Recognit., May, 2023
IEEE Trans. Signal Process., 2023
Trans. Mach. Learn. Res., 2023
Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion.
CoRR, 2023
Rethinking SIGN Training: Provable Nonconvex Acceleration without First- and Second-Order Gradient Lipschitz.
CoRR, 2023
CoRR, 2023
CoRR, 2023
FedLALR: Client-Specific Adaptive Learning Rates Achieve Linear Speedup for Non-IID Data.
CoRR, 2023
Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints.
CoRR, 2023
CoRR, 2023
DFedADMM: Dual Constraints Controlled Model Inconsistency for Decentralized Federated Learning.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning.
CoRR, 2023
Towards More Suitable Personalization in Federated Learning via Decentralized Partial Model Training.
CoRR, 2023
Towards the Flatter Landscape and Better Generalization in Federated Learning under Client-level Differential Privacy.
CoRR, 2023
CoRR, 2023
SGDA: Towards 3D Universal Pulmonary Nodule Detection via Slice Grouped Domain Attention.
CoRR, 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System.
CoRR, 2023
Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE.
CoRR, 2023
Enhance Local Consistency in Federated Learning: A Multi-Step Inertial Momentum Approach.
CoRR, 2023
SaFormer: A Conditional Sequence Modeling Approach to Offline Safe Reinforcement Learning.
CoRR, 2023
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023
Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Defending against Data-Free Model Extraction by Distributionally Robust Defensive Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Understanding How Consistency Works in Federated Learning via Stage-wise Relaxed Initialization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
FlatMatch: Bridging Labeled Data and Unlabeled Data with Cross-Sharpness for Semi-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
CoCo: A Coupled Contrastive Framework for Unsupervised Domain Adaptive Graph Classification.
Proceedings of the International Conference on Machine Learning, 2023
Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Towards One-shot Neural Combinatorial Solvers: Theoretical and Empirical Notes on the Cardinality-Constrained Case.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization Accuracy.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
MetaMix: Towards Corruption-Robust Continual Learning with Temporally Self-Adaptive Data Transformation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Robust Generalization Against Photon-Limited Corruptions via Worst-Case Sharpness Minimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Conference on Lifelong Learning Agents, 2023
Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
AlphaGAN: Fully Differentiable Architecture Search for Generative Adversarial Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Informative pairs mining based adaptive metric learning for adversarial domain adaptation.
Neural Networks, 2022
Mach. Learn., 2022
IEEE Internet Things J., 2022
CoRR, 2022
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE.
CoRR, 2022
SafeRL-Kit: Evaluating Efficient Reinforcement Learning Methods for Safe Autonomous Driving.
CoRR, 2022
Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation.
CoRR, 2022
CoRR, 2022
Proceedings of the Uncertainty in Artificial Intelligence, 2022
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Deep Neural Network Fusion via Graph Matching with Applications to Model Ensemble and Federated Learning.
Proceedings of the International Conference on Machine Learning, 2022
DisPFL: Towards Communication-Efficient Personalized Federated Learning via Decentralized Sparse Training.
Proceedings of the International Conference on Machine Learning, 2022
The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
On the Complementarity between Pre-Training and Random-Initialization for Resource-Rich Machine Translation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
2021
IEEE Trans. Image Process., 2021
IEEE Signal Process. Lett., 2021
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Int. J. Comput. Vis., 2020
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020
Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017