Lucas Caccia

According to our database, Lucas Caccia authored at least 29 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Exploring Sparse Adapters for Scalable Merging of Parameter Efficient Experts.
CoRR, July, 2025

debug-gym: A Text-Based Environment for Interactive Debugging.
CoRR, March, 2025

Training Plug-n-Play Knowledge Modules with Deep Context Distillation.
CoRR, March, 2025

Generative Modeling of Individual Behavior at Scale.
CoRR, February, 2025

A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning.
Trans. Mach. Learn. Res., 2025

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Towards Modular LLMs by Building and Reusing a Library of LoRAs.
CoRR, 2024

Towards Modular LLMs by Building and Reusing a Library of LoRAs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Guiding Language Model Reasoning with Planning Tokens.
CoRR, 2023

Multi-Head Adapter Routing for Cross-Task Generalization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Guiding The Last Layer in Federated Learning with Pre-Trained Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Building a Subspace of Policies for Scalable Continual Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Re-Weighted Softmax Cross-Entropy to Control Forgetting in Federated Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023

2022
Multi-Head Adapter Routing for Data-Efficient Fine-Tuning.
CoRR, 2022

New Insights on Reducing Abrupt Representation Change in Online Continual Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

On Anytime Learning at Macroscale.
Proceedings of the Conference on Lifelong Learning Agents, 2022

2021
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning.
CoRR, 2021

Reducing Representation Drift in Online Continual Learning.
CoRR, 2021

SPeCiaL: Self-supervised Pretraining for Continual Learning.
Proceedings of the Continual Semi-Supervised Learning - First International Workshop, 2021

2020
Online Fast Adaptation and Knowledge Accumulation: a New Approach to Continual Learning.
CoRR, 2020

Online Fast Adaptation and Knowledge Accumulation (OSAKA): a New Approach to Continual Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Online Learned Continual Compression with Adaptive Quantization Modules.
Proceedings of the 37th International Conference on Machine Learning, 2020

Language GANs Falling Short.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Online Learned Continual Compression with Stacked Quantization Module.
CoRR, 2019

Online Continual Learning with Maximally Interfered Retrieval.
CoRR, 2019

Recurrent Value Functions.
CoRR, 2019

Online Continual Learning with Maximal Interfered Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Deep Generative Modeling of LiDAR Data.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

2018
Clustering-Oriented Representation Learning with Attractive-Repulsive Loss.
CoRR, 2018

