Mitchell Wortsman

According to our database, Mitchell Wortsman authored at least 29 papers between 2019 and 2024.

Bibliography

2024
Language models scale reliably with over-training and on downstream tasks.
CoRR, 2024

OLMo: Accelerating the Science of Language Models.
CoRR, 2024

2023
lo-fi: distributed fine-tuning without communication.
Trans. Mach. Learn. Res., 2023

Small-scale proxies for large-scale Transformer training instabilities.
CoRR, 2023

Replacing softmax with ReLU in Vision Transformers.
CoRR, 2023

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models.
CoRR, 2023

The Role of Pre-training Data in Transfer Learning.
CoRR, 2023

Stable and low-precision training for large-scale vision-language models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


Editing models with task arithmetic.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Reproducible Scaling Laws for Contrastive Language-Image Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Editing Models with Task Arithmetic.
CoRR, 2022

CLIP on Wheels: Zero-Shot Object Navigation as Object Localization and Exploration.
CoRR, 2022

LAION-5B: An open large-scale dataset for training next generation image-text models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Patching open-vocabulary models by interpolating weights.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time.
Proceedings of the International Conference on Machine Learning, 2022

Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP).
Proceedings of the International Conference on Machine Learning, 2022

Exploring The Landscape of Distributional Robustness for Question Answering Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Robust fine-tuning of zero-shot models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Robust fine-tuning of zero-shot models.
CoRR, 2021

Learning Neural Network Subspaces.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Deconstructing the Structure of Sparse Neural Networks.
CoRR, 2020

Supermasks in Superposition.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Soft Threshold Weight Reparameterization for Learnable Sparsity.
Proceedings of the 37th International Conference on Machine Learning, 2020

What's Hidden in a Randomly Weighted Neural Network?
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Discovering Neural Wirings.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

