Lucas Beyer

Orcid: 0000-0002-0460-0607

According to our database1, Lucas Beyer authored at least 53 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
LocCa: Visual Pretraining with Location-aware Captioners.
CoRR, 2024

2023
PaLI-3 Vision Language Models: Smaller, Faster, Stronger.
CoRR, 2023

PaLI-X: On Scaling up a Multilingual Vision and Language Model.
CoRR, 2023

Three Towers: Flexible Contrastive Learning with Pretrained Image Models.
CoRR, 2023

A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision.
CoRR, 2023

Scaling Vision Transformers to 22 Billion Parameters.
CoRR, 2023

Image Captioners Are Scalable Vision Learners Too.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Three Towers: Flexible Contrastive Learning with Pretrained Image Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Tuning Computer Vision Models With Task Rewards.
Proceedings of the International Conference on Machine Learning, 2023


PaLI: A Jointly-Scaled Multilingual Language-Image Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Sigmoid Loss for Language Image Pre-Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FlexiViT: One Model for All Patch Sizes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Deep visual human sensing with application in robotics.
PhD thesis, 2022

How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers.
Trans. Mach. Learn. Res., 2022

VeLO: Training Versatile Learned Optimizers by Scaling Up.
CoRR, 2022

PaLI: A Jointly-Scaled Multilingual Language-Image Model.
CoRR, 2022

Better plain ViT baselines for ImageNet-1k.
CoRR, 2022

UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Efficiency Misnomer.
Proceedings of the Tenth International Conference on Learning Representations, 2022

A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

LiT: Zero-Shot Transfer with Locked-image text Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scaling Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


Knowledge distillation: A good teacher is patient and consistent.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation.
CoRR, 2021

SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size.
CoRR, 2021

MLP-Mixer: An all-MLP Architecture for Vision.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale.
Proceedings of the 9th International Conference on Learning Representations, 2021

On Robustness and Transferability of Convolutional Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Are we done with ImageNet?
CoRR, 2020

Big Transfer (BiT): General Visual Representation Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Large Scale Learning of General Visual Representations for Transfer.
CoRR, 2019

The Visual Task Adaptation Benchmark.
CoRR, 2019

MULEX: Disentangling Exploitation from Exploration in Deep RL.
CoRR, 2019

Deep multi-class learning from label proportions.
CoRR, 2019

S<sup>4</sup>L: Self-Supervised Semi-Supervised Learning.
CoRR, 2019

S4L: Self-Supervised Semi-Supervised Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Revisiting Self-Supervised Visual Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Deep Person Detection in Two-Dimensional Range Data.
IEEE Robotics Autom. Lett., 2018

Deep Person Detection in 2D Range Data.
CoRR, 2018

Detection- Tracking for Efficient Person Analysis: The DetTA Pipeline.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2017
The STRANDS Project: Long-Term Autonomy in Everyday Environments.
IEEE Robotics Autom. Mag., 2017

DROW: Real-Time Deep Learning-Based Wheelchair Detection in 2-D Range Data.
IEEE Robotics Autom. Lett., 2017

The Atari Grand Challenge Dataset.
CoRR, 2017

In Defense of the Triplet Loss for Person Re-Identification.
CoRR, 2017

Towards a Principled Integration of Multi-camera Re-identification and Tracking Through Optimal Bayes Filters.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
DROW: Real-Time Deep Learning based Wheelchair Detection in 2D Range Data.
CoRR, 2016

2015

Biternion Nets: Continuous Head Pose Regression from Discrete Training Labels.
Proceedings of the Pattern Recognition - 37th German Conference, 2015

2013
Streaming Data from HDD to GPUs for Sustained Peak Performance
CoRR, 2013

GWAS on GPUs: Streaming Data from HDD for Sustained Performance.
Proceedings of the Euro-Par 2013 Parallel Processing, 2013


  Loading...