Antoine Miech

According to our database, Antoine Miech authored at least 23 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames.
CoRR, 2023

Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime.
CoRR, 2023

Zorro: the masked multimodal transformer.
CoRR, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Multi-Task Learning of Object State Changes from Uncurated Videos.
CoRR, 2022

Learning to Answer Visual Questions from Web Videos.
CoRR, 2022

Flamingo: a Visual Language Model for Few-Shot Learning.
CoRR, 2022

Zero-Shot Video Question Answering via Frozen Bidirectional Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


TubeDETR: Spatio-Temporal Video Grounding with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Just Ask: Learning to Answer Questions from Millions of Narrated Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Large-scale Learning from Video and Natural Language. (Apprentissage vidéo et langage naturel à grande échelle).
PhD thesis, 2020

RareAct: A video dataset of unusual interactions.
CoRR, 2020

The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020).
CoRR, 2020

End-to-End Learning of Visual Representations From Uncurated Instructional Videos.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Leveraging the Present to Anticipate the Future in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data.
CoRR, 2018

2017
Learnable pooling with Context Gating for video classification.
CoRR, 2017

Learning from Video and Text via Large-Scale Discriminative Clustering.
Proceedings of the IEEE International Conference on Computer Vision, 2017

