Thomas Mensink

Orcid: 0000-0002-5730-713X

According to our database1, Thomas Mensink authored at least 71 papers between 2007 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
How (not) to ensemble LVLMs for VQA.
CoRR, 2023

Scaling Vision Transformers to 22 Billion Parameters.
CoRR, 2023


Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Infinite Class Mixup.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Factors of Influence for Transfer Learning Across Diverse Appearance Domains and Task Types.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

The Missing Link: Finding Label Relations Across Datasets.
Proceedings of the Computer Vision - ECCV 2022, 2022

How Stable Are Transferability Metrics Evaluations?
Proceedings of the Computer Vision - ECCV 2022, 2022

Transferability Estimation using Bhattacharyya Class Separability.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Transferability Metrics for Selecting Source Model Ensembles.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Automatic generation of dense non-rigid optical flow.
Comput. Vis. Image Underst., 2021

EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Multi-Loss Weighting with Coefficient of Variations.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Neural Feature Matching in Implicit 3D Representations.
Proceedings of the 38th International Conference on Machine Learning, 2021

Calibration of Neural Networks using Splines.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
On the benefit of adversarial training for monocular depth estimation.
Comput. Vis. Image Underst., 2020

Post-hoc Calibration of Neural Networks.
CoRR, 2020

PointMixup: Augmentation for Point Clouds.
Proceedings of the Computer Vision - ECCV 2020, 2020

Range Conditioned Dilated Convolutions for Scale Invariant 3D Object Detection.
Proceedings of the 4th Conference on Robot Learning, 2020

Novel View Synthesis from Single Images via Point Cloud Transformation.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019
New Modality: Emoji Challenges in Prediction, Anticipation, and Retrieval.
IEEE Trans. Multim., 2019

IterGANs: Iterative GANs to learn and control 3D object transformation.
Comput. Vis. Image Underst., 2019

Interactive Exploration of Journalistic Video Footage through Multimodal Semantic Matching.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation.
Proceedings of the 2019 International Conference on 3D Vision, 2019

2018
Guest Editorial.
Comput. Vis. Image Underst., 2018

Unsupervised Generation of Optical Flow Datasets from Videos in the Wild.
CoRR, 2018

DeepNCM: Deep Nearest Class Mean Classifiers.
Proceedings of the 6th International Conference on Learning Representations, 2018

Iterative GANs for Rotating Visual Objects.
Proceedings of the 6th International Conference on Learning Representations, 2018

Three for one and one for three: Flow, Segmentation, and Surface Normals.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Video2vec Embeddings Recognize Events When Examples Are Scarce.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Music-Guided Video Summarization using Quadratic Assignments.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Spotting Audio-Visual Inconsistencies (SAVI) in Manipulated Video.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Online Open World Recognition.
CoRR, 2016

Learning to Reuse Visual Knowledge.
Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management, 2016

Pooling Objects for Recognizing Scenes without Examples.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Video Stream Retrieval of Unseen Queries using Semantic Memory.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
VideoStory Embeddings Recognize Events when Examples are Scarce.
CoRR, 2015

Image2Emoji: Zero-shot Emoji Prediction for Visual Media.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Query-by-Emoji Video Search.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Bag-of-Fragments: Selecting and Encoding Video Fragments for Event Detection and Recounting.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Discovering Semantic Vocabularies for Cross-Media Retrieval.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Latent Factors of Visual Popularity Prediction.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Objects2action: Classifying and Localizing Actions without Any Video Example.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future Tasks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Event Fisher Vectors: Robust Encoding Visual Diversity of Visual Streams.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Robustifying Descriptor Instability Using Fisher Vectors.
IEEE Trans. Image Process., 2014

MediaMill at TRECVID 2014: Searching Concepts, Objects, Instances and Events in Video.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014


VideoStory: A New Multimedia Embedding for Few-Example Recognition and Translation of Events.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The Rijksmuseum Challenge: Museum-Centered Visual Recognition.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Composite Concept Discovery for Zero-Shot Video Event Detection.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Attributes Make Sense on Segmented Objects.
Proceedings of the Computer Vision - ECCV 2014, 2014

COSTA: Co-Occurrence Statistics for Zero-Shot Classification.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Large Scale Metric Learning for Distance-Based Image Classification on Open Ended Data Sets.
Proceedings of the Advanced Topics in Computer Vision, 2013

Distance-Based Image Classification: Generalizing to New Classes at Near-Zero Cost.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Tree-Structured CRF Models for Interactive Image Labeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Image Classification with the Fisher Vector: Theory and Practice.
Int. J. Comput. Vis., 2013

2012
Learning Image Classification and Retrieval Models. (Apprentissage de Modèles pour la Classification et la Recherche d'Images).
PhD thesis, 2012

Face Recognition from Caption-Based Supervision.
Int. J. Comput. Vis., 2012

Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Learning structured prediction models for interactive image labeling.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Image annotation with tagprop on the MIRFLICKR set.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Improving the Fisher Kernel for Large-Scale Image Classification.
Proceedings of the Computer Vision, 2010

EP for Efficient Stochastic Control with Obstacles.
Proceedings of the ECAI 2010, 2010

LEAR and XRCE's Participation to Visual Concept Detection Task - ImageCLEF 2010.
Proceedings of the CLEF 2010 LABs and Workshops, 2010

Trans Media Relevance Feedback for Image Autoannotation.
Proceedings of the British Machine Vision Conference, 2010

2009
TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

INRIA-LEAR's Participation in ImageCLEF 2009.
Proceedings of the Working Notes for CLEF 2009 Workshop co-located with the 13th European Conference on Digital Libraries (ECDL 2009) , Corfù, Greece, September 30, 2009

2008
Improving People Search Using Query Expansions.
Proceedings of the Computer Vision, 2008

Automatic face naming with caption-based supervision.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Distributed EM Learning for Appearance Based Multi-Camera Tracking.
Proceedings of the 2007 First ACM/IEEE International Conference on Distributed Smart Cameras, 2007


  Loading...