João Carreira

Orcid: 0000-0002-5711-5875

According to our database, João Carreira authored at least 93 papers between 1995 and 2024.


Bibliography

2024
BootsTAP: Bootstrapped Training for Tracking-Any-Point.
CoRR, 2024

Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

2023
Transframer: Arbitrary Frame Prediction with Generative Models.
Trans. Mach. Learn. Res., 2023

Perception Test 2023: A Summary of the First Challenge And Outcome.
CoRR, 2023

Learning from One Continuous Video Stream.
CoRR, 2023

Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video.
CoRR, 2023

Zorro: the masked multimodal transformer.
CoRR, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-supervised video pretraining yields robust and more human-aligned visual representations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Self-supervised video pretraining yields strong image representations.
CoRR, 2022

Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods.
CoRR, 2022

Hierarchical Perceiver.
CoRR, 2022

TAP-Vid: A Benchmark for Tracking Any Point in a Video.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

General-purpose, long-context autoregressive modeling with Perceiver AR.
Proceedings of the International Conference on Machine Learning, 2022

Perceiver IO: A General Architecture for Structured Inputs & Outputs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards Learning Universal Audio Representations.
Proceedings of the IEEE International Conference on Acoustics, 2022

Object Discovery and Representation Networks.
Proceedings of the Computer Vision - ECCV 2022, 2022

Input-level Inductive Biases for 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Compressed Vision for Efficient Video Understanding.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
What serverless computing is and should become: the next phase of cloud computing.
Commun. ACM, 2021

Perceiver: General Perception with Iterative Attention.
Proceedings of the 38th International Conference on Machine Learning, 2021

Attention-driven tile splitting method for improved efficiency of omnidirectional versatile video coding.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Efficient Visual Pretraining with Contrastive Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

From warm to hot starts: leveraging runtimes for the serverless era.
Proceedings of the HotOS '21: Workshop on Hot Topics in Operating Systems, 2021

Gradient Forward-Propagation for Large-Scale Temporal Video Modelling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
A Short Note on the Kinetics-700-2020 Human Action Dataset.
CoRR, 2020

The AVA-Kinetics Localized Human Actions Video Dataset.
CoRR, 2020

Complexity Estimation for Load Balancing of 360-Degree Intra Versatile Video Coding.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2020

Versatile Video Coding Of 360° Video Using Adaptive Resolution Change.
Proceedings of the IEEE International Conference on Image Processing, 2020

Visual Grounding in Video for Unsupervised Word Translation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Sideways: Depth-Parallel Training of Video Models.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Scalable Coding of 360-degree Video for Streaming Adaptation at 5G Network Edges.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2020

2019
Error Concealment-Aware Encoding for Robust Video Transmission.
IEEE Trans. Broadcast., 2019

A Short Note on the Kinetics-700 Human Action Dataset.
CoRR, 2019

Cloud Programming Simplified: A Berkeley View on Serverless Computing.
CoRR, 2019

Versatile Video Coding of 360-Degree Video using Frame-Based FoV and Visual Attention.
Proceedings of the IEEE International Symposium on Multimedia, 2019

Controllable Attention for Structured Layered Video Decomposition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Video Action Transformer Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

The Visual Centrifuge: Model-Free Layered Video Representations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Cirrus: a Serverless Framework for End-to-end ML Workflows.
Proceedings of the ACM Symposium on Cloud Computing, SoCC 2019, 2019

2018
Error resilience and concealment techniques for high-efficiency video coding.
PhD thesis, 2018

A Two-Stage Approach for Robust HEVC Coding and Streaming.
IEEE Trans. Circuits Syst. Video Technol., 2018

A Short Note about Kinetics-600.
CoRR, 2018

A Better Baseline for AVA.
CoRR, 2018

Massively Parallel Video Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Learning Category-Specific Deformable 3D Models for Object Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

The Kinetics Human Action Video Dataset.
CoRR, 2017

A robust video encoding scheme to enhance error concealment of intra frames.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Smart photovoltaic systems a cyber-physical systems/IoT approach: Architecture, implementation and pilot description.
Proceedings of the International Conference on Engineering, Technology and Innovation, 2017

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
The three R's of computer vision: Recognition, reconstruction and reorganization.
Pattern Recognit. Lett., 2016

Lifting Object Detection Datasets into 3D.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Network Requirements for Resource Disaggregation.
Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, 2016

Human Pose Estimation with Iterative Error Feedback.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Free-Form Region Description with Second-Order Pooling.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Shape and Symmetry Induction for 3D Objects.
CoRR, 2015

Reference picture selection using checkerboard pattern for resilient video coding.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Pose Induction for Novel Object Categories.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Amodal Completion and Size Constancy in Natural Scenes.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning to See by Moving.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Category-specific object reconstruction from a single image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Virtual view networks for object reconstruction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Probabilistic Joint Image Segmentation and Labeling by Figure-Ground Composition.
Int. J. Comput. Vis., 2014

Pedestrian detection combining RGB and dense LIDAR data.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Selective motion vector redundancies for improved error resilience in HEVC.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Dynamic motion vector refreshing for enhanced error resilience in HEVC.
Proceedings of the 22nd European Signal Processing Conference, 2014

Reconstructing PASCAL VOC.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Iterated Second-Order Label Sensitive Pooling for 3D Human Pose Estimation.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Beyond Hard Negative Mining: Efficient Detector Learning via Block-Circulant Decomposition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Composite Statistical Inference for Semantic Segmentation.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Bottom-up object segmentation for visual recognition.
PhD thesis, 2012

CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Object Recognition by Sequential Figure-Ground Ranking.
Int. J. Comput. Vis., 2012

Semantic Segmentation with Second-Order Pooling.
Proceedings of the Computer Vision - ECCV 2012, 2012

Frame loss concealment for 3D video decoders based on disparity-compensated motion field.
Proceedings of the 3DTV-Conference 2012: The True Vision, 2012

2011
Subjective quality factors in packet 3D video.
Proceedings of the Third International Workshop on Quality of Multimedia Experience, 2011

Probabilistic Joint Image Segmentation and Labeling.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Image segmentation by figure-ground composition into maximal cliques.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
Image Segmentation by Discounted Cumulative Ranking on Maximal Cliques.
CoRR, 2010

Subjective assessment of frame loss concealment methods in 3D video.
Proceedings of the Picture Coding Symposium, 2010

Object recognition as ranking holistic figure-ground hypotheses.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Constrained parametric min-cuts for automatic object segmentation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2008
Multicriteria decision analysis (MCDA): Central Porto high-speed railway station.
Eur. J. Oper. Res., 2008

2007
Retrieving and Exploiting Hand's Orientation in Tabletop Interaction.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Unraveling Natural Interfaces for Co-Located Groupware: Lessons Learned in an Experiment with Music.
J. Multim., 2006

A Vision Based Interface for Local Collaborative Music Synthesis.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

1998
Xception: A Technique for the Experimental Evaluation of Dependability in Modern Computers.
IEEE Trans. Software Eng., 1998

Why do some (weird) people inject faults?
ACM SIGSOFT Softw. Eng. Notes, 1998

Computer Science and the Pygmalion Effect.
Computer, 1998

1997
Implementing Tuple Space with Threads.
Proceedings of the IASTED International Conference on Parallel and Distributed Systems, 1997

1996
Experimental Assessment of Parallel Systems.
Proceedings of the Digest of Papers: FTCS-26, 1996

1995
WPVM: parallel computing for the people.
Proceedings of the High-Performance Computing and Networking, 1995

