Luc Van Gool

Affiliations:
  • ETH Zurich, Computer Vision Lab
  • KU Leuven, Center for Processing Speech and Images (PSI-VISICS)


According to our database, Luc Van Gool authored at least 1,216 papers between 1984 and 2024.

Bibliography

2024
Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

Loopy-SLAM: Dense Neural SLAM with Loop Closures.
CoRR, 2024

Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector.
CoRR, 2024

Key-Graph Transformer for Image Restoration.
CoRR, 2024

Image Fusion via Vision-Language Model.
CoRR, 2024

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes.
CoRR, 2024

MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty.
CoRR, 2024

Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation.
CoRR, 2024

InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes.
CoRR, 2024

Learning to Prompt with Text Only Supervision for Vision-Language Models.
CoRR, 2024

2023
Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis.
Mach. Intell. Res., December, 2023

Discwise Active Learning for LiDAR Semantic Segmentation.
IEEE Robotics Autom. Lett., November, 2023

How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges.
Mach. Intell. Res., October, 2023

GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Global Aligned Structured Sparsity Learning for Efficient Image Super-Resolution.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

PDC-Net+: Enhanced Probabilistic Dense Correspondence Network.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Improving Semi-Supervised and Domain-Adaptive Semantic Segmentation with Self-Supervised Depth Estimation.
Int. J. Comput. Vis., August, 2023

A Survey on Deep Learning Technique for Video Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Advancing Learned Video Compression With In-Loop Frame Prediction.
IEEE Trans. Circuits Syst. Video Technol., May, 2023

Unsupervised Template Warp Consistency for Implicit Surface Correspondences.
Comput. Graph. Forum, May, 2023

An Efficient Recurrent Adversarial Framework for Unsupervised Real-Time Video Enhancement.
Int. J. Comput. Vis., April, 2023

Active Perception for Visual-Language Navigation.
Int. J. Comput. Vis., March, 2023

Binaural SoundNet: Predicting Semantics, Depth and Motion With Binaural Sounds.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Masked Vision-language Transformer in Fashion.
Int. J. Autom. Comput., 2023

Deep Gradient Learning for Efficient Camouflaged Object Detection.
Int. J. Autom. Comput., 2023

Residual Learning for Image Point Descriptors.
CoRR, 2023

Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM.
CoRR, 2023

Diffusion-Based Particle-DETR for BEV Perception.
CoRR, 2023

G-MEMP: Gaze-Enhanced Multimodal Ego-Motion Prediction in Driving.
CoRR, 2023

DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control.
CoRR, 2023

Zero-Shot Point Cloud Registration.
CoRR, 2023

LALM: Long-Term Action Anticipation with Language Models.
CoRR, 2023

Continuous Pose for Monocular Cameras in Neural Implicit Representation.
CoRR, 2023

SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance.
CoRR, 2023

Single-Model and Any-Modality for Video Object Tracking.
CoRR, 2023

2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation.
CoRR, 2023

Lego: Learning to Disentangle and Invert Concepts Beyond Object Appearance in Text-to-Image Diffusion Models.
CoRR, 2023

3D Compression Using Neural Fields.
CoRR, 2023

Deep Equilibrium Diffusion Restoration with Parallel Sampling.
CoRR, 2023

MoVideo: Motion-Aware Video Generation with Diffusion Models.
CoRR, 2023

Contrastive Learning for Multi-Object Tracking with Transformers.
CoRR, 2023

Learning Robust Multi-Scale Representation for Neural Radiance Fields from Unposed Images.
CoRR, 2023

Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences.
CoRR, 2023

Towards High-quality HDR Deghosting with Conditional Diffusion Models.
CoRR, 2023

Revisiting Evaluation Metrics for Semantic Segmentation: Optimization and Evaluation of Fine-grained Intersection over Union.
CoRR, 2023

SILC: Improving Vision Language Pretraining with Self-Distillation.
CoRR, 2023

Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose Encoding.
CoRR, 2023

Probabilistic Sampling of Balanced K-Means using Adiabatic Quantum Computing.
CoRR, 2023

Three Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding.
CoRR, 2023

DiffI2I: Efficient Diffusion Model for Image-to-Image Translation.
CoRR, 2023

When Super-Resolution Meets Camouflaged Object Detection: A Comparison Study.
CoRR, 2023

AutoDecoding Latent 3D Diffusion Models.
CoRR, 2023

Prompting Diffusion Representations for Cross-Domain Semantic Segmentation.
CoRR, 2023

Palm: Predicting Actions through Language Models @ Ego4D Long-Term Action Anticipation Challenge 2023.
CoRR, 2023

Condition-Invariant Semantic Segmentation.
CoRR, 2023

Equivariant Multi-Modality Image Fusion.
CoRR, 2023

StyleGenes: Discrete and Efficient Latent Distributions for GANs.
CoRR, 2023

Neural Implicit Dense Semantic SLAM.
CoRR, 2023

Advances in Deep Concealed Scene Understanding.
CoRR, 2023

SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders.
CoRR, 2023

SAM Struggles in Concealed Scenes - Empirical Study on "Segment Anything".
CoRR, 2023

CamDiff: Camouflage Image Augmentation via Diffusion Model.
CoRR, 2023

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration.
CoRR, 2023

Spatio-Temporal Action Detection Under Large Motion.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Spatially Multi-conditional Image Generation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Jointly Learning Band Selection and Filter Array Design for Hyperspectral Imaging.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Learning Attention Propagation for Compositional Zero-Shot Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Multi-View Photometric Stereo Revisited.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Composite Learning for Robust and Effective Dense Predictions.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Barlow constrained optimization for Visual Question Answering.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Fast Online Video Super-Resolution with Deformable Attention Pyramid.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Refign: Align and Refine for Adaptation of Semantic Segmentation to Adverse Conditions.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Efficient Visual Tracking with Exemplar Transformers.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Padding Investigations for CNNs in Scene Parsing Tasks.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

MultiVT: Multiple-Task Framework for Dentistry.
Proceedings of the Domain Adaptation and Representation Transfer - 5th MICCAI Workshop, 2023

Prior Based Online Lane Graph Extraction from Single Onboard Camera Image.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023

Online Lane Graph Extraction from Onboard Video.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023

HRFuser: A Multi-Resolution Sensor Fusion Architecture for 2D Object Detection.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023

Model-aware 3D Eye Gaze from Weak and Few-shot Supervisions.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality Adjunct, 2023

LocalViT: Analyzing Locality in Vision Transformers.
IROS, 2023

A Multiplicative Value Function for Safe and Efficient Reinforcement Learning.
IROS, 2023

TrafficBots: Towards World Models for Autonomous Driving Simulation and Motion Prediction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

L2E: Lasers to Events for 6-DoF Extrinsic Calibration of Lidars and Event Cameras.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Learning continuous piecewise non-linear activation functions for deep neural networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Knowledge Distillation based Degradation Estimation for Blind Super-Resolution.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Basic Binary Convolution Unit for Binarized Image Restoration Network.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

VA-DepthNet: A Variational Approach to Single Image Depth Prediction.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Token-Consistent Dropout For Calibrated Vision Transformers.
Proceedings of the IEEE International Conference on Image Processing, 2023

NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UncLe-SLAM: Uncertainty Learning for Dense Neural SLAM.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

The First Visual Object Tracking Segmentation VOTS2023 Challenge Results.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Spatio-Temporal Convolution-Attention Video Network.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Spherical Space Feature Decomposition for Guided Depth Map Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffIR: Efficient Diffusion Model for Image Restoration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Source-free Depth for Object Pop-out.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dreamwalker: Mental Planning for Continuous Vision-Language Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Point-SLAM: Dense Neural Point Cloud-based SLAM.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Surface Normal Clustering for Implicit Representation of Manhattan Scenes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Deformable Neural Radiance Fields using RGB and Event Cameras.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Introducing Language Guidance in Prompt-based Continual Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Online Lane Graph Extraction by Object-Lane Clustering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffDreamer: Towards Consistent Unsupervised Single-view Scene Extrapolation with Conditional Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Contrastive Model Adaptation for Cross-Condition Robustness in Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-Supervised Burst Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

EDAPS: Enhanced Domain-Adaptive Panoptic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Denoising Diffusion Models for Plug-and-Play Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Structured Sparsity Learning for Efficient Video Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Event-Based Frame Interpolation with Ad-hoc Deblurring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Indiscernible Object Counting in Underwater Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unbalanced Optimal Transport: A Unified Framework for Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Single Image Depth Prediction Made Better: A Multivariate Gaussian Take.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NTIRE 2023 Challenge on Efficient Super-Resolution: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023


LSDIR: A Large Scale Dataset for Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Efficient and Explicit Modelling of Image Hierarchies for Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Enhanced Stable View Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Continuous Pseudo-Label Rectified Domain Adaptive Semantic Segmentation with Implicit Neural Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Quantum Annealing for Single Image Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Graph Transformer GANs for Graph-Constrained House Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SF-FSDA: Source-Free Few-Shot Domain Adaptive Object Detection with Efficient Labeled Data Factory.
Proceedings of the Conference on Lifelong Learning Agents, 2023

Replay-Based Online Adaptation for Unsupervised Deep Visual Odometry.
Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Breathing New Life into 3D Assets with Generative Repainting.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Learnable Online Graph Representations for 3D Multi-Object Tracking.
IEEE Robotics Autom. Lett., 2022

End-to-End Optimization of LiDAR Beam Configuration for 3D Object Detection and Localization.
IEEE Robotics Autom. Lett., 2022

Improving Depth Estimation Using Map-Based Depth Priors.
IEEE Robotics Autom. Lett., 2022

A Real-Time Online Learning Framework for Joint 3D Reconstruction and Semantic Segmentation of Indoor Scenes.
IEEE Robotics Autom. Lett., 2022

Understanding Bird's-Eye View of Road Semantics Using an Onboard Camera.
IEEE Robotics Autom. Lett., 2022

Plug-and-Play Image Restoration With Deep Denoiser Prior.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Multi-Task Learning for Dense Prediction Tasks: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Map-Guided Curriculum Domain Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Towards a Weakly Supervised Framework for 3D Point Cloud Object Detection and Annotation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Segmenting Objects From Relational Visual Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Towards Partial Supervision for Generic Object Counting in Natural Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Video Polyp Segmentation: A Deep Learning Perspective.
Int. J. Autom. Comput., 2022

Facial-sketch Synthesis: A New Challenge.
Int. J. Autom. Comput., 2022

Beyond SOT: It's Time to Track Multiple Generic Objects at Once.
CoRR, 2022

One-Shot Domain Adaptive and Generalizable Semantic Segmentation with Class-Aware Cross-Domain Transformers.
CoRR, 2022

CamoFormer: Masked Separable Attention for Camouflaged Object Detection.
CoRR, 2022

Neural Radiance Fields for Manhattan Scenes with Unknown Manhattan Frame.
CoRR, 2022

DiffDreamer: Consistent Single-view Perpetual View Generation with Conditional Diffusion Models.
CoRR, 2022

Piecewise Planar Hulls for Semi-Supervised Learning of 3D Shape and Pose from 2D Images.
CoRR, 2022

TT-NF: Tensor Train Neural Fields.
CoRR, 2022

Practical Real Video Denoising with Realistic Degradation Model.
CoRR, 2022

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility.
CoRR, 2022

Lasers to Events: Automatic Extrinsic Calibration of Lidars and Event Cameras.
CoRR, 2022

3D-Aware Video Generation.
CoRR, 2022

Residual Sparsity Connection Learning for Efficient Video Super-Resolution.
CoRR, 2022

Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation.
CoRR, 2022

Gradient Obfuscation Checklist Test Gives a False Sense of Security.
CoRR, 2022

GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector.
CoRR, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.
CoRR, 2022

Neural Vector Fields for Surface Representation and Inference.
CoRR, 2022

Deep Interactive Motion Prediction and Planning: Playing Games with Motion Prediction Models.
CoRR, 2022

Video Polyp Segmentation: A Deep Learning Perspective.
CoRR, 2022

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis.
CoRR, 2022

Revisiting Deep Semi-supervised Learning: An Empirical Distribution Alignment Framework and Its Generalization Bound.
CoRR, 2022

Pix2NeRF: Unsupervised Conditional π-GAN for Single Image to Neural Radiance Fields Translation.
CoRR, 2022

VRT: A Video Restoration Transformer.
CoRR, 2022

Revisiting RCAN: Improved Training for Image Super-Resolution.
CoRR, 2022

Neural Architecture Search for Efficient Uncalibrated Deep Photometric Stereo.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Normalizing Flow as a Flexible Fidelity Objective for Photo-Realistic Super-resolution.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Hyperspectral Image Super-Resolution with RGB Image Super-Resolution as an Auxiliary Task.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Towards Unsupervised Online Domain Adaptation for Semantic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

Neural Radiance Fields Approach to Deep Multi-View Photometric Stereo.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Unsupervised Robust Domain Adaptation without Source Data.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Towards Versatile Embodied Navigation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Recurrent Video Restoration Transformer with Guided Deformable Attention.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Deep Interactive Motion Prediction and Planning: Playing Games with Motion Prediction Models.
Proceedings of the Learning for Dynamics and Control Conference, 2022

Perceptual Learned Video Compression with Recurrent Conditional GAN.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

PyNet-V2 Mobile: Efficient On-Device Photo Processing With Neural Networks.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

On the Practicality of Deterministic Epistemic Uncertainty.
Proceedings of the International Conference on Machine Learning, 2022

Unsupervised Flow-Aligned Sequence-to-Sequence Learning for Video Restoration.
Proceedings of the International Conference on Machine Learning, 2022

Flow-Guided Sparse Transformer for Video Deblurring.
Proceedings of the International Conference on Machine Learning, 2022

Collapse by Conditioning: Training Class-conditional GANs with Limited Data.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Zero Pixel Directional Boundary by Vector Transform.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Transform Your Smartphone into a DSLR Camera: Learning the ISP in the Wild.
Proceedings of the Computer Vision - ECCV 2022, 2022

Event-Based Fusion for Motion Deblurring with Cross-modal Attention.
Proceedings of the Computer Vision - ECCV 2022, 2022

Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Implicit Neural Representations for Image Compression.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Online Multi-sensor Depth Fusion.
Proceedings of the Computer Vision - ECCV 2022, 2022

Highly Accurate Dichotomous Image Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

Robust Visual Tracking by Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

3D Compositional Zero-Shot Learning with DeCompositional Consensus.
Proceedings of the Computer Vision - ECCV 2022, 2022

Organic Priors in Non-rigid Structure from Motion.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

MicroISP: Processing 32MP Photos on Mobile Devices with Deep Learning.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Style Adaptive Semantic Image Editing with Transformers.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Interpretable Video Super-Resolution via Alternating Optimization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Reference-Based Image Super-Resolution with Deformable Attention Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction.
Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking Semantic Segmentation: A Prototype View.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Adiabatic Quantum Computing for Multi Object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Sound and Visual Representation Learning with Multiple Pretraining Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scribble-Supervised LiDAR Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Coarse-to-Fine Feature Mining for Video Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Generative Flows with Invertible Attentions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unpaired Real-World Super-Resolution with Pseudo Controllable Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Arbitrary-Scale Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TripletTrack: 3D Object Tracking using Triplet Embeddings and LSTM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

RePaint: Inpainting using Denoising Diffusion Probabilistic Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Revisiting Random Channel Pruning for Neural Network Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Uncertainty-Aware Deep Multi-View Photometric Stereo.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

HDNet: High-resolution Dual-domain Learning for Spectral Compressive Imaging.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

LiDAR Snowfall Simulation for Robust 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Topology Preserving Local Road Network Estimation from Single Onboard Camera Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Pix2NeRF: Unsupervised Conditional π-GAN for Single Image to Neural Radiance Fields Translation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


Continual Test-Time Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Transforming Model Prediction for Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Lidar Line Selection with Spatially-Aware Shapley Value for Cost-Efficient Depth Completion.
Proceedings of the Conference on Robot Learning, 2022

SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Robustifying the Multi-Scale Representation of Neural Radiance Fields.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

End-to-End Learning of Multi-category 3D Pose and Shape Estimation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Training Dynamics Aware Neural Network Optimization with Stabilization.
Proceedings of the Computer Vision - ACCV 2022, 2022

ManiFlow: Implicitly Representing Manifolds with Normalizing Flows.
Proceedings of the International Conference on 3D Vision, 2022

2021
FoV-Net: Field-of-View Extrapolation Using Self-Attention and Uncertainty.
IEEE Robotics Autom. Lett., 2021

Show me where the action is!
Multim. Tools Appl., 2021

Learning for Video Compression With Recurrent Auto-Encoder and Recurrent Probability Model.
IEEE J. Sel. Top. Signal Process., 2021

Guest Editorial: Special Issue on Deep Learning for Video Analysis and Compression.
Int. J. Comput. Vis., 2021

Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory.
Int. J. Comput. Vis., 2021

DLOW: Domain Flow and Applications.
Int. J. Comput. Vis., 2021

Guest Editorial: Special Issue on "Computer Vision for All Seasons: Adverse Weather and Lighting Conditions".
Int. J. Comput. Vis., 2021

Scale-Aware Domain Adaptive Faster R-CNN.
Int. J. Comput. Vis., 2021

Deep Facial Synthesis: A New Challenge.
CoRR, 2021

Stochastic Layers in Vision Transformers.
CoRR, 2021

MEFNet: Multi-scale Event Fusion Network for Motion Deblurring.
CoRR, 2021

Global and Local Alignment Networks for Unpaired Image-to-Image Translation.
CoRR, 2021

Context-aware Padding for Semantic Segmentation.
CoRR, 2021

TADA: Taxonomy Adaptive Domain Adaptation.
CoRR, 2021

Boosting Few-shot Semantic Segmentation with Transformers.
CoRR, 2021

On the Practicality of Deterministic Epistemic Uncertainty.
CoRR, 2021

Video Super-Resolution Transformer.
CoRR, 2021

Transformer in Convolutional Neural Networks.
CoRR, 2021

Boosting Crowd Counting with Transformers.
CoRR, 2021

LocalViT: Bringing Locality to Vision Transformers.
CoRR, 2021

Hyperspectral Image Super-Resolution with Spectral Mixup and Heterogeneous Datasets.
CoRR, 2021

Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution.
CoRR, 2021

Fine-Grained Cross-Modal Retrieval for Cultural Items with Focal Attention and Hierarchical Encodings.
Comput., 2021

Facial Emotion Recognition with Noisy Multi-task Annotations.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Improving Point Cloud Semantic Segmentation by Learning 3D Object Detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Zero-Pair Image to Image Translation using Domain Conditional Normalization.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

CoMoDA: Continuous Monocular Depth Adaptation Using Past Experiences.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Decoder Fusion RNN: Context and Interaction Aware Decoders for Trajectory Prediction.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Local Memory Attention for Fast Video Semantic Segmentation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Neural Architecture Search of SPD Manifold Networks.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Fast Few-Shot Classification by Few-Iteration Meta-Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Learning from Simulation, Racing in Reality.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

A Deep Learning Method for Frame Selection in Videos for Structure from Motion Pipelines.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

SMILE: Semantically-guided Multi-attribute Image and Layout Editing.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

SwinIR: Image Restoration Using Swin Transformer.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

The Ninth Visual Object Tracking VOT2021 Challenge Results.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

MonoCInIS: Camera Independent Monocular 3D Object Detection using Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Generalized Real-World Super-Resolution through Adversarial Robustness.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

End-to-End Urban Driving by Imitating a Reinforcement Learning Coach.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Exploring Cross-Image Pixel Contrast for Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Warp Consistency for Unsupervised Learning of Dense Correspondences.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Task Switching Network for Multi-task Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ACDC: The Adverse Conditions Dataset with Correspondences for Semantic Driving Scene Understanding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Exploring Geometry-aware Contrast and Clustering Harmonization for Self-supervised 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Towards Efficient Graph Convolutional Networks for Point Cloud Handling.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Scaling Semantic Segmentation Beyond 1K Classes on a Single GPU.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

mDALU: Multi-Source Domain Adaptation and Label Unification with Partial Datasets.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Fourier Space Losses for Efficient Perceptual Image Super-Resolution.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Vi<sup>2</sup>CLR: Video and Image for Visual Contrastive Learning of Representation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Structured Bird's-Eye-View Traffic Scene Understanding from Onboard Images.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Exploring Relational Context for Multi-Task Dense Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Deep Reparametrization of Multi-Frame Super-Resolution and Denoising.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Designing a Practical Degradation Model for Deep Blind Image Super-Resolution.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Target Candidate Association to Keep Track of What Not to Track.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Compound Domain Adaptation for Face Anti-Spoofing.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

GANmut: Learning Interpretable Conditional Space for Gamut of Emotions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DeFlow: Learning Complex Image Degradations From Unpaired Data With Conditional Flows.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Shadow Removal With Paired and Unpaired Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Learning Accurate Dense Correspondences and When To Trust Them.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Efficient Conditional GAN Transfer With Knowledge Propagation Across Classes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning To Relate Depth and Semantics for Unsupervised Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

CompositeTasking: Understanding Images by Spatial Composition of Tasks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Flow-Based Kernel Prior With Application to Blind Super-Resolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Uncalibrated Neural Inverse Rendering for Photometric Stereo of General Surfaces.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Three Ways To Improve Semantic Segmentation With Self-Supervised Depth Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

3D CNNs With Adaptive Temporal Feature Resolutions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Deep Burst Super-Resolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Deep Line Encoding for Monocular 3D Object Detection and Depth Prediction.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Spectral Tensor Train Parameterization of Deep Learning Layers.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Neural Architecture Search as Sparse Supernet.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Analogical Image Translation for Fog Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Unsupervised Monocular Depth Reconstruction of Non-Rigid Scenes.
Proceedings of the International Conference on 3D Vision, 2021

Go with the Flows: Mixtures of Normalizing Flows for Point Cloud Generation and Reconstruction.
Proceedings of the International Conference on 3D Vision, 2021

Direct Dense Pose Estimation.
Proceedings of the International Conference on 3D Vision, 2021

2020
stagNet: An Attentive Semantic RNN for Group Activity and Individual Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2020

Geometrically Mappable Image Features.
IEEE Robotics Autom. Lett., 2020

Don't Forget The Past: Recurrent Depth Estimation from Monocular Video.
IEEE Robotics Autom. Lett., 2020

Learned Dynamic Guidance for Depth Image Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Efficient Visual Recognition.
Int. J. Comput. Vis., 2020

Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding.
Int. J. Comput. Vis., 2020

Quantifying Aleatoric and Epistemic Uncertainty Using Density Estimation in Latent Space.
CoRR, 2020

Understanding Bird's-Eye View Semantic HD-Maps Using an Onboard Monocular Camera.
CoRR, 2020

LID 2020: The Learning from Imperfect Data Challenge Results.
CoRR, 2020

Self-Supervised Shadow Removal.
CoRR, 2020

Self-Supervised Ranking for Representation Learning.
CoRR, 2020

Few-Shot Classification By Few-Iteration Meta-Learning.
CoRR, 2020

Improving Point Cloud Semantic Segmentation by Learning 3D Object Proposal Generation.
CoRR, 2020

Learning Condition Invariant Features for Retrieval-Based Localization from 1M Images.
CoRR, 2020

Learning Accurate and Human-Like Driving using Semantic Maps and Attention.
CoRR, 2020

Self-Calibration Supported Robust Projective Structure-from-Motion.
CoRR, 2020

The Heterogeneity Hypothesis: Finding Layer-Wise Dissimilated Network Architecture.
CoRR, 2020

OpenDVC: An Open Source Implementation of the DVC Video Compression Method.
CoRR, 2020

Analogical Image Translation for Fog Generation.
CoRR, 2020

Dense Non-Rigid Structure from Motion: A Manifold Viewpoint.
CoRR, 2020

Learning To Classify Images Without Labels.
CoRR, 2020

Revisiting Multi-Task Learning in the Deep Learning Era.
CoRR, 2020

Quantifying Data Augmentation for LiDAR based 3D Object Detection.
CoRR, 2020

Fully in tensor computation manner: one-shot dense 3D structured light and beyond.
Cogn. Comput. Syst., 2020

Efficient Video Semantic Segmentation with Labels Propagation and Refinement.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Towards Good Practice for CNN-Based Monocular Depth Estimation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Same Same but Different: Augmentation of Tiny Industrial Datasets using Generative Adversarial Networks.
Proceedings of the 7th Swiss Conference on Data Science, 2020

Safe Motion Planning for Autonomous Driving using an Adversarial Road Model.
Proceedings of the Robotics: Science and Systems XVI, 2020

GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Soft Contrastive Learning for Visual Localization.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Modelling the Distribution of 3D Brain MRI Using a 2D Slice VAE.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Action Sequence Predictions of Vehicles in Urban Environments using Map and Social Context.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Depth Estimation from Monocular Images and Sparse Radar Data.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Learning Accurate and Human-Like Driving using Semantic Maps and Attention.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

T-Basis: a Compact Representation for Neural Networks.
Proceedings of the 37th International Conference on Machine Learning, 2020

Unpaired Image-To-Image Shape Translation Across Fashion Data.
Proceedings of the IEEE International Conference on Image Processing, 2020

Dual Grid Net: Hand Mesh Vertex Regression from Single Depth Maps.
Proceedings of the Computer Vision - ECCV 2020, 2020

Efficiently Detecting Plausible Locations for Object Placement Using Masked Convolutions.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Modeling the Effects of Windshield Refraction for Camera Calibration.
Proceedings of the Computer Vision - ECCV 2020, 2020

Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds.
Proceedings of the Computer Vision - ECCV 2020, 2020

MTI-Net: Multi-scale Task Interaction Networks for Multi-task Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fixing Localization Errors to Improve Image Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects.
Proceedings of the Computer Vision - ECCV 2020, 2020

Weakly Supervised 3D Object Detection from Lidar Point Cloud.
Proceedings of the Computer Vision - ECCV 2020, 2020

SRFlow: Learning the Super-Resolution Space with Normalizing Flow.
Proceedings of the Computer Vision - ECCV 2020, 2020

Video Object Segmentation with Episodic Graph Memory Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

DHP: Differentiable Meta Pruning via HyperNetworks.
Proceedings of the Computer Vision - ECCV 2020, 2020

The Eighth Visual Object Tracking VOT2020 Challenge Results.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Flexible Example-Based Image Enhancement with Task Adaptive Global Feature Self-guided Network.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Reparameterizing Convolutions for Incremental Multi-Task Learning Without Task Interference.
Proceedings of the Computer Vision - ECCV 2020, 2020

SCAN: Learning to Classify Images Without Labels.
Proceedings of the Computer Vision - ECCV 2020, 2020

Unsupervised Learning of Category-Specific Symmetric 3D Keypoints from Point Sets.
Proceedings of the Computer Vision - ECCV 2020, 2020

Large Scale Holistic Video Understanding.
Proceedings of the Computer Vision - ECCV 2020, 2020

Commands 4 Autonomous Vehicles (C4AV) Workshop Summary.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Consistency Guided Scene Flow Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning What to Learn for Video Object Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Know Your Surroundings: Exploiting Scene Information for Object Tracking.
Proceedings of the Computer Vision - ECCV 2020, 2020

Deep Unfolding Network for Image Super-Resolution.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning for Video Compression With Hierarchical Quality and Recurrent Enhancement.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Decomposing Image Generation into Layout Prediction and Conditional Synthesis.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Domain Agnostic Feature Learning for Image and Video Based Face Anti-spoofing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Unsupervised Hierarchical Part Decomposition of 3D Objects From a Single RGB Image.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Better Lossless Compression Using Lossy Compression.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Group Sparsity: The Hinge Between Filter Pruning and Decomposition for Network Compression.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Replacing Mobile Camera ISP with a Single Deep Learning Model.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Probabilistic Regression for Visual Tracking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Self-supervised Object Motion and Depth Estimation from Video.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Weakly Paired Multi-Domain Image Translation.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Branched Multi-Task Networks: Deciding what layers to share.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Automated Search for Resource-Efficient Branched Multi-Task Networks.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019
Video Object Segmentation without Temporal Information.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Temporal Segment Networks for Action Recognition in Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Divide-and-Conquer Adversarial Learning for High-Resolution Image and Video Enhancement.
CoRR, 2019

Talk2Nav: Long-Range Vision-and-Language Navigation in Cities.
CoRR, 2019

The 2019 DAVIS Challenge on VOS: Unsupervised Multi-Object Segmentation.
CoRR, 2019

Holistic Large Scale Video Understanding.
CoRR, 2019

A Novel BiLevel Paradigm for Image-to-Image Translation.
CoRR, 2019

Branched Multi-Task Networks: Deciding What Layers To Share.
CoRR, 2019

Fast video object segmentation with Spatio-Temporal GANs.
CoRR, 2019

Learning Accurate, Comfortable and Human-like Driving.
CoRR, 2019

Semantic Nighttime Image Segmentation with Synthetic Stylized Data, Gradual Adaptation and Uncertainty-Aware Evaluation.
CoRR, 2019

A Three-Player GAN: Generating Hard Samples to Improve Classification Networks.
Proceedings of the 16th International Conference on Machine Vision Applications, 2019

Uncertainty based model selection for fast semantic segmentation.
Proceedings of the 16th International Conference on Machine Vision Applications, 2019

Sparse and Noisy LiDAR Completion with RGB Guidance and Uncertainty.
Proceedings of the 16th International Conference on Machine Vision Applications, 2019

Real-time 3D Traffic Cone Detection for Autonomous Driving.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

Texture Underfitting for Domain Adaptation.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Learning a Curve Guardian for Motorcycles.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Semantic Understanding of Foggy Scenes with Purely Synthetic Data.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Night-to-Day Image Translation for Retrieval-based Localization.
Proceedings of the International Conference on Robotics and Automation, 2019

Adversarial Binary Coding for Efficient Person Re-Identification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency.
Proceedings of the 7th International Conference on Learning Representations, 2019

Optimal Transport Maps For Distribution Preserving Operations on Latent Spaces of Generative Models.
Proceedings of the 7th International Conference on Learning Representations, 2019

Extremely Weak Supervised Image-to-Image Translation for Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

SMIT: Stochastic Multi-Label Image-to-Image Translation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

The Seventh Visual Object Tracking VOT2019 Challenge Results.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

AI Benchmark: All About Deep Learning on Smartphones in 2019.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

End-to-end Lane Detection through Differentiable Least-Squares Fitting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Semi-Supervised Learning by Augmented Distribution Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Guided Curriculum Model Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Convex Relaxations for Consensus and Non-Minimal Problems in 3D Vision.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Filter Basis for Convolutional Neural Network Compression.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Fast Image Restoration With Multi-Bin Trainable Linear Units.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Self-Guided Network for Fast Image Denoising.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DynamoNet: Dynamic Action and Motion Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Discriminative Model Prediction for Tracking.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Generative Adversarial Networks for Extreme Learned Image Compression.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Talk2Car: Taking Control of Your Self-Driving Car.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Sliced Wasserstein Generative Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Self-Supervised 3D Hand Pose Estimation Through Training by Fitting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Mapping, Localization and Path Planning for Image-Based Navigation Using Visual Features and Map.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Feature Representations for Look-Alike Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Unsupervised Learning of Consensus Maximization for 3D Vision Problems.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

What Correspondences Reveal About Unknown Camera and Motion Models?
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Practical Full Resolution Learned Lossless Image Compression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

3D Appearance Super-Resolution With Deep Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

DLOW: Domain Flow for Adaptation and Generalization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Weakly Supervised Object Discovery by Generative Adversarial & Ranking Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Learning Semantic Segmentation From Synthetic Data: A Geometrically Guided Input-Output Adaptation Approach.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Tracking the Known and the Unknown by Leveraging Semantic Information.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Manifold-Valued Image Generation with Wasserstein Generative Adversarial Nets.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
AENet: Learning Deep Audio Features for Video Analysis.
IEEE Trans. Multim., 2018

Geometry-Aware Similarity Learning on SPD Manifolds for Visual Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2018

Automatic Tool Landmark Detection for Stereo Vision in Robot-Assisted Retinal Surgery.
IEEE Robotics Autom. Lett., 2018

Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Domain Generalization and Adaptation Using Low Rank Exemplar SVMs.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Visual Recognition in RGB Images and Videos by Learning from RGB-D Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Cross Euclidean-to-Riemannian Metric Learning with Application to Face Recognition from Video.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Reflectance and Natural Illumination from Single-Material Specular Objects Using Deep Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Transferring Deep Object and Scene Representations for Event Recognition in Still Images.
Int. J. Comput. Vis., 2018

Semantic Foggy Scene Understanding with Synthetic Data.
Int. J. Comput. Vis., 2018

Deep Expectation of Real and Apparent Age from a Single Image Without Facial Landmarks.
Int. J. Comput. Vis., 2018

Guest Editorial: Vision and Computational Photography and Graphics.
Comput. Vis. Image Underst., 2018

Image-based Navigation using Visual Features and Map.
CoRR, 2018

Integrated unpaired appearance-preserving shape translation across domains.
CoRR, 2018

Non-invasive thermal comfort perception based on subtleness magnification and deep learning for energy efficiency.
CoRR, 2018

Towards High Resolution Video Generation with Progressive Growing of Sliced Wasserstein GANs.
CoRR, 2018

Multi-bin Trainable Linear Unit for Fast Image Restoration Networks.
CoRR, 2018

Exemplar Guided Unsupervised Image-to-Image Translation.
CoRR, 2018

Ensemble Manifold Segmentation for Model Distillation and Semi-supervised Learning.
CoRR, 2018

Learning Driving Models with a Surround-View Camera System and a Route Planner.
CoRR, 2018

Deep Unsupervised Intrinsic Image Decomposition by Siamese Training.
CoRR, 2018

The 2018 DAVIS Challenge on Video Object Segmentation.
CoRR, 2018

Unsupervised Deep Single-Image Intrinsic Decomposition using Illumination-Varying Image Sequences.
Comput. Graph. Forum, 2018

An Analysis of Human-Centered Geolocation.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

DARN: A Deep Adversarial Residual Network for Intrinsic Image Decomposition.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

From Pixels to Actions: Learning to Drive a Car with Deep Neural Networks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

The CAMETRON Lecture Recording System: High Quality Video Recording and Editing with Minimal Human Supervision.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Iterative Deep Retinal Topology Extraction.
Proceedings of the Patch-Based Techniques in Medical Imaging, 2018

Towards End-to-End Lane Detection: an Instance Segmentation Approach.
Proceedings of the 2018 IEEE Intelligent Vehicles Symposium, 2018

Failure Prediction for Autonomous Driving.
Proceedings of the 2018 IEEE Intelligent Vehicles Symposium, 2018

Dark Model Adaptation: Semantic Image Segmentation from Daytime to Nighttime.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Integrating Local and Non-local Denoiser Priors for Image Restoration.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Towards Image Understanding from Deep Compression Without Decoding.
Proceedings of the 6th International Conference on Learning Representations, 2018

Hierarchical Attention and Context Modeling for Group Activity Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

A Status Quaestionis and Future Solutions for Using Multi-light Reflectance Imaging Approaches for Preserving Cultural Heritage Artifacts.
Proceedings of the Digital Heritage. Progress in Cultural Heritage: Documentation, Preservation, and Protection, 2018

Generative Domain-Migration Hashing for Sketch-to-Image Retrieval.
Proceedings of the Computer Vision - ECCV 2018, 2018

Wasserstein Divergence for GANs.
Proceedings of the Computer Vision - ECCV 2018, 2018

Fast Perceptual Image Enhancement.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding.
Proceedings of the Computer Vision - ECCV 2018, 2018

stagNet: An Attentive Semantic RNN for Group Activity Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Incremental Non-Rigid Structure-from-Motion with Unknown Focal Length.
Proceedings of the Computer Vision - ECCV 2018, 2018

Model-free Consensus Maximization for Non-Rigid Shapes.
Proceedings of the Computer Vision - ECCV 2018, 2018

Sampling Algebraic Varieties for Robust Camera Autocalibration.
Proceedings of the Computer Vision - ECCV 2018, 2018

Progressive Structure from Motion.
Proceedings of the Computer Vision - ECCV 2018, 2018

CARN: Convolutional Anchored Regression Network for Fast and Accurate Single Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

AI Benchmark: Running Deep Neural Networks on Android Smartphones.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

End-to-End Learning of Driving Models with Surround-View Cameras and Route Planners.
Proceedings of the Computer Vision - ECCV 2018, 2018

Spatio-temporal Channel Correlation Networks for Action Classification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Appearance-and-Relation Networks for Video Classification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Dense 3D Regression for Hand Pose Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Object Referring in Videos With Language and Human Gaze.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

NTIRE 2018 Challenge on Single Image Super-Resolution: Methods and Results.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Natural and Effective Obfuscation by Head Inpainting.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Consensus Maximization for Semantic Region Correspondences.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Classification-Driven Dynamic Image Enhancement.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Logo Synthesis and Manipulation With Clustered Generative Adversarial Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

RayNet: Learning Volumetric 3D Reconstruction With Ray Potentials.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Generative Adversarial Style Transfer Networks for Face Aging.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Conditional Probability Models for Deep Image Compression.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep Extreme Cut: From Extreme Points to Object Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Disentangled Person Image Generation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Viewpoint-Aware Video Summarization.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

WESPE: Weakly Supervised Photo Enhancer for Digital Cameras.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Error Correction for Dense Semantic Image Labeling.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Temporal 3D ConvNets Using Temporal Transition Layer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Domain Adaptive Faster R-CNN for Object Detection in the Wild.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

WILDTRACK: A Multi-Camera HD Dataset for Dense Unscripted Pedestrian Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

ComboGAN: Unrestrained Scalability for Image Domain Translation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Extreme Learned Image Compression with GANs.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Covariance Pooling for Facial Expression Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Iterative Deep Learning for Road Topology Extraction.
Proceedings of the British Machine Vision Conference 2018, 2018

Customized Multi-person Tracker.
Proceedings of the Computer Vision - ACCV 2018, 2018

Building Deep Networks on Grassmann Manifolds.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Acquiring Common Sense Spatial Knowledge Through Implicit Spatial Templates.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
DeepProposals: Hunting Objects and Actions by Cascading Deep Convolutional Layers.
Int. J. Comput. Vis., 2017

Leveraging observation uncertainty for robust visual tracking.
Comput. Vis. Image Underst., 2017

Efficient architectural structural element decomposition.
Comput. Vis. Image Underst., 2017

Efficient edge-aware surface mesh reconstruction for urban scenes.
Comput. Vis. Image Underst., 2017

Manifold-valued Image Generation with Wasserstein Adversarial Networks.
CoRR, 2017

Iterative Deep Learning for Network Topology Extraction.
CoRR, 2017

Energy-relaxed Wasserstein GANs (EnergyWGAN): Towards More Stable and High Resolution Image Generation.
CoRR, 2017

Face Translation between Images and Videos using Identity-aware CycleGAN.
CoRR, 2017

Towards an Understanding of Our World by GANing Videos in the Wild.
CoRR, 2017

Detection-aided liver lesion segmentation using deep learning.
CoRR, 2017

Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification.
CoRR, 2017

Object Discovery By Generative Adversarial & Ranking Networks.
CoRR, 2017

Object Referring in Visual Scene with Spoken Language.
CoRR, 2017

WebVision Database: Visual Learning and Understanding from Web Data.
CoRR, 2017

Semantic Instance Segmentation with a Discriminative Loss Function.
CoRR, 2017

Fast Scene Understanding for Autonomous Driving.
CoRR, 2017

Speech-Based Visual Question Answering.
CoRR, 2017

On the Relation between Color Image Denoising and Classification.
CoRR, 2017

Generative Autotransporters.
CoRR, 2017

Crossing Nets: Dual Generative Models with a Shared Latent Space for Hand Pose Estimation.
CoRR, 2017

The 2017 DAVIS Challenge on Video Object Segmentation.
CoRR, 2017

WebVision Challenge: Visual Learning and Understanding With Web Data.
CoRR, 2017

F2F: A Library For Fast Kernel Expansions.
CoRR, 2017

HashBox: Hash Hierarchical Segmentation exploiting Bounding Box Object Detection.
CoRR, 2017

The WILDTRACK Multi-Camera Person Dataset.
CoRR, 2017

Semantically-Guided Video Object Segmentation.
CoRR, 2017

Soft-to-Hard Vector Quantization for End-to-End Learned Compression of Images and Neural Networks.
CoRR, 2017

Repeated Pattern Detection Using CNN Activations.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Material Classification under Natural Illumination Using Reflectance Maps.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

VarCity - the video: the struggles and triumphs of leveraging fundamental research results in a graphics video production.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2017

k²-means for Fast and Accurate Large Scale Clustering.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

Pose Guided Person Image Generation.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Deep visual words: Improved fisher vector for image classification.
Proceedings of the Fifteenth IAPR International Conference on Machine Vision Applications, 2017

Query-adaptive Video Summarization via Quality-aware Relevance Estimation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Augmented Reality for User-Friendly Intra-Oral Scanning.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2017

Deep Domain Adaptation by Geodesic Distance Minimization.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

PathTrack: Fast Trajectory Annotation with Path Supervision.
Proceedings of the IEEE International Conference on Computer Vision, 2017

DSLR-Quality Photos on Mobile Devices with Deep Convolutional Networks.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learned Multi-patch Similarity.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Optimal Transformation Estimation with Semantic Cues.
Proceedings of the IEEE International Conference on Computer Vision, 2017

What is Around the Camera?
Proceedings of the IEEE International Conference on Computer Vision, 2017

Anchored Regression Networks Applied to Age Estimation and Super Resolution.
Proceedings of the IEEE International Conference on Computer Vision, 2017

UntrimmedNets for Weakly Supervised Action Recognition and Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Crossing Nets: Combining GANs and VAEs with a Shared Latent Space for Hand Pose Estimation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Consensus Maximization with Linear Matrix Inequality Constraints.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep Learning on Lie Groups for Skeleton-Based Action Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Weakly Supervised Cascaded Convolutional Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep Temporal Linear Encoding Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

One-Shot Video Object Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Semantic Instance Segmentation for Autonomous Driving.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Cloud-based collaborative 3D reconstruction using smartphones.
Proceedings of the 14th European Conference on Visual Media Production (CVMP 2017), 2017

A Riemannian Network for SPD Matrix Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Efficient Model-Free Anthropometry from Depth Data.
Proceedings of the 2017 International Conference on 3D Vision, 2017

Autonomous Mapping of the Priscilla Catacombs.
Proceedings of the Mixed Reality and Gamification for Cultural Heritage, 2017

2016
Introduction to Large-Scale Visual Geo-localization.
Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Image Computing, 2016

Learning to Predict Sequences of Human Visual Fixations.
IEEE Trans. Neural Networks Learn. Syst., 2016

Demosaicing Based on Directional Difference Regression and Efficient Regression Priors.
IEEE Trans. Image Process., 2016

Sub-Markov Random Walk for Image Segmentation.
IEEE Trans. Image Process., 2016

Joint Tracking and Ground Plane Estimation.
IEEE Signal Process. Lett., 2016

Incremental Learning of Random Forests for Large-Scale Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

ATLAS: A Three-Layered Approach to Facade Parsing.
Int. J. Comput. Vis., 2016

Semantic super-resolution: When and where is it useful?
Comput. Vis. Image Underst., 2016

PICASO: PIxel correspondences and SOft match selection for real-time tracking.
Comput. Vis. Image Underst., 2016

Tracking by switching state space models.
Comput. Vis. Image Underst., 2016

CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016.
CoRR, 2016

Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images.
CoRR, 2016

Direction matters: hand pose estimation from local surface normals.
CoRR, 2016

Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Detection.
CoRR, 2016

Failure Detection for Facial Landmark Detectors.
CoRR, 2016

Does V-NIR based Image Enhancement Come with Better Features?
CoRR, 2016

Image-level Classification in Hyperspectral Images using Feature Descriptors, with Application to Face Recognition.
CoRR, 2016

DARN: a Deep Adversial Residual Network for Intrinsic Image Decomposition.
CoRR, 2016

Natural Illumination from Multiple Materials Using Deep Learning.
CoRR, 2016

DeLight-Net: Decomposing Reflectance Maps into Specular Materials and Natural Illumination.
CoRR, 2016

Efficient Two-Stream Motion and Appearance 3D CNNs for Video Classification.
CoRR, 2016

Unsupervised High-level Feature Learning by Ensemble Projection for Semi-supervised Image Classification and Image Clustering.
CoRR, 2016

k2-means for fast and accurate large scale clustering.
CoRR, 2016

Energy-efficient ConvNets through approximate computing.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Leveraging single for multi-target tracking using a novel trajectory overlap affinity measure.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Mobile phone and cloud - A dream team for 3D reconstruction.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Architectural decomposition for 3D landmark building understanding.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Is image super-resolution helpful for other vision tasks?
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Low-cost scene modeling using a density function improves segmentation performance.
Proceedings of the 25th IEEE International Symposium on Robot and Human Interactive Communication, 2016

Dynamic Filter Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Deep Retinal Image Understanding.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, 2016

ETH-CVL @ MediaEval 2016: Textual-Visual Embeddings and Video2GIF for Video Interestingness.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

A Dataset for Multimodal Question Answering in the Cultural Heritage Domain.
Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities, 2016

Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Recognition.
Proceedings of the Interspeech 2016, 2016

Dilemma First Search for effortless optimization of NP-hard problems.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Efficient volumetric fusion of airborne and street-side data for urban reconstruction.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Regressor Basis Learning for anchored super-resolution.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition.
Proceedings of the Computer Vision - ECCV 2016, 2016

Hand Pose Estimation from Local Surface Normals.
Proceedings of the Computer Vision - ECCV 2016, 2016

Combining Human Body Shape and Pose Estimation for Robust Upper Body Tracking Using a Depth Sensor.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Convolutional Oriented Boundaries.
Proceedings of the Computer Vision - ECCV 2016, 2016

Fast Optical Flow Using Dense Inverse Search.
Proceedings of the Computer Vision - ECCV 2016, 2016

Actionness Estimation Using Hybrid Fully Convolutional Networks.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Deep Features or Not: Temperature and Time Prediction in Outdoor Scenes.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

Structured Output SVM Prediction of Apparent Age, Gender and Smile from Deep Features.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

Seven Ways to Improve Example-Based Single Image Super Resolution.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Some Like It Hot - Visual Guidance for Preference Prediction.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Progressive Prioritized Multi-view Stereo.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Fast Algorithms for Linear and Kernel SVM+.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Scale-Aware Alignment of Hierarchical Image Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Two-Stream SR-CNNs for Action Recognition in Videos.
Proceedings of the British Machine Vision Conference 2016, 2016

Generic 3D Convolutional Fusion for Image Restoration.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

Markov Chain Monte Carlo Cascade for Camera Network Calibration Based on Unconstrained Pedestrian Tracklets.
Proceedings of the Computer Vision - ACCV 2016, 2016

Progressive 3D Modeling All the Way.
Proceedings of the Fourth International Conference on 3D Vision, 2016

3D Saliency for Finding Landmark Buildings.
Proceedings of the Fourth International Conference on 3D Vision, 2016

2015
Iterative Nearest Neighbors.
Pattern Recognit., 2015

An Elastic Deformation Field Model for Object Detection and Tracking.
Int. J. Comput. Vis., 2015

The Pascal Visual Object Classes Challenge: A Retrospective.
Int. J. Comput. Vis., 2015

SEEDS: Superpixels Extracted Via Energy-Driven Sampling.
Int. J. Comput. Vis., 2015

Weakly supervised motion segmentation with particle matching.
Comput. Vis. Image Underst., 2015

Oracle MCG: A first peek into COCO Detection Challenges.
CoRR, 2015

How Useful Is Image Super-resolution to Other Vision Tasks?
CoRR, 2015

Jointly Optimized Regressors for Image Super-resolution.
Comput. Graph. Forum, 2015

Learned Collaborative Representations for Image Classification.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Sparse Flow: Sparse Matching for Small to Large Displacement Optical Flow.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Discovery of Sets of Mutually Orthogonal Vanishing Points in Videos.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision Workshops, 2015

Discriminative learning of apparel features.
Proceedings of the 14th IAPR International Conference on Machine Vision Applications, 2015

ETH-CVL @ MediaEval 2015: Learning Objective Functions for Improved Image Retrieval.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Automatic Handwritten Mensural Notation Interpreter: From Manuscript to MIDI Performance.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Robust aerial object tracking in images with lens flare.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Efficient Real-Time Pixelwise Object Class Labeling for Safe Human-Robot Collaboration in Industrial Domain.
Proceedings of the 4th Workshop on Machine Learning for Interactive Systems, 2015

Efficient regression priors for post-processing demosaiced images.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Efficient regression priors for reducing image compression artifacts.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

DLDR: Deep Linear Discriminative Retrieval for Cultural Event Classification from a Single Image.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

DEX: Deep EXpectation of Apparent Age from a Single Image.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Boosting Object Proposals: From Pascal to COCO.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

A Gaussian Process Latent Variable Model for BRDF Inference.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

A new approach to digitalization and data management of cultural heritage sites.
Proceedings of the 2nd Digital Heritage International Congress, 2015

From categories to subcategories: Large-scale image classification with partial class label refinement.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

3D all the way: Semantic segmentation of urban scenes from start to end in 3D.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Joint vanishing point extraction and tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Make my day - high-fidelity color denoising with Near-Infrared.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Video summarization by learning submodular mixtures of objectives.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Metric imitation by manifold transfer for efficient vision applications.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Superpixel meshes for fast edge-preserving surface reconstruction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Saliency Prediction with Active Semantic Segmentation.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
One-Shot Person Re-identification with a Consumer Depth Camera.
Proceedings of the Person Re-Identification, 2014

Derivative-Based Scale Invariant Image Feature Detector With Error Resilience.
IEEE Trans. Image Process., 2014

Adaptive and Weighted Collaborative Representations for image classification.
Pattern Recognit. Lett., 2014

Body Parts Dependent Joint Regressors for Human Pose Estimation in Still Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Multi-view traffic sign detection, recognition, and 3D localisation.
Mach. Vis. Appl., 2014

Special issue on car navigation and vehicle systems.
Mach. Vis. Appl., 2014

Guest editorial: Event-based video analysis/retrieval.
Multim. Tools Appl., 2014

Branch&Rank for Efficient Object Detection.
Int. J. Comput. Vis., 2014

Object and Action Classification with Latent Window Parameters.
Int. J. Comput. Vis., 2014

Comment on "Ensemble Projection for Semi-supervised Image Classification".
CoRR, 2014

Markerless Vision-Based Augmented Reality for Urban Planning.
Comput. Aided Civ. Infrastructure Eng., 2014

Scale-invariant line descriptors for wide baseline matching.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Quantized Kernel Learning for Feature Matching.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Self-Adaptable Templates for Feature Coding.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

3D reconstruction of freely moving persons for re-identification with a depth sensor.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Navigation using special buildings as signposts.
Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Interacting with Maps, 2014

Segmentation Using SubMarkov Random Walk.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2014

Learning Where to Classify in Multi-view Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Face Detection without Bells and Whistles.
Proceedings of the Computer Vision - ECCV 2014, 2014

Appearances Can Be Deceiving: Learning Visual Tracking from Few Trajectory Annotations.
Proceedings of the Computer Vision - ECCV 2014, 2014

Robust Visual Tracking with Double Bounding Box Model.
Proceedings of the Computer Vision - ECCV 2014, 2014

Video Registration to SfM Models.
Proceedings of the Computer Vision - ECCV 2014, 2014

Creating Summaries from User Videos.
Proceedings of the Computer Vision - ECCV 2014, 2014

Food-101 - Mining Discriminative Components with Random Forests.
Proceedings of the Computer Vision - ECCV 2014, 2014

Motion Segmentation with Weak Labeling Priors.
Proceedings of the Pattern Recognition - 36th German Conference, 2014

Multi-view Tracking of Multiple Targets with Dynamic Cameras.
Proceedings of the Pattern Recognition - 36th German Conference, 2014

Gesture Recognition Portfolios for Personalization.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Latent Dictionary Learning for Sparse Representation Based Classification.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Incremental Learning of NCM Forests for Large-Scale Image Classification.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Using a Deformation Field Model for Localizing Faces and Facial Points under Weak Supervision.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Ground Plane Estimation Using a Hidden Markov Model.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

The Synthesizability of Texture Examples.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Fast, Approximate Piecewise-Planar Modeling Based on Sparse Structure-from-Motion and Superpixels.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Object Classification with Adaptable Regions.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Learning to Rank Histograms for Object Retrieval.
Proceedings of the British Machine Vision Conference, 2014

A unified framework for content-aware view selection and planning through view importance.
Proceedings of the British Machine Vision Conference, 2014

Frankenhorse: Automatic Completion of Articulating Objects from Image-based Reconstruction.
Proceedings of the British Machine Vision Conference, 2014

A+: Adjusted Anchored Neighborhood Regression for Fast Super-Resolution.
Proceedings of the Computer Vision - ACCV 2014, 2014

Non-maximum Suppression for Object Detection by Passing Messages Between Windows.
Proceedings of the Computer Vision - ACCV 2014, 2014

An Integer Linear Programming Model for View Selection on Overlapping Camera Clusters.
Proceedings of the 2nd International Conference on 3D Vision, 2014

Hierarchical Co-Segmentation of Building Facades.
Proceedings of the 2nd International Conference on 3D Vision, 2014

Reconstruction of Inextensible Surfaces on a Budget via Bootstrapping.
Proceedings of the 2nd International Conference on 3D Vision, 2014

Matching Features Correctly through Semantic Understanding.
Proceedings of the 2nd International Conference on 3D Vision, 2014

Tackling Shapes and BRDFs Head-On.
Proceedings of the 2nd International Conference on 3D Vision, 2014

2013
Efficient Loopy Belief Propagation Using the Four Color Theorem.
Proceedings of the Advanced Topics in Computer Vision, 2013

Motion Control of the CyberCarpet Platform.
IEEE Trans. Control. Syst. Technol., 2013

SIFER: Scale-Invariant Feature Detector with Error Resilience.
Int. J. Comput. Vis., 2013

Random Forests for Real Time 3D Face Analysis.
Int. J. Comput. Vis., 2013

Tracking with a mixed continuous-discrete Conditional Random Field.
Comput. Vis. Image Underst., 2013

Random Binary Mappings for Kernel Learning and Efficient SVM.
CoRR, 2013

A Survey of Urban Reconstruction.
Comput. Graph. Forum, 2013

Nonuniform image patch exemplars for low level vision.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Depth SEEDS: Recovering incomplete depth data using superpixels.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Exploration and mapping of catacombs with mobile robots.
Proceedings of the IEEE International Symposium on Safety, Security, and Rescue Robotics, 2013

Visual interestingness in image sequences.
Proceedings of the ACM Multimedia Conference, 2013

Traffic sign recognition - How far are we from the solution?
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Sparse Variation Dictionary Learning for Face Recognition with a Single Training Sample per Person.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Anchored Neighborhood Regression for Fast Example-Based Super-Resolution.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Active MAP Inference in CRFs for Efficient Semantic Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Handling Occlusions with Franken-Classifiers.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Prime Object Proposals with Randomized Prim's Algorithm.
Proceedings of the IEEE International Conference on Computer Vision, 2013

The Interestingness of Images.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Example-Based Facade Texture Synthesis.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Ensemble Projection for Semi-supervised Image Classification.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Event Recognition in Photo Collections with a Stopwatch HMM.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Online Video SEEDS for Temporal Window Objectness.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Face recognition based on regularized nearest points between image sets.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Real time 3D face alignment with Random Forests-based Active Appearance Models.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

A Comparative Study of Astronomical Clock towers in Europe and China based on their detailed 3D modeling.
Proceedings of the 8th Annual International Conference of the Alliance of Digital Humanities Organizations, 2013

Robust Realtime Motion-Split-And-Merge for Motion Segmentation.
Proceedings of the Pattern Recognition - 35th German Conference, 2013

Is There a Procedural Logic to Architecture?
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Query Adaptive Similarity for Large Scale Object Retrieval.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Bayesian Grammar Learning for Inverse Procedural Modeling.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Fast Energy Minimization Using Learned State Filters.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Human Pose Estimation Using Body Parts Dependent Joint Regressors.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Sparse Quantization for Patch Description.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Seeking the Strongest Rigid Detector.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Robust Scene Stitching in Large Scale Mobile Mapping.
Proceedings of the British Machine Vision Conference, 2013

Overlapping camera clustering through dominant sets for scalable 3D reconstruction.
Proceedings of the British Machine Vision Conference, 2013

Automatic Shape Expansion with Verification to Improve 3D Retrieval, Classification and Matching.
Proceedings of the 6th Eurographics Workshop on 3D Object Retrieval, 2013

2012
Discrimination of Locomotion Direction at Different Speeds: A Comparison between Macaque Monkeys and Algorithms.
Proceedings of the Detection and Identification of Rare Audiovisual Cues, 2012

DIRAC: Detection and Identification of Rare Audio-Visual Events.
Proceedings of the Detection and Identification of Rare Audiovisual Cues, 2012

Constant Time Joint Bilateral Filtering Using Joint Integral Histograms.
IEEE Trans. Image Process., 2012

In Memoriam: Mark Everingham.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Beyond Novelty Detection: Incongruent Events, When General and Specific Classifiers Disagree.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Coupled Action Recognition and Pose Estimation from Multiple Views.
Int. J. Comput. Vis., 2012

Efficient multi-camera vehicle detection, tracking, and identification in a tunnel surveillance application.
Comput. Vis. Image Underst., 2012

Codebook-free exemplar models for object detection.
Proceedings of the 13th International Workshop on Image Analysis for Multimedia Interactive Services, 2012

Non-parametric motion-priors for flow understanding.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

Real-time stereo and flow-based video segmentation with superpixels.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

Real time 3D head pose estimation: Recent achievements and future challenges.
Proceedings of the 5th International Symposium on Communications, 2012

On-line semantic perception using uncertainty.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Weighted collaborative representation and classification of images.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Towards a real-time high-definition depth sensor with hardware-efficient stereo matching.
Proceedings of the Stereoscopic Displays and Applications XXIII, 2012

Base Materials for Photometric Stereo.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Latent Hough Transform for Object Detection.
Proceedings of the Computer Vision - ECCV 2012, 2012

Destination Flow for Crowd Simulation.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Optimal Templates for Nonrigid Surface Reconstruction.
Proceedings of the Computer Vision - ECCV 2012, 2012

A Three-Layered Approach to Facade Parsing.
Proceedings of the Computer Vision - ECCV 2012, 2012

Stixels Motion Estimation without Optical Flow Computation.
Proceedings of the Computer Vision - ECCV 2012, 2012

Learning Domain Knowledge for Façade Labelling.
Proceedings of the Computer Vision - ECCV 2012, 2012

Ensemble Partitioning for Unsupervised Image Categorization.
Proceedings of the Computer Vision - ECCV 2012, 2012

TriCoS: A Tri-level Class-Discriminative Co-segmentation Method for Image Classification.
Proceedings of the Computer Vision - ECCV 2012, 2012

Nested Sparse Quantization for Efficient Feature Coding.
Proceedings of the Computer Vision - ECCV 2012, 2012

SEEDS: Superpixels Extracted via Energy-Driven Sampling.
Proceedings of the Computer Vision - ECCV 2012, 2012

Fast Stixel Computation for Fast Pedestrian Detection.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Motion Capture of Hands in Action Using Discriminative Salient Points.
Proceedings of the Computer Vision - ECCV 2012, 2012

Classification with Global, Local and Shared Features.
Proceedings of the Pattern Recognition, 2012

Interactive object detection.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Iterative Nearest Neighbors for classification and dimensionality reduction.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Parameter-free/Pareto-driven procedural 3D reconstruction of buildings from ground-level sequences.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Real-time facial feature detection using conditional regression forests.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Pedestrian detection at 100 frames per second.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

A Training-free Classification Framework for Textures, Writers, and Materials.
Proceedings of the British Machine Vision Conference, 2012

Sparsity Potentials for Detecting Objects with the Hough Transform.
Proceedings of the British Machine Vision Conference, 2012

Metric Learning from Poses for Temporal Clustering of Human Motion.
Proceedings of the British Machine Vision Conference, 2012

Naive Bayes Image Classification: Beyond Nearest Neighbors.
Proceedings of the Computer Vision - ACCV 2012, 2012

Automatic Stave Discovery for Musical Facsimiles.
Proceedings of the Computer Vision, 2012

Dynamic Objectness for Adaptive Tracking.
Proceedings of the Computer Vision - ACCV 2012, 2012

Local Context Priors for Object Proposal Generation.
Proceedings of the Computer Vision - ACCV 2012, 2012

Apparel Classification with Style.
Proceedings of the Computer Vision, 2012

Exploiting Physical Inconsistencies for 3D Scene Understanding.
Proceedings of the 2012 Second International Conference on 3D Imaging, 2012

2011
Real-Time and Accurate Stereo: A Scalable Approach With Bitwise Fast Voting on CUDA.
IEEE Trans. Circuits Syst. Video Technol., 2011

Robust Low Complexity Corner Detector.
IEEE Trans. Circuits Syst. Video Technol., 2011

Hough Forests for Object Detection, Tracking, and Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Online Multiperson Tracking-by-Detection from a Single, Uncalibrated Camera.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Online classification of visual tasks for industrial workflow monitoring.
Neural Networks, 2011

Fast PRISM: Branch and Bound Hough Transform for Object Class Detection.
Int. J. Comput. Vis., 2011

Online loop closure for real-time interactive 3D scanning.
Comput. Vis. Image Underst., 2011

Action recognition: A region based approach.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Combining RGB and ToF cameras for real-time 3D hand gesture interaction.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Systematic evaluation of super-resolution using classification.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

The V-City Project.
Proceedings of the 12th International Symposium on Virtual Reality, 2011

Reconstructing and Exploring Massive Detailed Cityscapes.
Proceedings of the 12th International Symposium on Virtual Reality, 2011

An Introduction to Random Forests for Multi-class Object Detection.
Proceedings of the Outdoor and Large-Scale Real-World Scene Analysis - 15th International Workshop on Theoretical Foundations of Computer Vision, Dagstuhl Castle, Germany, June 26, 2011

Real-time 3D hand gesture interaction with a robot for understanding directions from humans.
Proceedings of the 20th IEEE International Symposium on Robot and Human Interactive Communication, 2011

A cross-based filter for fast edge-preserving smoothing.
Proceedings of the Real-Time Image and Video Processing 2011, 2011

Learning Probabilistic Non-Linear Latent Variable Models for Tracking Complex Activities.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

A Public System for Image Based 3D Model Generation.
Proceedings of the Computer Vision/Computer Graphics Collaboration Techniques, 2011

Real-time sign language letter and word recognition from depth data.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Multi-view manhole detection, recognition, and 3D localisation.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Transferring activities: Updating human behavior analysis.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Unsupervised workflow discovery in industrial environments.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Augmented faces.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Stixels estimation without depth map computation.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

WEAR++: 3D model driven camera tracking on board the International Space Station.
Proceedings of the International Conference on 3D Imaging, 2011

Data-driven animation of hand-object interactions.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Real Time Head Pose Estimation from Consumer Depth Cameras.
Proceedings of the Pattern Recognition - 33rd DAGM Symposium, Frankfurt/Main, Germany, August 31, 2011

Efficient multi-camera detection, tracking, and identification using a shared set of haar-features.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Scalable multi-class object detection.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Hello neighbor: Accurate object retrieval with k-reciprocal nearest neighbors.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Separating rigid motion from linear local deformation models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2011

What makes a chair a chair?
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Functional categorization of objects using real-time markerless motion capture.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Real time head pose estimation with random regression forests.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Does Human Action Recognition Benefit from Pose Estimation?
Proceedings of the British Machine Vision Conference, 2011

Efficient 3D Object Detection using Multiple Pose-Specific Classifiers.
Proceedings of the British Machine Vision Conference, 2011

Sparse Representation Based Projections.
Proceedings of the British Machine Vision Conference, 2011

On-line Hough Forests.
Proceedings of the British Machine Vision Conference, 2011

Class-specific 3D localization using constellations of object parts.
Proceedings of the British Machine Vision Conference, 2011

Temporal Relations in Videos for Unsupervised Activity Analysis.
Proceedings of the British Machine Vision Conference, 2011

Transforming Image Completion.
Proceedings of the British Machine Vision Conference, 2011

Branch&Rank: Non-Linear Object Detection.
Proceedings of the British Machine Vision Conference, 2011

Object and Action Classification with Latent Variables.
Proceedings of the British Machine Vision Conference, 2011

Improved person detection in industrial environments using multiple self-calibrated cameras.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Automatic Occlusion Removal from Facades for 3D Urban Reconstruction.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2011

Procedural 3D Building Reconstruction Using Shape Grammars and Detectors.
Proceedings of the International Conference on 3D Imaging, 2011

Scene Cut: Class-Specific Object Detection and Segmentation in 3D Scenes.
Proceedings of the International Conference on 3D Imaging, 2011

Predicting Pedestrian Trajectories.
Proceedings of the Visual Analysis of Humans - Looking at People, 2011

2010
A 3-D Audio-Visual Corpus of Affective Communication.
IEEE Trans. Multim., 2010

Multibody Structure-from-Motion in Practice.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Multi-object tracking evaluated on sparse events.
Multim. Tools Appl., 2010

Analysis and retrieval of events/actions and workflows in video streams.
Multim. Tools Appl., 2010

Object Detection and Tracking for Autonomous Navigation in Dynamic Environments.
Int. J. Robotics Res., 2010

The Pascal Visual Object Classes (VOC) Challenge.
Int. J. Comput. Vis., 2010

Computational Symmetry in Computer Vision and Computer Graphics.
Found. Trends Comput. Graph. Vis., 2010

Grammar-based Encoding of Facades.
Comput. Graph. Forum, 2010

Real-time detection of unusual regions in image streams.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Orientation invariant 3D object classification using hough transform based methods.
Proceedings of the ACM workshop on 3D object retrieval, 2010

GPU-Accelerated Robotic Intra-operative Laparoscopic 3D Reconstruction.
Proceedings of the Information Processing in Computer-Assisted Interventions, 2010

Variations of a Hough-Voting Action Recognition System.
Proceedings of the Recognizing Patterns in Signals, Speech, Images and Videos, 2010

Integrating Object Detection with 3D Tracking Towards a Better Driver Assistance System.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Joint integral histograms and its application in stereo matching.
Proceedings of the International Conference on Image Processing, 2010

Robust low complexity feature tracking.
Proceedings of the International Conference on Image Processing, 2010

Lococo: low complexity corner detector.
Proceedings of the IEEE International Conference on Acoustics, 2010

Robust Workflow Recognition Using Holistic Features and Outlier-Tolerant Fused Hidden Markov Models.
Proceedings of the Artificial Neural Networks - ICANN 2010, 2010

Cascaded Confidence Filtering for Improved Tracking-by-Detection.
Proceedings of the Computer Vision, 2010

Backprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections.
Proceedings of the Computer Vision, 2010

Improving Data Association by Joint Modeling of Pedestrian Trajectories and Groupings.
Proceedings of the Computer Vision, 2010

Visibility Maps for Improving Seam Carving.
Proceedings of the Trends and Topics in Computer Vision, 2010

Scene Carving: Scene Consistent Image Retargeting.
Proceedings of the Computer Vision, 2010

Hough Transform and 3D SURF for Robust Three Dimensional Classification.
Proceedings of the Computer Vision - ECCV 2010, 2010

Size Does Matter: Improving Object Recognition and 3D Reconstruction with Cross-Media Analysis of Image Clusters.
Proceedings of the Computer Vision, 2010

2D Action Recognition Serves 3D Human Pose Estimation.
Proceedings of the Computer Vision, 2010

Hough Forest-Based Facial Expression Recognition from Video Sequences.
Proceedings of the Trends and Topics in Computer Vision, 2010

Tracking People in Broadcast Sports.
Proceedings of the Pattern Recognition, 2010

A Hough transform-based voting framework for action recognition.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Finding nemo: Deformable object class modelling using curve matching.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Wrong turn - No dead end: A stochastic pedestrian motion model.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Exploiting simple hierarchies for unsupervised human behavior analysis.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

What's going on? Discovering spatio-temporal dependencies in dynamic scenes.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

An object-dependent hand pose prior from sparse training data.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Tracking the invisible: Learning where the object might be.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Server-side object recognition and client-side object tracking for mobile augmented reality.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

On-line Adaption of Class-specific Codebooks for Instance Tracking.
Proceedings of the British Machine Vision Conference, 2010

Automatic annotation of unique locations from video and text.
Proceedings of the British Machine Vision Conference, 2010

Automatic Workflow Monitoring in Industrial Environments.
Proceedings of the Computer Vision - ACCV 2010, 2010

Four Color Theorem for Fast Early Vision.
Proceedings of the Computer Vision - ACCV 2010, 2010

Optimal Regions for Linear Model-Based 3D Face Reconstruction.
Proceedings of the Computer Vision - ACCV 2010, 2010

Object Flow: Learning Object Displacement.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Fast Categorisation of Articulated Human Motion.
Proceedings of the Machine Learning for Human Motion Analysis - Theory and Practice, 2010

2009
Robust Multiperson Tracking from a Mobile Platform.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Using Multi-view Recognition and Meta-data Annotation to Guide a Robot's Attention.
Int. J. Robotics Res., 2009

Learning Generative Models for Multi-Activity Body Pose Estimation.