Andrew Zisserman

ORCID: 0000-0002-8945-8573

Affiliations:
  • University of Oxford, UK


According to our database, Andrew Zisserman authored at least 620 papers between 1985 and 2024.

Bibliography

2024
FlexCap: Generating Rich, Localized, and Flexible Captions in Images.
CoRR, 2024

N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields.
CoRR, 2024

A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval.
CoRR, 2024

BootsTAP: Bootstrapped Training for Tracking-Any-Point.
CoRR, 2024

Synchformer: Efficient Synchronization from Sparse Cues.
CoRR, 2024

Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling.
CoRR, 2024

The Manga Whisperer: Automatically Generating Transcriptions for Comics.
CoRR, 2024

2023
Persistent animal identification leveraging non-visual markers.
Mach. Vis. Appl., July, 2023

Amodal Ground Truth and Completion in the Wild.
CoRR, 2023

Perception Test 2023: A Summary of the First Challenge And Outcome.
CoRR, 2023

Text-Conditioned Resampler For Long Form Video Understanding.
CoRR, 2023

Appearance-based Refinement for Object-Centric Motion Segmentation.
CoRR, 2023

A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames.
CoRR, 2023

Learning from One Continuous Video Stream.
CoRR, 2023

Predicting Spine Geometry and Scoliosis from DXA Scans.
CoRR, 2023

Show from Tell: Audio-Visual Modelling in Clinical Settings.
CoRR, 2023

What Does Stable Diffusion Know about the 3D Scene?
CoRR, 2023

OxfordVGG Submission to the EGO4D AV Transcription Challenge.
CoRR, 2023

Open-world Text-specified Object Counting.
CoRR, 2023

Three ways to improve feature alignment for open vocabulary detection.
CoRR, 2023

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio.
CoRR, 2023

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge.
CoRR, 2023

Zorro: the masked multimodal transformer.
CoRR, 2023

The Change You Want to See.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

No Representation Rules Them All in Category Discovery.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Deep Facial Phenotyping with Mixup Augmentation.
Proceedings of the Medical Image Understanding and Analysis - 27th Annual Conference, 2023

Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime.
Proceedings of the Medical Imaging with Deep Learning, 2023

3D Shape Analysis of Scoliosis.
Proceedings of the Shape in Medical Imaging - International Workshop, 2023

Multi-Modal Classifiers for Open-Vocabulary Object Detection.
Proceedings of the International Conference on Machine Learning, 2023

The Change You Want to See (Now in 3D).
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Helping Hands: An Object-Aware Ego-Centric Video Recognition Model.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Verbs in Action: Improving verb understanding in video-language models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

The Making and Breaking of Camouflage.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

AutoAD II: The Sequel - Who, When, and What in Movie Audio Description.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Epic-Sounds: A Large-Scale Dataset of Actions that Sound.
Proceedings of the IEEE International Conference on Acoustics, 2023

AutoAD: Movie Description in Context.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Light Touch Approach to Teaching Transformers Multi-view Geometry.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GestSync: Determining who is speaking without a talking head.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Open-world Text-specified Object Counting.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
LAEO-Net++: Revisiting People Looking at Each Other in Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

AutoNovel: Automatically Discovering and Learning Novel Visual Categories.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deep Audio-Visual Speech Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Scaling Up Sign Spotting Through Sign Language Dictionaries.
Int. J. Comput. Vis., 2022

End-to-end Tracking with a Multi-query Transformer.
CoRR, 2022

A CLIP-Hitchhiker's Guide to Long Video Retrieval.
CoRR, 2022

SpineNetV2: Automated Detection, Labelling and Radiological Grading Of Clinical MR Scans.
CoRR, 2022

Flamingo: a Visual Language Model for Few-Shot Learning.
CoRR, 2022

Hierarchical Perceiver.
CoRR, 2022

VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge.
CoRR, 2022

Segmenting Moving Objects via an Object-Centric Layered Representation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Associating Objects and Their Effects in Video through Coordination Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

TAP-Vid: A Benchmark for Tracking Any Point in a Video.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


Context-Aware Transformers for Spinal Cancer Detection and Radiological Grading.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Open-Set Recognition: A Good Closed-Set Classifier is All You Need.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Perceiver IO: A General Architecture for Structured Inputs & Outputs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Automatic Dense Annotation of Large-Vocabulary Sign Language Videos.
Proceedings of the Computer Vision - ECCV 2022, 2022

Object Discovery and Representation Networks.
Proceedings of the Computer Vision - ECCV 2022, 2022

Input-level Inductive Biases for 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

It's About Time: Analog Clock Reading in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Generalized Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Sub-word Level Lip Reading With Visual Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Label, Verify, Correct: A Simple Few Shot Object Detection Method.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Temporal Alignment Networks for Long-term Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A Tri-Layer Plugin to Improve Occluded Detection.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Weakly-supervised Fingerspelling Recognition in British Sign Language Videos.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

CounTR: Transformer-based Generalised Visual Counting.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Personalised CLIP or: how to find your vacation videos.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Turbo Training with Token Dropout.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Is an Object-Centric Video Representation Beneficial for Transfer?
Proceedings of the Computer Vision - ACCV 2022, 2022

Compressed Vision for Efficient Video Understanding.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Synthetic Humans for Action Recognition from Unseen Viewpoints.
Int. J. Comput. Vis., 2021

Tracking and Long-Term Identification Using Non-Visual Markers.
CoRR, 2021

BBC-Oxford British Sign Language Dataset.
CoRR, 2021

NeRF in detail: Learning to sample for view synthesis.
CoRR, 2021

Comment on Stochastic Polyak Step-Size: Performance of ALI-G.
CoRR, 2021

PASS: An ImageNet replacement for self-supervised pretraining without humans.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Automated Video Labelling: Identifying Faces by Corroborative Evidence.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Self-supervised Multi-modal Alignment for Whole Body Medical Imaging.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Perceiver: General Perception with Iterative Attention.
Proceedings of the 38th International Conference on Machine Learning, 2021

Visual Analysis of Chapbooks Printed in Scotland.
Proceedings of the HIP@ICDAR 2021: The 6th International Workshop on Historical Document Imaging and Processing, 2021

LSD-C: Linearly Separable Deep Clusters.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Face, Body, Voice: Video Person-Clustering with Multiple Modalities.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Self-supervised Video Object Segmentation by Motion Grouping.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Broaden Your Views for Self-Supervised Video Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TeachText: CrossModal Generalized Distillation for Text-Video Retrieval.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Aligning Subtitles in Sign Language Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

QUERYD: A Video Dataset with High-Quality Text and Audio Narrations.
Proceedings of the IEEE International Conference on Acoustics, 2021

Slow-Fast Auditory Streams for Audio Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

SeeHear: Signer Diarisation and a New Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2021

Playing a Part: Speaker Verification at the movies.
Proceedings of the IEEE International Conference on Acoustics, 2021

Temporal Query Networks for Fine-Grained Video Understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Co-Attention for Conditioned Image Matching.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Read and Attend: Temporal Localisation in Sign Language Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Omnimatte: Associating Objects and Their Effects in Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Localizing Visual Sounds the Hard Way.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Visual Keyword Spotting with Attention.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Segmenting Invisible Moving Objects.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Audio-Visual Synchronisation in the wild.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Layered neural rendering for retiming people in video.
ACM Trans. Graph., 2020

Automated Video Face Labelling for Films and TV Material.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Deep Insights into Convolutional Networks for Video Recognition.
Int. J. Comput. Vis., 2020

Voxceleb: Large-scale speaker verification in the wild.
Comput. Speech Lang., 2020

VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge.
CoRR, 2020

QuerYD: A video dataset with high-quality textual and audio narrations.
CoRR, 2020

A Short Note on the Kinetics-700-2020 Human Action Dataset.
CoRR, 2020

Inducing Predictive Uncertainty Estimation for Face Recognition.
CoRR, 2020

RareAct: A video dataset of unusual interactions.
CoRR, 2020

The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020).
CoRR, 2020

D2D: Learning to find good correspondences for image matching and manipulation.
CoRR, 2020

The AVA-Kinetics Localized Human Actions Video Dataset.
CoRR, 2020

Monocular Depth Estimation with Self-supervised Instance Adaptation.
CoRR, 2020

Self-supervised Co-Training for Video Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

CrossTransformers: spatially-aware few-shot transfer.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Self-Supervised MultiModal Versatile Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Convolutional Approach to Vertebrae Detection and Labelling in Whole Spine MRI.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Spot the Conversation: Speaker Diarisation in the Wild.
Proceedings of the Interspeech 2020, 2020

Now You're Speaking My Language: Visual Language Identification.
Proceedings of the Interspeech 2020, 2020

Training Neural Networks for and by Interpolation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Automatically Discovering and Learning New Visual Categories with Ranking Statistics.
Proceedings of the 8th International Conference on Learning Representations, 2020

Disentangled Speech Embeddings Using Cross-Modal Self-Supervision.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sight to Sound: An End-to-End Approach for Visual Piano Transcription.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Vggsound: A Large-Scale Audio-Visual Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ASR is All You Need: Cross-Modal Distillation for Lip Reading.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Adaptive Text Recognition Through Visual Matching.
Proceedings of the Computer Vision - ECCV 2020, 2020

Amplifying Key Cues for Human-Object-Interaction Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Memory-Augmented Dense Predictive Coding for Video Representation Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

SLRTP 2020: The Sign Language Recognition, Translation & Production Workshop.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval.
Proceedings of the Computer Vision - ECCV 2020, 2020

BSL-1K: Scaling Up Co-articulated Sign Language Recognition Using Mouthing Cues.
Proceedings of the Computer Vision - ECCV 2020, 2020

Self-supervised Learning of Audio-Visual Objects from Video.
Proceedings of the Computer Vision - ECCV 2020, 2020

Visual Grounding in Video for Unsupervised Word Translation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semi-Supervised Learning with Scarce Annotations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Speech2Action: Cross-Modal Supervision for Action Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

End-to-End Learning of Visual Representations From Uncurated Instructional Videos.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Counting Out Time: Class Agnostic Video Repetition Counting in the Wild.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Inducing Predictive Uncertainty Estimation for Face Verification.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Seeing wake words: Audio-visual Keyword Spotting.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Constrained Video Face Clustering using 1NN Relations.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Watch, Read and Lookup: Learning to Spot Signs from Multiple Supervisors.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Betrayed by Motion: Camouflaged Object Discovery via Motion Segmentation.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Condensed Movies: Story Based Retrieval with Contextual Embeddings.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
From Images to 3D Shape Attributes.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Non-contact physiological monitoring of preterm infants in the Neonatal Intensive Care Unit.
npj Digit. Medicine, 2019

Learning to Predict 3D Surfaces of Sculptures from Single and Multiple Views.
Int. J. Comput. Vis., 2019

You Said That?: Synthesising Talking Faces from Audio.
Int. J. Comput. Vis., 2019

VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge.
CoRR, 2019

A Short Note on the Kinetics-700 Human Action Dataset.
CoRR, 2019

Sim2real transfer learning for 3D pose estimation: motion to the rescue.
CoRR, 2019

A Hierarchical Probabilistic U-Net for Modeling Multi-Scale Ambiguities.
CoRR, 2019

Object Discovery with a Copy-Pasting GAN.
CoRR, 2019

A Geometric Approach to Obtain a Bird's Eye View from an Image.
CoRR, 2019

The VGG Image Annotator (VIA).
CoRR, 2019

The StreetLearn Environment and Dataset.
CoRR, 2019

Unsupervised Learning of Object Keypoints for Perception and Control.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Sim2real transfer learning for 3D human pose estimation: motion to the rescue.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

The VIA Annotation Software for Images, Audio and Video.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

My Lips Are Concealed: Audio-Visual Speech Enhancement Through Obstructions.
Proceedings of the Interspeech 2019, 2019

Deep Frank-Wolfe For Neural Network Optimization.
Proceedings of the 7th International Conference on Learning Representations, 2019

Self-Supervised Learning of Class Embeddings from Video.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Video Representation Learning by Dense Predictive Coding.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Count, Crop and Recognise: Fine-Grained Recognition in the Wild.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

A Geometric Approach to Obtain a Bird's Eye View From an Image.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning to Discover Novel Visual Categories via Deep Transfer Clustering.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Controllable Attention for Structured Layered Video Decomposition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Utterance-level Aggregation for Speaker Recognition in the Wild.
Proceedings of the IEEE International Conference on Acoustics, 2019

Future Event Prediction: If and When.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Video Action Transformer Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Temporal Cycle-Consistency Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exploiting Temporal Context for 3D Human Pose Estimation in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

The Visual Centrifuge: Model-Free Layered Video Representations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Geometry-Aware Video Object Detection for Static Cameras.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Use What You Have: Video retrieval using representations from collaborative experts.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

AutoCorrect: Deep Inductive Alignment of Noisy Geometric Annotations.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Fully-automated alignment of 3D fetal brain ultrasound to a canonical reference space using multi-task learning.
Medical Image Anal., 2018

Template adaptation for face verification and identification.
Image Vis. Comput., 2018

Learning to lip read words by watching videos.
Comput. Vis. Image Underst., 2018

LRS3-TED: a large-scale dataset for visual speech recognition.
CoRR, 2018

A Short Note about Kinetics-600.
CoRR, 2018

X2Face: A network for controlling face generation by using images, audio, and pose codes.
CoRR, 2018

A Better Baseline for AVA.
CoRR, 2018

Kickstarting Deep Reinforcement Learning.
CoRR, 2018

Microscopy cell counting and detection with fully convolutional regression networks.
Comput. methods Biomech. Biomed. Eng. Imaging Vis., 2018

Learning to Navigate in Cities Without a Map.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Predicting Scoliosis in DXA Scans Using Intermediate Representations.
Proceedings of the Computational Methods and Clinical Applications for Spine Imaging, 2018

VoxCeleb2: Deep Speaker Recognition.
Proceedings of the Interspeech 2018, 2018

Deep Lip Reading: A Comparison of Models and an Online Application.
Proceedings of the Interspeech 2018, 2018

The Conversation: Deep Audio-Visual Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Learning to Read by Spelling: Towards Unsupervised Text Recognition.
Proceedings of the ICVGIP 2018: 11th Indian Conference on Computer Vision, 2018

Smooth Loss Functions for Deep Top-k Classification.
Proceedings of the 6th International Conference on Learning Representations, 2018

VGGFace2: A Dataset for Recognising Faces across Pose and Age.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

Compact Deep Aggregation for Set Retrieval.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Comparator Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

3D Surface Reconstruction by Pointillism.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

X2Face: A Network for Controlling Face Generation Using Images, Audio, and Pose Codes.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learnable PINs: Cross-modal Embeddings for Person Identity.
Proceedings of the Computer Vision - ECCV 2018, 2018

Massively Parallel Video Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Objects that Sound.
Proceedings of the Computer Vision - ECCV 2018, 2018

Turning a Blind Eye: Explicit Removal of Biases and Variation from Deep Neural Network Embeddings.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Learning and Using the Arrow of Time.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

What Have We Learned From Deep Representations for Action Recognition?
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multicolumn Networks for Face Recognition.
Proceedings of the British Machine Vision Conference 2018, 2018

Self-supervised learning of a facial attribute embedding from video.
Proceedings of the British Machine Vision Conference 2018, 2018

Inductive Visual Localisation: Factorised Training for Superior Generalisation.
Proceedings of the British Machine Vision Conference 2018, 2018

GhostVLAD for Set-Based Face Recognition.
Proceedings of the Computer Vision - ACCV 2018, 2018

NightOwls: A Pedestrians at Night Dataset.
Proceedings of the Computer Vision - ACCV 2018, 2018

Class-Agnostic Counting.
Proceedings of the Computer Vision - ACCV 2018, 2018

From Same Photo: Cheating on Visual Kinship Challenges.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Interferences in Match Kernels.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Guest Editorial: Best of CVPR 2015.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

SpineNet: Automated classification and evidence visualization in spinal MRIs.
Medical Image Anal., 2017

Human pose search using deep networks.
Image Vis. Comput., 2017

The Kinetics Human Action Video Dataset.
CoRR, 2017

Self-supervised Learning for Spinal MRIs.
Proceedings of the Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, 2017

Temporal HeartNet: Towards Human-Level Automatic Analysis of Fetal Cardiac Screening Video.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2017, 2017

VoxCeleb: A Large-Scale Speaker Identification Dataset.
Proceedings of the Interspeech 2017, 2017

Trusting SVM for Piecewise Linear CNNs.
Proceedings of the 5th International Conference on Learning Representations, 2017

Discovery of Rare Phenotypes in Cellular Images Using Weakly Supervised Deep Learning.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Detect to Track and Track to Detect.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Multi-task Self-Supervised Visual Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Look, Listen and Learn.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Multi-Task Convolutional Neural Network for Patient Detection and Skin Segmentation in Continuous Non-Contact Vital Sign Monitoring.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Recurrent Human Pose Estimation.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Lip Reading Sentences in the Wild.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

SilNet: Single- and Multi-View Reconstruction by Learning from Silhouettes.
Proceedings of the British Machine Vision Conference 2017, 2017

From Benedict Cumberbatch to Sherlock Holmes: Character Identification in TV series without a Script.
Proceedings of the British Machine Vision Conference 2017, 2017

Lip Reading in Profile.
Proceedings of the British Machine Vision Conference 2017, 2017

You said that?
Proceedings of the British Machine Vision Conference 2017, 2017

Mining Faces from Biomedical Literature using Deep Learning.
Proceedings of the 8th ACM International Conference on Bioinformatics, 2017

2016
Detecting overlapping instances in microscopy images using extremal region trees.
Medical Image Anal., 2016

Reading Text in the Wild with Convolutional Neural Networks.
Int. J. Comput. Vis., 2016

Understanding Higher-Order Shape via 3D Shape Attributes.
CoRR, 2016

Signs in time: Encoding human motion as a temporal image.
CoRR, 2016

SpineNet: Automatically Pinpointing Classification Evidence in Spinal MRIs.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, 2016

The Art of Detection.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Counting in the Wild.
Proceedings of the Computer Vision - ECCV 2016, 2016

Synthetic Data for Text Localisation in Natural Images.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

3D Shape Attributes.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Convolutional Two-Stream Network Fusion for Video Action Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Personalizing Human Video Pose Estimation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Faces in Places: compound query retrieval.
Proceedings of the British Machine Vision Conference 2016, 2016

Out of Time: Automated Lip Sync in the Wild.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

Lip Reading in the Wild.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
On-the-fly learning for visual search of large-scale image and video datasets.
Int. J. Multim. Inf. Retr., 2015

The Pascal Visual Object Classes Challenge: A Retrospective.
Int. J. Comput. Vis., 2015

Part level transfer regularization for enhancing exemplar SVMs.
Comput. Vis. Image Underst., 2015

Very Deep Convolutional Networks for Large-Scale Image Recognition.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Automatic Discovery and Optimization of Parts for Image Classification.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Deep Structured Output Learning for Unconstrained Text Recognition.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Spatial Transformer Networks.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Automatic Intervertebral Discs Localization and Segmentation: A Vertebral Approach.
Proceedings of the Computational Methods and Clinical Applications for Spine Imaging, 2015

Automatic Modic Changes Classification in Spinal MRI.
Proceedings of the Computational Methods and Clinical Applications for Spine Imaging, 2015

Flowing ConvNets for Human Pose Estimation in Videos.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Human pose search using deep poselets.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Deep Face Recognition.
Proceedings of the British Machine Vision Conference 2015, 2015

Face Painting: querying art with photos.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Learning Local Feature Descriptors Using Convex Optimisation.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Extremely Low Bit-Rate Nearest Neighbor Search Using a Set Compression Tree.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Temporal models for mitotic phase labelling.
Medical Image Anal., 2014

Deblurring Shaken and Partially Saturated Images.
Int. J. Comput. Vis., 2014

Detecting People Looking at Each Other in Videos.
Int. J. Comput. Vis., 2014

Automatic and Efficient Human Pose Estimation for Sign Language Videos.
Int. J. Comput. Vis., 2014

Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition.
CoRR, 2014

Two-Stream Convolutional Networks for Action Recognition in Videos.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Total Cluster: A person agnostic clustering method for broadcast videos.
Proceedings of the 2014 Indian Conference on Computer Vision, 2014

Domain-Adaptive Discriminative One-Shot Learning of Gestures.
Proceedings of the Computer Vision - ECCV 2014, 2014

Deep Features for Text Spotting.
Proceedings of the Computer Vision - ECCV 2014, 2014

In Search of Art.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Re-presentations of Art Collections.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Multi-Task Multi-Sample Learning.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Interactive Object Counting.
Proceedings of the Computer Vision - ECCV 2014, 2014

Seeing the Arrow of Time.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

A Compact and Discriminative Face Track Descriptor.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Triangulation Embedding and Democratic Aggregation for Image Search.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Talking Heads: Detecting Humans and Recognizing Their Interactions.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Immediate, Scalable Object Category Detection.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Speeding up Convolutional Neural Networks with Low Rank Expansions.
Proceedings of the British Machine Vision Conference, 2014

Action Recognition From Weak Alignment of Body Parts.
Proceedings of the British Machine Vision Conference, 2014

The State of the Art: Object Retrieval in Paintings using Discriminative Regions.
Proceedings of the British Machine Vision Conference, 2014

Return of the Devil in the Details: Delving Deep into Convolutional Nets.
Proceedings of the British Machine Vision Conference, 2014

Upper Body Pose Estimation with Temporal Sequential Forests.
Proceedings of the British Machine Vision Conference, 2014

Deep Convolutional Neural Networks for Efficient Pose Estimation in Gesture Videos.
Proceedings of the Computer Vision - ACCV 2014, 2014

Improving Human Action Recognition Using Score Distribution and Ranking.
Proceedings of the Computer Vision - ACCV 2014, 2014

Thread-Safe: Towards Recognizing Human Actions Across Shot Boundaries.
Proceedings of the Computer Vision - ACCV 2014, 2014

Efficient On-the-fly Category Retrieval Using ConvNets and GPUs.
Proceedings of the Computer Vision - ACCV 2014, 2014

DisLocation: Scalable Descriptor Distinctiveness for Location Recognition.
Proceedings of the Computer Vision - ACCV 2014, 2014

Visual Vocabulary with a Semantic Twist.
Proceedings of the Computer Vision - ACCV 2014, 2014

Efficient, blind, spatially-variant deblurring for shaken images.
Proceedings of the Motion Deblurring: Algorithms and Systems, 2014

2013
Deep Fisher Networks for Large-Scale Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

The AXES PRO video search system.
Proceedings of the International Conference on Multimedia Retrieval, 2013

Mitotic phase based detection of chromosome segregation errors in embryonic stem cells.
Proceedings of the 10th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2013

Symbiotic Segmentation and Part Localization for Fine-Grained Categorization.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Blocks That Shout: Distinctive Parts for Scene Classification.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Discriminative Sub-categorization.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Learning to Detect Partially Overlapping Instances.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

All About VLAD.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Towards on-the-fly Large Scale Video Search.
Proceedings of the British Machine Vision Conference, 2013

Fisher Vector Faces in the Wild.
Proceedings of the British Machine Vision Conference, 2013

Large-scale Learning of Sign Language by Watching TV (Using Co-occurrences).
Proceedings of the British Machine Vision Conference, 2013

Of Gods and Goats: Weakly Supervised Learning of Figurative Art.
Proceedings of the British Machine Vision Conference, 2013

Domain Adaptation for Upper Body Pose Tracking in Signed TV Broadcasts.
Proceedings of the British Machine Vision Conference, 2013

2012
In Memoriam: Mark Everingham.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Efficient Additive Kernels via Explicit Feature Maps.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Structured Learning of Human Interactions in TV Shows.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Non-uniform Deblurring for Shaken Images.
Int. J. Comput. Vis., 2012

2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images.
Int. J. Comput. Vis., 2012

On-the-fly specific person retrieval.
Proceedings of the 13th International Workshop on Image Analysis for Multimedia Interactive Services, 2012

Video retrieval by mimicking poses.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Name that sculpture.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Immediate ROI Search for 3-D Medical Images.
Proceedings of the Medical Content-Based Retrieval for Clinical Decision Support, 2012

Learning to Detect Cells Using Non-overlapping Extremal Regions.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2012, 2012

Discriminative Semi-Markov Models for automated mitotic phase labelling.
Proceedings of the 9th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2012

Self-similar Sketch.
Proceedings of the Computer Vision - ECCV 2012, 2012

Descriptor Learning Using Convex Optimisation.
Proceedings of the Computer Vision - ECCV 2012, 2012

Taxonomic Multi-class Prediction and Person Layout Using Efficient Structured Ranking.
Proceedings of the Computer Vision - ECCV 2012, 2012

Has My Algorithm Succeeded? An Evaluator for Human Pose Estimators.
Proceedings of the Computer Vision - ECCV 2012, 2012

TriCoS: A Tri-level Class-Discriminative Co-segmentation Method for Image Classification.
Proceedings of the Computer Vision - ECCV 2012, 2012

Sparse kernel approximations for efficient classification and detection.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Cats and dogs.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Three things everyone should know to improve object retrieval.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Automatic and Efficient Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts.
Proceedings of the British Machine Vision Conference, 2012

Latent SVMs for Human Detection with a Locally Affine Deformation Field.
Proceedings of the British Machine Vision Conference, 2012

Enhancing Exemplar SVMs using Part Level Transfer Regularization.
Proceedings of the British Machine Vision Conference, 2012

Multiple queries for large scale specific object retrieval.
Proceedings of the British Machine Vision Conference, 2012

VISOR: Towards On-the-Fly Large-Scale Object Category Retrieval.
Proceedings of the Computer Vision, 2012

2011
Harvesting Image Databases from the Web.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Geometric Latent Dirichlet Allocation on a Matching Graph for Large-scale Image Datasets.
Int. J. Comput. Vis., 2011

Upper Body Detection and Tracking in Extended Signing Sequences.
Int. J. Comput. Vis., 2011

Pylon Model for Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Immediate Structured Visual Search for Medical Images.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2011, 2011

Humanising GrabCut: Learning to segment humans using the Kinect.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Learning equivariant structured output SVM regressors.
Proceedings of the IEEE International Conference on Computer Vision, 2011

The truth about cats and dogs.
Proceedings of the IEEE International Conference on Computer Vision, 2011

BiCoS: A Bi-level co-segmentation method for image classification.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Tabula rasa: Model transfer for object category detection.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Smooth object retrieval using a bag of boundaries.
Proceedings of the IEEE International Conference on Computer Vision, 2011

CLAROS - Collaborating on Delivering the Future of the Past.
Proceedings of the 6th Annual International Conference of the Alliance of Digital Humanities Organizations, 2011

Hand detection using multiple proposals.
Proceedings of the British Machine Vision Conference, 2011

"Here's looking at you, kid". Detecting people looking at each other in videos.
Proceedings of the British Machine Vision Conference, 2011

The devil is in the details: an evaluation of recent feature encoding methods.
Proceedings of the British Machine Vision Conference, 2011

2010
Learning Object Categories From Internet Image Searches.
Proc. IEEE, 2010

OBJCUT: Efficient Segmentation Using Top-Down and Bottom-Up Cues.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Delving deeper into the whorl of flower segmentation.
Image Vis. Comput., 2010

Illuminance Flow Estimation by Regression.
Int. J. Comput. Vis., 2010

The Pascal Visual Object Classes (VOC) Challenge.
Int. J. Comput. Vis., 2010

Oxford-IIIT TRECVID 2010 - Notebook paper.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Learning To Count Objects in Images.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Simultaneous Object Detection and Ranking with Weak Supervision.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Descriptor Learning for Efficient Retrieval.
Proceedings of the Computer Vision, 2010

Human Focused Action Localization in Video.
Proceedings of the Trends and Topics in Computer Vision, 2010

Semi-supervised Learning of Facial Attributes in Video.
Proceedings of the Trends and Topics in Computer Vision, 2010

Finding Nemo: Deformable object class modelling using curve matching.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Geodesic star convexity for interactive image segmentation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Generalized RBF feature maps for Efficient Detection.
Proceedings of the British Machine Vision Conference, 2010

High Five: Recognising human interactions in TV shows.
Proceedings of the British Machine Vision Conference, 2010

Efficient image retrieval for 3D structures.
Proceedings of the British Machine Vision Conference, 2010

2009
A Statistical Approach to Material Classification Using Image Patch Exemplars.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Efficient Visual Search of Videos Cast as Text Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Taking the bite out of automated naming of characters in TV video.
Image Vis. Comput., 2009

Bayesian Methods for Image Super-Resolution.
Comput. J., 2009

Oxford-IIIT TRECVID 2009 Notebook paper.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Structured output regression for detection with partial truncation.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Segmenting Scenes by Matching Image Composites.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Efficient retrieval of deformable shape classes using local self-similarities.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

Multiple kernels for object detection.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Non-local sparse models for image restoration.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Efficient discriminative learning of parts-based models.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

CLAROS - Bringing Classical Art to a Global Public.
Proceedings of the Fifth International Conference on e-Science, 2009

"Who are you?" - Learning person specific classifiers from video.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Pose search: Retrieving people using their pose.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Learning sign language by watching TV (using weakly aligned subtitles).
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Automatic retrieval of visual continuity errors in movies.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

Get Out of my Picture! Internet-based Inpainting.
Proceedings of the British Machine Vision Conference, 2009

Subtitle-free Movie to Script Alignment.
Proceedings of the British Machine Vision Conference, 2009

2008
Efficient Visual Search for Objects in Videos.
Proc. IEEE, 2008

Scene Classification Using a Hybrid Generative/Discriminative Approach.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection.
Int. J. Comput. Vis., 2008

Learning Layered Motion Segmentations of Video.
Int. J. Comput. Vis., 2008

Oxford/IIIT TRECVID 2008 - Notebook paper.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Supervised Dictionary Learning.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Object Mining Using a Matching Graph on Very Large Image Collections.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Automated Flower Classification over a Large Number of Classes.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Texture classification with minimal training images.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

2D Human Pose Estimation in TV Shows.
Proceedings of the Statistical and Geometrical Approaches to Visual Motion Analysis, 2008

Unsupervised discovery of visual object class hierarchies.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Lost in quantization: Improving particular object retrieval in large scale image databases.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Discriminative learned dictionaries for local image analysis.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Progressive search space reduction for human pose estimation.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

University of Oxford video retrieval system.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Object Class Segmentation using Random Forests.
Proceedings of the British Machine Vision Conference 2008, Leeds, UK, September 2008, 2008

Geometric LDA: A Generative Model for Particular Object Discovery.
Proceedings of the British Machine Vision Conference 2008, Leeds, UK, September 2008, 2008

Near Duplicate Image Detection: min-Hash and tf-idf Weighting.
Proceedings of the British Machine Vision Conference 2008, Leeds, UK, September 2008, 2008

Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts.
Proceedings of the British Machine Vision Conference 2008, Leeds, UK, September 2008, 2008

2007
Tracking People by Learning Their Appearance.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition.
Int. J. Comput. Vis., 2007

Overcoming Registration Uncertainty in Image Super-Resolution: Maximize or Marginalize?
EURASIP J. Adv. Signal Process., 2007

Oxford TRECVid 2007 – Notebook paper.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Learning Visual Attributes.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

An Invariant Large Margin Nearest Neighbour Classifier.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Image Classification using Random Forests and Ferns.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Object retrieval with large vocabularies and fast spatial matching.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

An Exemplar Model for Learning Object Classes.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Scalable near identical image and shot detection.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

Representing shape with a spatial pyramid kernel.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

Delving into the Whorl of Flower Segmentation.
Proceedings of the British Machine Vision Conference 2007, 2007

Who Are You? - Real-time Person Identification.
Proceedings of the British Machine Vision Conference 2007, 2007

2006
Object Level Grouping for Video Shots.
Int. J. Comput. Vis., 2006

Editorial.
Int. J. Comput. Vis., 2006

Oxford TRECVID 2006 - Notebook paper.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Bayesian Image Super-resolution, Continued.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Single-Histogram Class Models for Image Segmentation.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Learning Class-Specific Edges for Object Detection and Segmentation.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Regression and Classification Approaches to Eye Localization in Face Images.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

A Boundary-Fragment-Model for Object Detection.
Proceedings of the Computer Vision, 2006

Scene Classification Via pLSA.
Proceedings of the Computer Vision, 2006

Using Multiple Segmentations to Discover Objects and their Extent in Image Collections.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Incremental learning of object detectors using a visual shape alphabet.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

A Visual Vocabulary for Flower Classification.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Solving Markov Random Fields using Second Order Cone Programming Relaxations.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Video Google: Efficient Visual Search of Videos.
Proceedings of the Toward Category-Level Object Recognition, 2006

An Object Category Specific MRF for Segmentation.
Proceedings of the Toward Category-Level Object Recognition, 2006

A Sparse Object Category Model for Efficient Learning and Complete Recognition.
Proceedings of the Toward Category-Level Object Recognition, 2006

Optimizing and Learning for Super-resolution.
Proceedings of the British Machine Vision Conference 2006, 2006

Fusing Shape and Appearance Information for Object Category Detection.
Proceedings of the British Machine Vision Conference 2006, 2006

"Hello! My name is... Buffy" – Automatic Naming of Characters in TV Video.
Proceedings of the British Machine Vision Conference 2006, 2006

Multiple view geometry in computer vision (2. ed.).
Cambridge University Press, ISBN: 978-0-521-54051-3, 2006

2005
A Statistical Approach to Texture Classification from Single Images.
Int. J. Comput. Vis., 2005

A Comparison of Affine Region Detectors.
Int. J. Comput. Vis., 2005

Image-Based Rendering Using Image-Based Priors.
Int. J. Comput. Vis., 2005

Discovering Objects and their Localization in Images.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Learning Layered Motion Segmentation of Video.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Learning Object Categories from Google's Image Search.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Identifying Individuals in Video by Combining "Generative" and Discriminative Head Models.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Fast and Controllable 3D Modelling From Silhouettes.
Proceedings of the 26th Annual Conference of the European Association for Computer Graphics, 2005

Estimating the Affine Transformation between Textures.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2005

Tracking People and Recognizing Their Activities.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Strike a Pose: Tracking People by Finding Stylized Poses.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

OBJ CUT.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

A Sparse Object Category Model for Efficient Learning and Exhaustive Recognition.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Automatic Face Recognition for Film Character Retrieval in Feature-Length Films.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Person Spotting: Video Shot Retrieval for Face Sets.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

2004
Unifying statistical texture classification frameworks.
Image Vis. Comput., 2004

Minimal projective reconstruction for combinations of points and lines in three views.
Image Vis. Comput., 2004

Guest editorial.
Image Vis. Comput., 2004

Efficient Visual Content Retrieval and Mining in Videos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Sampling Methods for Unsupervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 17, 2004

Trainable Visual Models for Object Class Recognition.
Proceedings of the ICVGIP 2004, 2004

Learning Layered Pictorial Structures from Video.
Proceedings of the ICVGIP 2004, 2004

Automated Visual Identification of Characters in Situation Comedies.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Efficient object retrieval from videos.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

Human Detection Based on a Probabilistic Assembly of Robust Part Detectors.
Proceedings of the Computer Vision, 2004

An Affine Invariant Salient Region Detector.
Proceedings of the Computer Vision, 2004

A Visual Category Filter for Google Images.
Proceedings of the Computer Vision, 2004

A Linguistic Feature Vector for the Visual Interpretation of Sign Language.
Proceedings of the Computer Vision, 2004

04021 Abstracts Collection - Content-Based Retrieval.
Proceedings of the Content-Based Retrieval, 4.-9. January 2004, 2004

Estimating Illumination Direction from Textured Images.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Video Data Mining Using Configurations of Viewpoint Invariant Regions.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Augmenting Images of Non-Rigid Scenes Using Point and Curve Correspondences.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Automated Person Identification in Video.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004

Extending Pictorial Structures for Object Recognition.
Proceedings of the British Machine Vision Conference, 2004

Minimal Training, Large Lexicon, Unconstrained Sign Language Recognition.
Proceedings of the British Machine Vision Conference, 2004

Direct Estimation of Non-Rigid Registration.
Proceedings of the British Machine Vision Conference, 2004

Multiple View Geometry in Computer Vision.
Cambridge University Press, ISBN: 9780511811685, 2004

2003
Computer vision applied to super resolution.
IEEE Signal Process. Mag., 2003

Geometry of Single Axis Motions Using Conic Fitting.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Automated location matching in movies.
Comput. Vis. Image Underst., 2003

A Sampled Texture Prior for Image Super-Resolution.
Proceedings of the Advances in Neural Information Processing Systems 16, 2003

Automated multisensor polyhedral model acquisition.
Proceedings of the 2003 IEEE International Conference on Robotics and Automation, 2003

Video Google: A Text Retrieval Approach to Object Matching in Videos.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Projective Reconstruction of Surfaces of Revolution.
Proceedings of the Pattern Recognition, 2003

Learning epipolar geometry from image sequences.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Texture Classification: Are Filter Banks Necessary?
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Linear Auto-Calibration for Ground Plane Motion.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Joint Manifold Distance: a new approach to appearance based clustering.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Object Class Recognition by Unsupervised Scale-Invariant Learning.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Shape recognition with edge-based features.
Proceedings of the British Machine Vision Conference, 2003

2002
Estimation of the partial volume effect in MRI.
Medical Image Anal., 2002

Image-based Environment Matting.
Proceedings of the 13th Eurographics Workshop on Rendering Techniques, 2002

Statistical Approaches to Material Classification.
Proceedings of the ICVGIP 2002, 2002

Automated reconstruction from multiple photographs.
Proceedings of the 2002 International Conference on Image Processing, 2002

Bayesian Estimation of Layers from Multiple Images.
Proceedings of the Computer Vision, 2002

New Techniques for Automated Architectural Reconstruction from Photographs.
Proceedings of the Computer Vision, 2002

Classifying Images of Materials: Achieving Viewpoint and Illumination Independence.
Proceedings of the Computer Vision, 2002

Multi-view Matching for Unordered Image Sets, or "How Do I Organize My Holiday Snaps?".
Proceedings of the Computer Vision, 2002

Single Axis Geometry by Fitting Conics.
Proceedings of the Computer Vision, 2002

On Affine Invariant Clustering and Automatic Cast Listing in Movies.
Proceedings of the Computer Vision, 2002

Automated Scene Matching in Movies.
Proceedings of the Image and Video Retrieval, International Conference, 2002

Model selection for automated reconstruction from multiple views.
Proceedings of the British Machine Vision Conference 2002, 2002

2001
Viewpoint Invariant Texture Matching and Wide Baseline Stereo.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Super-Resolution from Multiple Views Using Learnt Image Models.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

2000
Segmentation and measurement of brain structures in MRI including confidence bounds.
Medical Image Anal., 2000

Planar grouping for automatic detection of vanishing lines and points.
Image Vis. Comput., 2000

The Geometry and Matching of Lines and Curves Over Multiple Views.
Int. J. Comput. Vis., 2000

Single View Metrology.
Int. J. Comput. Vis., 2000

MLESAC: A New Robust Estimator with Application to Estimating Image Geometry.
Comput. Vis. Image Underst., 2000

Markerless tracking using planar structures in the scene.
Proceedings of the IEEE and ACM International Symposium on Augmented Reality, 2000

Super-Resolution Enhancement of Text Image Sequences.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

A Six Point Solution for Structure and Motion.
Proceedings of the Computer Vision - ECCV 2000, 6th European Conference on Computer Vision, Dublin, Ireland, June 26, 2000

Multibody Structure and Motion: 3-D Reconstruction of Independently Moving Objects.
Proceedings of the Computer Vision - ECCV 2000, 6th European Conference on Computer Vision, Dublin, Ireland, June 26, 2000

Stereo Autocalibration from One Plane.
Proceedings of the Computer Vision - ECCV 2000, 6th European Conference on Computer Vision, Dublin, Ireland, June 26, 2000

Shape from Texture: Homogeneity Revisited.
Proceedings of the British Machine Vision Conference 2000, 2000

From Images to Virtual and Augmented Reality.
Proceedings of the Confluence of Computer Vision and Computer Graphics, 2000

Surface Reconstruction from Multiple Views Using Apparent Contours and Surface Texture.
Proceedings of the Confluence of Computer Vision and Computer Graphics, 2000

1999
A plane measuring device.
Image Vis. Comput., 1999

The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences.
Int. J. Comput. Vis., 1999

Creating Architectural Models from Images.
Comput. Graph. Forum, 1999

Integrating Geometric and Photometric Information for Image Retrieval.
Proceedings of the Shape, Contour and Grouping in Computer Vision, 1999

Geometric Grouping of Repeated Elements within Images.
Proceedings of the Shape, Contour and Grouping in Computer Vision, 1999

VHS to VRML: 3D Graphical Models from Video Sequences.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Feature Based Methods for Structure and Motion Estimation.
Proceedings of the Vision Algorithms: Theory and Practice, 1999

Combining Scene and Auto-Calibration Constraints.
Proceedings of the International Conference on Computer Vision, 1999

Parallax Geometry of Smooth Surfaces in Multiple Views.
Proceedings of the International Conference on Computer Vision, 1999

Automatic Reconstruction of Piecewise Planar Models from Multiple Views.
Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

Improving Augmented Reality using Image and Scene Constraints.
Proceedings of the British Machine Vision Conference 1999, 1999

1998
Planar homologies as a basis for grouping and recognition.
Image Vis. Comput., 1998

Book Review: Epipolar Geometry in Stereo, Motion and Object Recognition - A Unified Approach. By Gang Xu and Zhengyou Zhang. Published by Kluwer Academic Publishers Group; 1996; 313 pages; US$ 160.
Int. J. Robotics Res., 1998

Robust Detection of Degenerate Configurations while Estimating the Fundamental Matrix.
Comput. Vis. Image Underst., 1998

Matching and Reconstruction from Widely Separated Views.
Proceedings of the 3D Structure from Multiple Images of Large-Scale Environments, 1998

Automatic 3D Model Construction for Turn-Table Sequences.
Proceedings of the 3D Structure from Multiple Images of Large-Scale Environments, 1998

Measurement of Brain Structures Based on Statistical and Geometrical 3D Segmentation.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 1998

Robust Computation and Parametrization of Multiple View Relations.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Maintaining Multiple Motion Model Hypotheses Through Many Views to Recover Matching and Structure.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Wide Baseline Stereo Matching.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Quadric Surface Reconstruction from Dual-Space Geometry.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Automatic 3D model acquisition and generation of new images from video sequences.
Proceedings of the 9th European Signal Processing Conference, 1998

Concerning Bayesian Motion Segmentation, Model Averaging, Matching and the Trifocal Tensor.
Proceedings of the Computer Vision, 1998

The Geometry and Matching of Curves in Multiple Views.
Proceedings of the Computer Vision, 1998

Automatic Camera Recovery for Closed or Open Image Sequences.
Proceedings of the Computer Vision, 1998

Duality, Rigidity and Planar Parallax.
Proceedings of the Computer Vision, 1998

Metric Rectification for Perspective Images of Planes.
Proceedings of the 1998 Conference on Computer Vision and Pattern Recognition (CVPR '98), 1998

Automatic Mosaicing with Super-Resolution Zoom.
Proceedings of the 1998 Conference on Computer Vision and Pattern Recognition (CVPR '98), 1998

Real-time Panoramic Mosaics and Augmented Reality.
Proceedings of the British Machine Vision Conference 1998, 1998

1997
Performance characterization of fundamental matrix estimation under image degradation.
Mach. Vis. Appl., 1997

Robust parameterization and computation of the trifocal tensor.
Image Vis. Comput., 1997

Sequential Updating of Projective and Affine Structure from Motion.
Int. J. Comput. Vis., 1997

Automatic 3D model building from video sequences.
Eur. Trans. Telecommun., 1997

Finding Point Correspondences in Motion Sequences Preserving Affine Structure.
Comput. Vis. Image Underst., 1997

Automatic line matching across views.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

1996
Detection and tracking of independent motion.
Image Vis. Comput., 1996

Semi-local projective invariants for the recognition of smooth plane curves.
Int. J. Comput. Vis., 1996

Visualising Cerebral Asymmetry.
Proceedings of the Visualization in Biomedical Computing, 4th International Conference, 1996

Goal-directed Video Metrology.
Proceedings of the Computer Vision, 1996

Report on the 1996 International Workshop on Object Representation in Computer Vision.
Proceedings of the Object Representation in Computer Vision II, 1996

An Experimental Comparison of Appearance and Geometric Model Based Recognition.
Proceedings of the Object Representation in Computer Vision II, 1996

3D Model Acquisition from Extended Image Sequences.
Proceedings of the Computer Vision, 1996

Motion Deblurring and Super-resolution from an Image Sequence.
Proceedings of the Computer Vision, 1996

Self-Calibration from Image Triplets.
Proceedings of the Computer Vision, 1996

Detecting and Tracking Linear Features Efficiently.
Proceedings of the British Machine Vision Conference 1996, 1996

1995
Viewpoint-invariant representation of generalized cylinders using the symmetry set.
Image Vis. Comput., 1995

3D Motion recovery via affine Epipolar geometry.
Int. J. Comput. Vis., 1995

Planar object recognition using projective shape representation.
Int. J. Comput. Vis., 1995

3D Object Recognition Using Invariance.
Artif. Intell., 1995

Class-Based Grouping in Perspective Images.
Proceedings of the Fifth International Conference on Computer Vision (ICCV 95), 1995

Robust Detection of Degenerate Configurations for the Fundamental Matrix.
Proceedings of the Fifth International Conference on Computer Vision (ICCV 95), 1995

Active Visual Navigation Using Non-Metric Structure.
Proceedings of the Fifth International Conference on Computer Vision (ICCV 95), 1995

Uncalibrated X-Ray Stereo Reconstruction.
Proceedings of the British Machine Vision Conference, 1995

MORSE: An Architecture for 3D Object Recognition Based on Invariants.
Proceedings of the Recent Developments in Computer Vision, 1995

1994
Extracting structure from an affine view of a 3D point set with one or two bilateral symmetries.
Image Vis. Comput., 1994

Distinctive Representations for the Recognition of Curved Surfaces Using Outlines and Markings.
Proceedings of the Object Representation in Computer Vision, 1994

Extraction of events from 3D volumes of seismic data.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

Identification of Events from 3D Volumes of Seismic Data.
Proceedings of the 1994 International Conference on Image Processing, 1994

Motion From Point Matches Using Affine Epipolar Geometry.
Proceedings of the Computer Vision, 1994

Navigation using Affine Structure from Motion.
Proceedings of the Computer Vision, 1994

Using global consistency to recognise Euclidean objects with an uncalibrated camera.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1994

Euclidean Structure from Uncalibrated Images.
Proceedings of the British Machine Vision Conference, 1994

1993
A framework for spatiotemporal control in the tracking of visual contours.
Int. J. Comput. Vis., 1993

Eliciting qualitative structure from image curve deformations.
Proceedings of the Fourth International Conference on Computer Vision, 1993

Extracting projective structure from single perspective views of 3D point sets.
Proceedings of the Fourth International Conference on Computer Vision, 1993

Affine-invariant contour tracking with automatic control of spatiotemporal scale.
Proceedings of the Fourth International Conference on Computer Vision, 1993

Efficient recognition of rotationally symmetric surfaces and straight homogeneous generalized cylinders.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1993

Seismic Time Section Analysis Using Machine Vision.
Proceedings of the British Machine Vision Conference, 1993

A Case Against Epipolar Geometry.
Proceedings of the Applications of Invariance in Computer Vision, Second Joint European, 1993

Introduction and Chapter Summary.
Proceedings of the Applications of Invariance in Computer Vision, Second Joint European, 1993

Repeated Structures: Image Correspondence Constraints and 3D Structure Recovery.
Proceedings of the Applications of Invariance in Computer Vision, Second Joint European, 1993

1992
Relative motion and pose from arbitrary plane curves.
Image Vis. Comput., 1992

Transformational invariance - a primer.
Image Vis. Comput., 1992

Canonical Frames for Planar Object Recognition.
Proceedings of the Computer Vision, 1992

Recognising rotationally symmetric surfaces from their outlines.
Proceedings of the Computer Vision, 1992

Real-time Visual Tracking for Surveillance and Path Planning.
Proceedings of the Computer Vision, 1992

Camera Calibration Using Multiple Images.
Proceedings of the Computer Vision, 1992

Efficient model library access by projectively invariant indexing functions.
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1992

Affine and Projective Structure from Motion.
Proceedings of the British Machine Vision Conference, 1992

1991
Reflections on Shading.
IEEE Trans. Pattern Anal. Mach. Intell., 1991

Invariant Descriptors for 3D Object Recognition and Pose.
IEEE Trans. Pattern Anal. Mach. Intell., 1991

Projectively invariant representations using implicit algebraic curves.
Image Vis. Comput., 1991

Cooperating Motion Processes.
Proceedings of the British Machine Vision Conference, 1991

Using Projective Invariants for Constant Time Library Indexing in Model Based Vision.
Proceedings of the British Machine Vision Conference, 1991

1990
Shape from shading in the light of mutual illumination.
Image Vis. Comput., 1990

Invariance - a new framework for vision.
Proceedings of the Third International Conference on Computer Vision, 1990

Relative motion and pose from invariants.
Proceedings of the British Machine Vision Conference, 1990

Towards qualitative vision: motion parallax.
Proceedings of the British Machine Vision Conference, 1990

1989
The information available to a moving observer from specularities.
Image Vis. Comput., 1989

Using a mixed wave/diffusion process to elicit the symmetry set.
Image Vis. Comput., 1989

Mutual illumination.
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1989

1987
Localising discontinuities using weak continuity constraints.
Pattern Recognit. Lett., 1987

Knowledge source for describing stereoscopically viewed textured surfaces.
Image Vis. Comput., 1987

Visual Reconstruction.
MIT Press, ISBN: 0-262-02271-0, 1987

1986
Weak Continuity Constraints Generate Uniform Scale-Space Descriptions of Plane Curves.
Proceedings of the Advances in Artificial Intelligence II, 1986

1985
Surface descriptions from stereo and shading.
Image Vis. Comput., 1985

