Clinton Fookes

Orcid: 0000-0002-8515-6324

According to our database1, Clinton Fookes authored at least 363 papers between 2000 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
FICE: Text-conditioned fashion-image editing with guided GAN inversion.
Pattern Recognit., 2025

2024
Synthetic Data for Deep Learning in Computer Vision & Medical Imaging: A Means to Reduce Data Bias.
ACM Comput. Surv., November, 2024

GeoAdapt: Self-Supervised Test-Time Adaptation in LiDAR Place Recognition Using Geometric Priors.
IEEE Robotics Autom. Lett., January, 2024

AG-ReID.v2: Bridging Aerial and Ground Views for Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2024

FactoFormer: Factorized Hyperspectral Transformers With Self-Supervised Pretraining.
IEEE Trans. Geosci. Remote. Sens., 2024

Multimodal Colearning Meets Remote Sensing: Taxonomy, State of the Art, and Future Works.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

Deep cross-domain transfer for emotion recognition via joint learning.
Multim. Tools Appl., 2024

Revisiting the Role of Texture in 3D Person Re-identification.
CoRR, 2024

Physics Augmented Tuple Transformer for Autism Severity Level Detection.
CoRR, 2024

Online 6DoF Pose Estimation in Forests using Cross-View Factor Graph Optimisation and Deep Learned Re-localisation.
CoRR, 2024

PseudoNeg-MAE: Self-Supervised Point Cloud Learning using Conditional Pseudo-Negative Embeddings.
CoRR, 2024

AI and Entrepreneurship: Facial Recognition Technology Detects Entrepreneurs, Outperforming Human Experts.
CoRR, 2024

PINNs for Medical Image Analysis: A Survey.
CoRR, 2024

SALVE: A 3D Reconstruction Benchmark of Wounds from Consumer-grade Videos.
CoRR, 2024

Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach.
CoRR, 2024

Part-based Quantitative Analysis for Heatmaps.
CoRR, 2024

Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields.
CoRR, 2024

Zoom-shot: Fast and Efficient Unsupervised Zero-Shot Transfer of CLIP to Vision Encoders with Multimodal Loss.
CoRR, 2024

SafeSea: Synthetic Data Generation for Adverse & Low Probability Maritime Conditions.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024

Localisation of Racial Information in Chest X-Ray for Deep Learning Diagnosis.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Uncertainty Driven Bottleneck Attention U-Net For Organ at Risk Segmentation.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024


Multi-Stage Learning for Radar Pulse Activity Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2024

NeRF Director: Revisiting View Selection in Neural Volume Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Multi-stage stacked temporal convolution neural networks (MS-S-TCNs) for biosignal segmentation and anomaly localization.
Pattern Recognit., July, 2023

Spectral Geometric Verification: Re-Ranking Point Cloud Retrieval for Metric Localization.
IEEE Robotics Autom. Lett., May, 2023

Pose-driven attention-guided image generation for person re-Identification.
Pattern Recognit., May, 2023

Meta-transfer learning for emotion recognition.
Neural Comput. Appl., May, 2023

Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion.
Dataset, April, 2023

Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion.
Dataset, April, 2023

Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion.
Dataset, April, 2023

Generalized Generative Deep Learning Models for Biosignal Synthesis and Modality Transfer.
IEEE J. Biomed. Health Informatics, February, 2023

Toward On-Board Panoptic Segmentation of Multispectral Satellite Images.
IEEE Trans. Geosci. Remote. Sens., 2023

Complex-Valued Iris Recognition Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Continuous Human Action Recognition for Human-machine Interaction: A Review.
ACM Comput. Surv., 2023

WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments.
CoRR, 2023

Deep Learning Approaches for Seizure Video Analysis: A Review.
CoRR, 2023

FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pre-Training.
CoRR, 2023

A Survey on Physics Informed Reinforcement Learning: Review and Open Problems.
CoRR, 2023

Learning Through Guidance: Knowledge Distillation for Endoscopic Image Classification.
CoRR, 2023

GeoAdapt: Self-Supervised Test-Time Adaption in LiDAR Place Recognition Using Geometric Priors.
CoRR, 2023

General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation.
CoRR, 2023

Physics-Informed Computer Vision: A Review and Perspectives.
CoRR, 2023

Remembering What Is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction.
CoRR, 2023

Physical Adversarial Attacks for Surveillance: A Survey.
CoRR, 2023

Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models.
CoRR, 2023

Uncertainty Driven Bottleneck Attention U-net for OAR Segmentation.
CoRR, 2023

Boosting Zero-shot Classification with Synthetic Data Diversity via Stable Diffusion.
CoRR, 2023

DBCE : A Saliency Method for Medical Deep Learning Through Anatomically-Consistent Free-Form Deformations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Piecewise Deterministic Markov Processes for Bayesian Neural Networks.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Enhancing Emitter Localization Accuracy Through Integration of Received Signal Strength in Direct Position Determination.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2023

When to Use Augmentation - Variability Insufficient for Cortical Thickness Estimation Improvement.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Generalization Properties of Geometric 3D Deep Learning Models for Medical Segmentation.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Dual Memory Fusion for Multimodal Speech Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Aerial-Ground Person Re-ID.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023


Bias Identification with RankPix Saliency.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multi-Task Learning For Radar Signal Characterisation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Uncertainty in Real-Time Semantic Segmentation on Embedded Systems.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Elasticity Meets Continuous-Time: Map-Centric Dense 3D LiDAR SLAM.
IEEE Trans. Robotics, 2022

Deep Auto-Encoders With Sequential Learning for Multimodal Dimensional Emotion Recognition.
IEEE Trans. Multim., 2022

Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings.
IEEE J. Biomed. Health Informatics, 2022

Geometric Deep Learning for Subject Independent Epileptic Seizure Prediction Using Scalp EEG Signals.
IEEE J. Biomed. Health Informatics, 2022

Component-Based Attention for Large-Scale Trademark Retrieval.
IEEE Trans. Inf. Forensics Secur., 2022

3-D Bi-directional LSTM for Satellite Soil Moisture Downscaling.
IEEE Trans. Geosci. Remote. Sens., 2022

Channel Graph Regularized Correlation Filters for Visual Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2022

An efficient framework for zero-shot sketch-based image retrieval.
Pattern Recognit., 2022

Split 'n' merge net: A dynamic masking network for multi-task attention.
Pattern Recognit., 2022

Quantifiable brain atrophy synthesis for benchmarking of cortical thickness estimation methods.
Medical Image Anal., 2022

Affect recognition from scalp-EEG using channel-wise encoder networks coupled with geometric deep learning and multi-channel feature fusion.
Knowl. Based Syst., 2022

Learning test-time augmentation for content-based image retrieval.
Comput. Vis. Image Underst., 2022

Deep Learning for Medical Anomaly Detection - A Survey.
ACM Comput. Surv., 2022

Using Auxiliary Information for Person Re-Identification - A Tutorial Overview.
CoRR, 2022

CorticalFlow: A Diffeomorphic Mesh Deformation Module for Cortical Surface Reconstruction.
CoRR, 2022

Towards On-Board Panoptic Segmentation of Multispectral Satellite Images.
CoRR, 2022

Learning Dense Correspondence from Synthetic Environments.
CoRR, 2022

The State of Aerial Surveillance: A Survey.
CoRR, 2022

A survey on graph-based deep learning for computational histopathology.
Comput. Medical Imaging Graph., 2022

When AI meets store layout design: a review.
Artif. Intell. Rev., 2022

Jointly Trained Conversion Model With LPCNet for Any-to-One Voice Conversion Using Speaker-Independent Linguistic Features.
IEEE Access, 2022

Lesser of Two Evils Improves Learning in the Context of Cortical Thickness Estimation Models - Choose Wisely.
Proceedings of the Data Augmentation, Labelling, and Imperfections, 2022

CorticalFlow<sup>++</sup>: Boosting Cortical Surface Reconstruction Accuracy, Regularity, and Interoperability.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

InCloud: Incremental Learning for Point Cloud Place Recognition.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Detecting Heart Failure Through Voice Analysis using Self-Supervised Mode-Based Memory Fusion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place Recognition.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

SESS: Saliency Enhancing with Scaling and Sliding.
Proceedings of the Computer Vision - ECCV 2022, 2022

Does Interference Exist When Training a Once-For-All Network?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
A Robust Interpretable Deep Learning Classifier for Heart Anomaly Detection Without Segmentation.
IEEE J. Biomed. Health Informatics, 2021

Identification of Children at Risk of Schizophrenia via Deep Learning and EEG Responses.
IEEE J. Biomed. Health Informatics, 2021

TMMF: Temporal Multi-Modal Fusion for Single-Stage Continuous Gesture Recognition.
IEEE Trans. Image Process., 2021

End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2021

Detection of Fake and Fraudulent Faces via Neural Memory Networks.
IEEE Trans. Inf. Forensics Secur., 2021

Domain Generalization in Biosignal Classification.
IEEE Trans. Biomed. Eng., 2021

Deep Inverse Reinforcement Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion.
IEEE Signal Process. Mag., 2021

Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future.
Sensors, 2021

Memory based fusion for multi-modal deep learning.
Inf. Fusion, 2021

Multi-modal semantic image segmentation.
Comput. Vis. Image Underst., 2021

Point Cloud Segmentation Using Sparse Temporal Local Attention.
CoRR, 2021

Discriminative Domain-Invariant Adversarial Network for Deep Domain Generalization.
CoRR, 2021

Multi-Slice Net: A novel light weight framework for COVID-19 Diagnosis.
CoRR, 2021

Preserving Semantic Consistency in Unsupervised Domain Adaptation Using Generative Adversarial Networks.
CoRR, 2021

Deep Domain Generalization with Feature-norm Network.
CoRR, 2021

Im2Mesh GAN: Accurate 3D Hand Mesh Recovery from a Single RGB Image.
CoRR, 2021

IGSSTRCF: Importance Guided Sparse Spatio-Temporal Regularized Correlation Filters For Tracking.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

DeepCSR: A 3D Deep Learning Approach for Cortical Surface Reconstruction.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

CorticalFlow: A Diffeomorphic Mesh Transformer Network for Cortical Surface Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Detail Matters: High-Frequency Content for Realistic Synthetic MRI Generation.
Proceedings of the Simulation and Synthesis in Medical Imaging, 2021

A Multiple Decoder Cnn For Inverse Consistent 3d Image Registration.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Smocam: Smooth Conditional Attention Mask For 3d-Regression Models.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Going Deeper With Brain Morphometry Using Neural Networks.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Learning Regional Attention Over Multi-Resolution Deep Convolutional Features For Trademark Retrieval.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Reduction of Feature Contamination for Hyper Spectral Image Classification.
Proceedings of the 2021 Digital Image Computing: Techniques and Applications, 2021

A Comparison of Saliency Methods for Deep Learning Explainability.
Proceedings of the 2021 Digital Image Computing: Techniques and Applications, 2021

MongeNet: Efficient Sampler for Geometric Deep Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Memory Augmented Deep Generative Models for Forecasting the Next Shot Location in Tennis.
IEEE Trans. Knowl. Data Eng., 2020

Heart Sound Segmentation Using Bidirectional LSTMs With Attention.
IEEE J. Biomed. Health Informatics, 2020

Constrained Design of Deep Iris Networks.
IEEE Trans. Image Process., 2020

Target-Specific Siamese Attention Network for Real-Time Object Tracking.
IEEE Trans. Inf. Forensics Secur., 2020

Temporarily-Aware Context Modeling Using Generative Adversarial Networks for Speech Activity Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Spatiotemporal Camera-LiDAR Calibration: A Targetless and Structureless Approach.
IEEE Robotics Autom. Lett., 2020

Hierarchical Attention Network for Action Segmentation.
Pattern Recognit. Lett., 2020

Correlation-aware adversarial domain adaptation and generalization.
Pattern Recognit., 2020

Context from within: Hierarchical context modeling for semantic segmentation.
Pattern Recognit., 2020

Fine-grained action segmentation using the semi-supervised action GAN.
Pattern Recognit., 2020

Neural memory plasticity for medical anomaly detection.
Neural Networks, 2020

MTRNet++: One-stage mask-based scene text eraser.
Comput. Vis. Image Underst., 2020

Joint identification-verification for person re-identification: A four stream deep learning approach with improved quartet loss function.
Comput. Vis. Image Underst., 2020

LSTM guided ensemble correlation filter tracking with appearance model pool.
Comput. Vis. Image Underst., 2020

Patient-independent Epileptic Seizure Prediction using Deep Learning Models.
CoRR, 2020

Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers.
CoRR, 2020

Multi-modal Fusion for Single-Stage Continuous Gesture Recognition.
CoRR, 2020

Memory Based Attentive Fusion.
CoRR, 2020

Meta Transfer Learning for Emotion Recognition.
CoRR, 2020

Bayesian Neural Networks: An Introduction and Survey.
CoRR, 2020

Understanding the Importance of Heart Sound Segmentation for Heart Anomaly Detection.
CoRR, 2020

Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection.
CoRR, 2020

Joint Deep Cross-Domain Transfer Learning for Emotion Recognition.
CoRR, 2020

Enhancing Feature Invariance with Learned Image Transformations for Image Retrieval.
CoRR, 2020

Semantic Consistency and Identity Mapping Multi-Component Generative Adversarial Network for Person Re-Identification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

3D Brain MRI GAN-Based Synthesis Conditioned on Partial Volume Maps.
Proceedings of the Simulation and Synthesis in Medical Imaging, 2020

Two-Stream Deep Feature Modelling for Automated Video Endoscopy Data Analysis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Attention Driven Fusion for Multi-Modal Emotion Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Neural Memory Networks for Seizure Type Classification.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

Attention Networks for Multi-Task Signal Analysis.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

Fruit Detection in the Wild: The Impact of Varying Conditions and Cultivar.
Proceedings of the Digital Image Computing: Techniques and Applications, 2020

Geometry-Constrained Car Recognition Using a 3D Perspective Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

On Minimum Discrepancy Estimation for Deep Domain Adaptation.
Proceedings of the Domain Adaptation for Visual Understanding, 2020

2019
Voice Presentation Attack Detection Using Convolutional Neural Networks.
Proceedings of the Handbook of Biometric Anti-Spoofing, 2019

Understanding Patients' Behavior: Vision-Based Analysis of Seizure Disorders.
IEEE J. Biomed. Health Informatics, 2019

Scene Invariant Virtual Gates Using DNNs.
IEEE Trans. Circuits Syst. Video Technol., 2019

Robust Photogeometric Localization Over Time for Map-Centric Loop Closure.
IEEE Robotics Autom. Lett., 2019

Sparse over-complete patch matching.
Pattern Recognit. Lett., 2019

Multimodal clothing recognition for semantic search in unconstrained surveillance imagery.
J. Vis. Commun. Image Represent., 2019

Deep domain adaptation for anti-spoofing in speaker verification systems.
Comput. Speech Lang., 2019

Neural Memory Networks for Robust Classification of Seizure Type.
CoRR, 2019

Exploiting Human Social Cognition for the Detection of Fake and Fraudulent Faces via Memory Networks.
CoRR, 2019

Neural Memory Plasticity for Anomaly Detection.
CoRR, 2019

Dense Deformation Network for High Resolution Tissue Cleared Image Registration.
CoRR, 2019

On Minimum Discrepancy Estimation for Deep Domain Adaptation.
CoRR, 2019

Multi-Component Image Translation for Deep Domain Generalization.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Semantic Correspondence in the Wild.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Coupled Generative Adversarial Network for Continuous Fine-Grained Action Segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Semantic Segmentation Of Hands In Multimodal Images: A Region New-Based CNN Approach.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Towards Extreme-Resolution Image Registration with Deep Learning.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

A Study of x-Vector Based Speaker Recognition on Short Utterances.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

MTRNet: A Generic Scene Text Eraser.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Neighbourhood Context Embeddings in Deep Inverse Reinforcement Learning for Predicting Pedestrian Motion Over Long Time Horizons.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Predicting the Future: A Jointly Learnt Model for Action Anticipation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Investigating Domain Sensitivity of DNN Embeddings for Speaker Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Vision-based System for Breathing Disorder Identification: A Deep Learning Perspective.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Motion Signatures for the Analysis of Seizure Evolution in Epilepsy.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Vision-Based Mouth Motion Analysis in Epilepsy: A 3D Perspective.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Unified 2D and 3D Hand Pose Estimation from a Single Visible or X-ray Image.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Forecasting Future Action Sequences with Neural Memory Networks.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Learning Salient Features for Multimodal Emotion Recognition with Recurrent Neural Networks and Attention Based Fusion.
Proceedings of the 15th International Conference on Auditory-Visual Speech Processing, 2019

2018
Fruit Quantity and Ripeness Estimation Using a Robotic Vision System.
IEEE Robotics Autom. Lett., 2018

Super-resolution for biometrics: A comprehensive survey.
Pattern Recognit., 2018

<i>Soft</i> + <i>Hardwired</i> attention: An LSTM framework for human trajectory prediction and abnormal event detection.
Neural Networks, 2018

Tree Memory Networks for modelling long-term temporal dependencies.
Neurocomputing, 2018

Human-level face verification with intra-personal factor analysis and deep face representation.
IET Biom., 2018

Deep spatio-temporal feature fusion with compact bilinear pooling for multimodal emotion recognition.
Comput. Vis. Image Underst., 2018

A Comparative Analysis of Registration Tools: Traditional vs Deep Learning Approach on High Resolution Tissue Cleared Data.
CoRR, 2018

Performance of Image Registration Tools on High-Resolution 3D Brain Images.
CoRR, 2018

Semantic Correspondence: A Hierarchical Approach.
CoRR, 2018

Fruit Quantity and Quality Estimation using a Robotic Vision System.
CoRR, 2018

Iris Recognition With Off-the-Shelf CNN Features: A Deep Learning Perspective.
IEEE Access, 2018

A Deep Four-Stream Siamese Convolutional Neural Network with Joint Verification and Identification Loss for Person Re-Detection.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Task Specific Visual Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Tracking by Prediction: A Deep Generative Model for Mutli-person Localisation and Tracking.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Investigating Deep Neural Networks for Speaker Diarization in the DIHARD Challenge.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Pedestrian Trajectory Prediction with Structured Memory Hierarchies.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2018

Domain-invariant I-vector Feature Extraction for PLDA Speaker Verification.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Elastic LiDAR Fusion: Dense Map-Centric Continuous-Time SLAM.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Meta Transfer Learning for Facial Emotion Recognition.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Non-rigid Reconstruction with a Single Moving RGB-D Camera.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Deep Match Tracker: Classifying when Dissimilar, Similarity Matching when Not.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Hierarchical Relational Attention for Video Question Answering.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Calibrating Cameras in Poor-Conditioned Pitch-Based Sports Games.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Performance of Registration Tools on High-Resolution 3D Brain Images.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Deep Motion Analysis for Epileptic Seizure Classification.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Deep Classification of Epileptic Signals.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Skeleton Driven Non-Rigid Motion Tracking and 3D Reconstruction.
Proceedings of the 2018 Digital Image Computing: Techniques and Applications, 2018

Deep Decision Trees for Discriminative Dictionary Learning With Adversarial Multi-Agent Trajectories.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Semantic Person Retrieval in Surveillance Using Soft Biometrics: AVSS 2018 Challenge II.
Proceedings of the 15th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2018

Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Rethinking Planar Homography Estimation Using Perspective Fields.
Proceedings of the Computer Vision - ACCV 2018, 2018

Image2Mesh: A Learning Framework for Single Image 3D Reconstruction.
Proceedings of the Computer Vision - ACCV 2018, 2018

Learning Free-Form Deformations for 3D Object Reconstruction.
Proceedings of the Computer Vision - ACCV 2018, 2018

Multi-level Sequence GAN for Group Activity Recognition.
Proceedings of the Computer Vision - ACCV 2018, 2018

GD-GAN: Generative Adversarial Networks for Trajectory Prediction and Group Detection in Crowds.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Locating People in Surveillance Video Using Soft Biometric Traits.
Proceedings of the Handbook of Biometrics for Forensic Science, 2017

Long range iris recognition: A survey.
Pattern Recognit., 2017

A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems.
Int. J. Speech Technol., 2017

Fine-grained action recognition of boxing punches from depth imagery.
Comput. Vis. Image Underst., 2017

Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection.
CoRR, 2017

Joint Max Margin and Semantic Features for Continuous Event Detection in Complex Scenes.
CoRR, 2017

Deep Spatio-Temporal Features for Multimodal Emotion Recognition.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Deep Context Modeling for Semantic Segmentation.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Two Stream LSTM: A Deep Fusion Framework for Human Action Recognition.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

From Affine Rank Minimization Solution to Sparse Modeling.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Gate connected convolutional neural network for object tracking.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Facial analysis in the wild with LSTM networks.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Single image depth prediction using super-column super-pixel features.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Deep discovery of facial motions using a shallow embedding layer.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A cascaded long short-term memory (LSTM) driven generic visual question answering (VQA).
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Probabilistic Surfel Fusion for Dense LiDAR Mapping.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Going Deeper: Autonomous Steering with Neural Memory Networks.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Using Synthetic Data to Improve Facial Expression Analysis with 3D Convolutional Networks.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Deep features-based expression-invariant tied factor analysis for emotion recognition.
Proceedings of the 2017 IEEE International Joint Conference on Biometrics, 2017

Two-stage facial age prediction using group-specific features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Compact Model Representation for 3D Reconstruction.
Proceedings of the 2017 International Conference on 3D Vision, 2017

2016
Discovering Team Structures in Soccer from Spatiotemporal Data.
IEEE Trans. Knowl. Data Eng., 2016

A flexible hierarchical approach for facial age estimation based on multiple features.
Pattern Recognit., 2016

Detecting rare events using Kullback-Leibler divergence: A weakly supervised approach.
Expert Syst. Appl., 2016

Recent Advances in Camera Planning for Large Area Surveillance: A Comprehensive Review.
ACM Comput. Surv., 2016

Automatic Event Detection for Signal-based Surveillance.
CoRR, 2016

Improving Short Utterance PLDA Speaker Verification using SUV Modelling and Utterance Partitioning Approach.
CoRR, 2016

DNN based Speaker Recognition on Short Utterances.
CoRR, 2016

Domain adaptation based Speaker Recognition on Short Utterances.
CoRR, 2016

Discovery of facial motions using deep machine perception.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speakers In The Wild (SITW): The QUT Speaker Recognition System.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A robust UAV landing site detection system using mid-level discriminative patches.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Deeper and wider fully convolutional network coupled with conditional random fields for scene labeling.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Vertical Axis Detection for Sport Video Analytics.
Proceedings of the 2016 International Conference on Digital Image Computing: Techniques and Applications, 2016

Complex Event Detection Using Joint Max Margin and Semantic Features.
Proceedings of the 2016 International Conference on Digital Image Computing: Techniques and Applications, 2016

2015
Score-Level Multibiometric Fusion Based on Dempster-Shafer Theory Incorporating Uncertainty Factors.
IEEE Trans. Hum. Mach. Syst., 2015

An Efficient and Robust System for Multiperson Event Detection in Real-World Indoor Surveillance Scenes.
IEEE Trans. Circuits Syst. Video Technol., 2015

Searching for people using semantic soft biometric descriptions.
Pattern Recognit. Lett., 2015

Automatic surveillance in transportation hubs: No longer just about catching the bad guy.
Expert Syst. Appl., 2015

A framework for model integration and holistic modelling of socio-technical systems.
Decis. Support Syst., 2015

An evaluation of crowd counting methods, features and regression models.
Comput. Vis. Image Underst., 2015

Acoustic Adaptation in Cross Database Audio Visual SHMM Training for Phonetic Spoken Term Detection.
Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia, 2015

Cross database training of audio-visual hidden Markov models for phone recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Complete-linkage clustering for voice activity detection in audio and visual speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Class-specific sparse codes for representing activities.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Improving deep convolutional neural networks with unsupervised feature learning.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Combat sports analytics: Boxing punch classification using overhead depthimagery.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Detecting rare events using Kullback-Leibler divergence.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Searching for semantic person queries using channel representations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Closed-Form Solutions for Low-Rank Non-Rigid Reconstruction.
Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

Robust Automatic Face Clustering in News Video.
Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

Learning Temporal Alignment Uncertainty for Efficient Event Detection.
Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

Large scale monitoring of crowds and building utilisation: A new database and distributed approach.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

2014
Optimal Camera Planning Under Versatile User Constraints in Multi-Camera Image Processing Systems.
IEEE Trans. Image Process., 2014

Real-time video event detection in crowded scenes using MPEG derived features: A multiple instance learning approach.
Pattern Recognit. Lett., 2014

Scene invariant multi camera crowd counting.
Pattern Recognit. Lett., 2014

Local inter-session variability modelling for object classification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Activity recognition using binary tree SVM.
Proceedings of the IEEE Workshop on Statistical Signal Processing, 2014

Social signal processing for pain monitoring using a hidden conditional random field.
Proceedings of the IEEE Workshop on Statistical Signal Processing, 2014

Agent-based modelling of aircraft boarding methods.
Proceedings of the 4th International Conference On Simulation And Modeling Methodologies, 2014

Analysis of passenger group behaviour and its impact on passenger flow using an agent-based model.
Proceedings of the 4th International Conference On Simulation And Modeling Methodologies, 2014

SAIVT-ADMRG @ MediaEval 2014 Social Event Detection.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Multiple Instance Dictionary Learning for Activity Representation.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Locating People in Video from Semantic Descriptions: A New Database and Approach.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Supervised Latent Dirichlet Allocation Models for Efficient Activity Representation.
Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

Automatic UAV Forced Landing Site Detection Using Machine Learning.
Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

An MRF based abnormal event detection approach using motion and appearance features.
Proceedings of the 11th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2014

2013
Evaluation of two-view geometry methods with automatic ground-truth generation.
Image Vis. Comput., 2013

Feature-domain super-resolution for iris recognition.
Comput. Vis. Image Underst., 2013

Liveness detection based on 3D face shape analysis.
Proceedings of the 1st International Workshop on Biometrics and Forensics, 2013

Deformable face ensemble alignment with robust grouped-L1 anchors.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Semi-Binary Based Video Features for Activity Representation.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

An Evaluation of Different Features and Learning Models for Anomalous Event Detection.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Quality Based Frame Selection for Face Clustering in News Video.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Histogram of Weighted Local Directions for Gait Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Scene Invariant Crowd Counting and Crowd Occupancy Analysis.
Proceedings of the Video Analytics for Business Intelligence, 2012

Identifying Customer Behaviour and Dwell Time Using Soft Biometrics.
Proceedings of the Video Analytics for Business Intelligence, 2012

A Mask-Based Approach for the Geometric Calibration of Thermal-Infrared Cameras.
IEEE Trans. Instrum. Meas., 2012

Evaluation of image resolution and super-resolution on face recognition performance.
J. Vis. Commun. Image Represent., 2012

Hessian-Based Affine Adaptation of Salient Local Image Features.
J. Math. Imaging Vis., 2012

SAIVT-QUT@TRECVid 2012: Interactive Surveillance Event Detection.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Modelling Passengers Flow at Airport Terminals - Individual Agent Decision Model for Stochastic Passenger Behaviour.
Proceedings of the SIMULTECH 2012 - Proceedings of the 2nd International Conference on Simulation and Modeling Methodologies, Technologies and Applications, Rome, Italy, 28, 2012

Efficient real-time face detection for high resolution surveillance applications.
Proceedings of the 6th International Conference on Signal Processing and Communication Systems, 2012

Quality based frame selection for video face recognition.
Proceedings of the 6th International Conference on Signal Processing and Communication Systems, 2012

On the Statistical Determination of Optimal Camera Configurations in Large Scale Surveillance Networks.
Proceedings of the Computer Vision - ECCV 2012, 2012

Spatio Temporal Feature Evaluation for Action Recognition.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

The Backfilled GEI - A Cross-Capture Modality Gait Feature for Frontal and Side-View Gait Recognition.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

Anomalous Event Detection Using a Semi-Two Dimensional Hidden Markov Model.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

Can You Describe Him for Me? A Technique for Semantic Person Search in Video.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

A Database for Person Re-Identification in Multi-Camera Surveillance Networks.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

Feature-domain super-resolution framework for Gabor-based face and iris recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Activity Analysis in Complicated Scenes Using DFT Coefficients of Particle Trajectories.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Unusual Scene Detection Using Distributed Behaviour Model and Sparse Representation.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Use of brain computer interface to drive: preliminary results.
Proceedings of the International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 2012

2011
Quality-Driven Super-Resolution for Less Constrained Iris Recognition at a Distance and on the Move.
IEEE Trans. Inf. Forensics Secur., 2011

Check-in processing: simulation of passengers with advanced traits.
Proceedings of the Winter Simulation Conference 2011, 2011

Gait energy volumes and frontal gait recognition using depth images.
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011

Activity Modelling in Crowded Environments: A Soft-Decision Approach.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Unusual Event Detection in Crowded Scenes Using Bag of LBPs in Spatio-Temporal Patches.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

An Exploration of Feature Detector Performance in the Thermal-Infrared Modality.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Compressive Sensing for Gait Recognition.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Scene Invariant Crowd Counting.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Visual Voice Activity Detection Using Frontal versus Profile Views.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Negative Determinant of Hessian Features.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Practical Improvements to Simultaneous Computation of Multi-view Geometry and Radial Lens Distortion.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Evaluating Automatic Road Detection across a Large Aerial Imagery Collection.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

3D ellipsoid fitting for multi-view gait recognition.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Textures of optical flow for real-time anomaly detection in crowds.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Determining operational measures from multi-camera surveillance systems using soft biometrics.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010
Multi-spectral fusion for surveillance systems.
Comput. Electr. Eng., 2010

Robust mean super-resolution for less cooperative NIR iris recognition at a distance and on the move.
Proceedings of the 2010 Symposium on Information and Communication Technology, 2010

Fusing shrinking and expanding active contour models for robust iris segementation.
Proceedings of the 10th International Conference on Information Sciences, 2010

Lip detection for audio-visual speech recognition in-car environment.
Proceedings of the 10th International Conference on Information Sciences, 2010

Eigengaze - covert behavioral biometric exploiting visual attention characteristics.
Proceedings of the 10th International Conference on Information Sciences, 2010

Labelled silhouettes for human pose estimation.
Proceedings of the 10th International Conference on Information Sciences, 2010

Accurate silhouette segmentation using motion detection and graph cuts.
Proceedings of the 10th International Conference on Information Sciences, 2010

Accurate Silhouettes for Surveillance - Improved Motion Segmentation Using Graph Cuts.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2010

Crowd Counting Using Group Tracking and Local Features.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

Multi-Modal Object Tracking using Dynamic Performance Metrics.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

Cascading appearance-based features for visual voice activity detection.
Proceedings of the Auditory-Visual Speech Processing, 2010

Exploring visual features through Gabor representations for facial expression detection.
Proceedings of the Auditory-Visual Speech Processing, 2010

2009
PhD forum: Multiple camera management using wide base-line matching.
Proceedings of the Third ACM/IEEE International Conference on Distributed Smart Cameras, 2009

Crowd Counting Using Multiple Local Features.
Proceedings of the DICTA 2009, 2009

Dense Correspondence Extraction in Difficult Uncalibrated Scenarios.
Proceedings of the DICTA 2009, 2009

Improved Simultaneous Computation of Motion Detection and Optical Flow for Object Tracking.
Proceedings of the DICTA 2009, 2009

Soft-Biometrics: Unconstrained Authentication in a Surveillance Environment.
Proceedings of the DICTA 2009, 2009

Affine Adaptation of Local Image Features Using the Hessian Matrix.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Dynamic Performance Measures for Object Tracking Systems.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

2008
3D face verification using a free-parts approach.
Pattern Recognit. Lett., 2008

Normalisation and Recognition of 3D Face Data Using Robust Hausdorff Metric.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

Improved GrabCut Segmentation via GMM Optimisation.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

2007
Super-Resolved Faces for Improved Face Recognition from Surveillance Video.
Proceedings of the Advances in Biometrics, International Conference, 2007

2006
What Is the Average Human Face?
Proceedings of the Advances in Image and Video Technology, First Pacific Rim Symposium, 2006

3D Face Recognition using Log-Gabor Templates.
Proceedings of the British Machine Vision Conference 2006, 2006

Human Face Reconstruction Using Bayesian Deformable Models.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

The Role of Motion Models in Super-Resolving Surveillance Video for Face Recognition.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

Multi-view Intelligent Vehicle Surveillance System.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

A Multi-Class Tracker Using a Scalable Condensation Filter.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

2005
Gabor Filter Bank Representation for 3D Face Recognition.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2005

2004
Optimal grid point selection for improved nonrigid medical image registration.
Proceedings of the Medical Imaging 2004: Image Processing, 2004

Quadrature-Based Image Registration Method using Mutual Information.
Proceedings of the 2004 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2004

Multi-Spectral Stereo Image Matching using Mutual Information.
Proceedings of the 2nd International Symposium on 3D Data Processing, 2004

Face Recognition from 3D Data using Iterative Closest Point Algorithm and Gaussian Mixture Models.
Proceedings of the 2nd International Symposium on 3D Data Processing, 2004

2003
Rigid Medical Image Registration And Its Association With Mutual Information.
Int. J. Pattern Recognit. Artif. Intell., 2003

Non-Rigid Image Registration and the Use of Mutual Information.
Aust. J. Intell. Inf. Process. Syst., 2003

Segmentation-based image compression using BTC-VQ technique.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

2002
Improved Stereo Image Matching Using Mutual Information and Hierarchical Prior Probabilities.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

2000
Global 3D Rigid Registration of Medical Images.
Proceedings of the 2000 International Conference on Image Processing, 2000


  Loading...