Sridha Sridharan

Orcid: 0000-0003-4316-9001

According to our database, Sridha Sridharan authored at least 511 papers between 1987 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
GeoAdapt: Self-Supervised Test-Time Adaptation in LiDAR Place Recognition Using Geometric Priors.
IEEE Robotics Autom. Lett., January, 2024

AG-ReID.v2: Bridging Aerial and Ground Views for Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2024

FactoFormer: Factorized Hyperspectral Transformers With Self-Supervised Pretraining.
IEEE Trans. Geosci. Remote. Sens., 2024

Multimodal Colearning Meets Remote Sensing: Taxonomy, State of the Art, and Future Works.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

Deep cross-domain transfer for emotion recognition via joint learning.
Multim. Tools Appl., 2024

2023
Multi-stage stacked temporal convolution neural networks (MS-S-TCNs) for biosignal segmentation and anomaly localization.
Pattern Recognit., July, 2023

Spectral Geometric Verification: Re-Ranking Point Cloud Retrieval for Metric Localization.
IEEE Robotics Autom. Lett., May, 2023

Pose-driven attention-guided image generation for person re-Identification.
Pattern Recognit., May, 2023

Meta-transfer learning for emotion recognition.
Neural Comput. Appl., May, 2023

Generalized Generative Deep Learning Models for Biosignal Synthesis and Modality Transfer.
IEEE J. Biomed. Health Informatics, February, 2023

Toward On-Board Panoptic Segmentation of Multispectral Satellite Images.
IEEE Trans. Geosci. Remote. Sens., 2023

Complex-Valued Iris Recognition Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments.
CoRR, 2023

FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pre-Training.
CoRR, 2023

Learning Through Guidance: Knowledge Distillation for Endoscopic Image Classification.
CoRR, 2023

GeoAdapt: Self-Supervised Test-Time Adaption in LiDAR Place Recognition Using Geometric Priors.
CoRR, 2023

General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation.
CoRR, 2023

Remembering What Is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction.
CoRR, 2023

Physical Adversarial Attacks for Surveillance: A Survey.
CoRR, 2023

Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models.
CoRR, 2023

Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Aerial-Ground Person Re-ID.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

2022
Elasticity Meets Continuous-Time: Map-Centric Dense 3D LiDAR SLAM.
IEEE Trans. Robotics, 2022

Deep Auto-Encoders With Sequential Learning for Multimodal Dimensional Emotion Recognition.
IEEE Trans. Multim., 2022

Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings.
IEEE J. Biomed. Health Informatics, 2022

Geometric Deep Learning for Subject Independent Epileptic Seizure Prediction Using Scalp EEG Signals.
IEEE J. Biomed. Health Informatics, 2022

Component-Based Attention for Large-Scale Trademark Retrieval.
IEEE Trans. Inf. Forensics Secur., 2022

Channel Graph Regularized Correlation Filters for Visual Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2022

An efficient framework for zero-shot sketch-based image retrieval.
Pattern Recognit., 2022

Split 'n' merge net: A dynamic masking network for multi-task attention.
Pattern Recognit., 2022

Affect recognition from scalp-EEG using channel-wise encoder networks coupled with geometric deep learning and multi-channel feature fusion.
Knowl. Based Syst., 2022

Learning test-time augmentation for content-based image retrieval.
Comput. Vis. Image Underst., 2022

Deep Learning for Medical Anomaly Detection - A Survey.
ACM Comput. Surv., 2022

Using Auxiliary Information for Person Re-Identification - A Tutorial Overview.
CoRR, 2022

Towards On-Board Panoptic Segmentation of Multispectral Satellite Images.
CoRR, 2022

The State of Aerial Surveillance: A Survey.
CoRR, 2022

Jointly Trained Conversion Model With LPCNet for Any-to-One Voice Conversion Using Speaker-Independent Linguistic Features.
IEEE Access, 2022

InCloud: Incremental Learning for Point Cloud Place Recognition.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Detecting Heart Failure Through Voice Analysis using Self-Supervised Mode-Based Memory Fusion.
Proceedings of the Interspeech 2022, 2022

LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place Recognition.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

SESS: Saliency Enhancing with Scaling and Sliding.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
A Robust Interpretable Deep Learning Classifier for Heart Anomaly Detection Without Segmentation.
IEEE J. Biomed. Health Informatics, 2021

Identification of Children at Risk of Schizophrenia via Deep Learning and EEG Responses.
IEEE J. Biomed. Health Informatics, 2021

TMMF: Temporal Multi-Modal Fusion for Single-Stage Continuous Gesture Recognition.
IEEE Trans. Image Process., 2021

End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2021

Detection of Fake and Fraudulent Faces via Neural Memory Networks.
IEEE Trans. Inf. Forensics Secur., 2021

Domain Generalization in Biosignal Classification.
IEEE Trans. Biomed. Eng., 2021

Deep Inverse Reinforcement Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion.
IEEE Signal Process. Mag., 2021

Memory based fusion for multi-modal deep learning.
Inf. Fusion, 2021

Multi-modal semantic image segmentation.
Comput. Vis. Image Underst., 2021

Point Cloud Segmentation Using Sparse Temporal Local Attention.
CoRR, 2021

Discriminative Domain-Invariant Adversarial Network for Deep Domain Generalization.
CoRR, 2021

Multi-Slice Net: A novel light weight framework for COVID-19 Diagnosis.
CoRR, 2021

Preserving Semantic Consistency in Unsupervised Domain Adaptation Using Generative Adversarial Networks.
CoRR, 2021

Deep Domain Generalization with Feature-norm Network.
CoRR, 2021

Im2Mesh GAN: Accurate 3D Hand Mesh Recovery from a Single RGB Image.
CoRR, 2021

IGSSTRCF: Importance Guided Sparse Spatio-Temporal Regularized Correlation Filters For Tracking.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Learning Regional Attention Over Multi-Resolution Deep Convolutional Features For Trademark Retrieval.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Reduction of Feature Contamination for Hyper Spectral Image Classification.
Proceedings of the 2021 Digital Image Computing: Techniques and Applications, 2021

2020
Memory Augmented Deep Generative Models for Forecasting the Next Shot Location in Tennis.
IEEE Trans. Knowl. Data Eng., 2020

Heart Sound Segmentation Using Bidirectional LSTMs With Attention.
IEEE J. Biomed. Health Informatics, 2020

Constrained Design of Deep Iris Networks.
IEEE Trans. Image Process., 2020

Target-Specific Siamese Attention Network for Real-Time Object Tracking.
IEEE Trans. Inf. Forensics Secur., 2020

Temporarily-Aware Context Modeling Using Generative Adversarial Networks for Speech Activity Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Spatiotemporal Camera-LiDAR Calibration: A Targetless and Structureless Approach.
IEEE Robotics Autom. Lett., 2020

Hierarchical Attention Network for Action Segmentation.
Pattern Recognit. Lett., 2020

Correlation-aware adversarial domain adaptation and generalization.
Pattern Recognit., 2020

Context from within: Hierarchical context modeling for semantic segmentation.
Pattern Recognit., 2020

Fine-grained action segmentation using the semi-supervised action GAN.
Pattern Recognit., 2020

Neural memory plasticity for medical anomaly detection.
Neural Networks, 2020

MTRNet++: One-stage mask-based scene text eraser.
Comput. Vis. Image Underst., 2020

Joint identification-verification for person re-identification: A four stream deep learning approach with improved quartet loss function.
Comput. Vis. Image Underst., 2020

LSTM guided ensemble correlation filter tracking with appearance model pool.
Comput. Vis. Image Underst., 2020

Patient-independent Epileptic Seizure Prediction using Deep Learning Models.
CoRR, 2020

Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers.
CoRR, 2020

Multi-modal Fusion for Single-Stage Continuous Gesture Recognition.
CoRR, 2020

Memory Based Attentive Fusion.
CoRR, 2020

Meta Transfer Learning for Emotion Recognition.
CoRR, 2020

Understanding the Importance of Heart Sound Segmentation for Heart Anomaly Detection.
CoRR, 2020

Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection.
CoRR, 2020

Joint Deep Cross-Domain Transfer Learning for Emotion Recognition.
CoRR, 2020

Enhancing Feature Invariance with Learned Image Transformations for Image Retrieval.
CoRR, 2020

Semantic Consistency and Identity Mapping Multi-Component Generative Adversarial Network for Person Re-Identification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Two-Stream Deep Feature Modelling for Automated Video Endoscopy Data Analysis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Attention Driven Fusion for Multi-Modal Emotion Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Geometry-Constrained Car Recognition Using a 3D Perspective Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

On Minimum Discrepancy Estimation for Deep Domain Adaptation.
Proceedings of the Domain Adaptation for Visual Understanding, 2020

2019
Voice Presentation Attack Detection Using Convolutional Neural Networks.
Proceedings of the Handbook of Biometric Anti-Spoofing, 2019

Understanding Patients' Behavior: Vision-Based Analysis of Seizure Disorders.
IEEE J. Biomed. Health Informatics, 2019

Scene Invariant Virtual Gates Using DNNs.
IEEE Trans. Circuits Syst. Video Technol., 2019

Robust Photogeometric Localization Over Time for Map-Centric Loop Closure.
IEEE Robotics Autom. Lett., 2019

Sparse over-complete patch matching.
Pattern Recognit. Lett., 2019

Multimodal clothing recognition for semantic search in unconstrained surveillance imagery.
J. Vis. Commun. Image Represent., 2019

Deep domain adaptation for anti-spoofing in speaker verification systems.
Comput. Speech Lang., 2019

Exploiting Human Social Cognition for the Detection of Fake and Fraudulent Faces via Memory Networks.
CoRR, 2019

Neural Memory Plasticity for Anomaly Detection.
CoRR, 2019

On Minimum Discrepancy Estimation for Deep Domain Adaptation.
CoRR, 2019

Multi-Component Image Translation for Deep Domain Generalization.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Semantic Correspondence in the Wild.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Coupled Generative Adversarial Network for Continuous Fine-Grained Action Segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Semantic Segmentation Of Hands In Multimodal Images: A Region New-Based CNN Approach.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

A Study of x-Vector Based Speaker Recognition on Short Utterances.
Proceedings of the Interspeech 2019, 2019

MTRNet: A Generic Scene Text Eraser.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Neighbourhood Context Embeddings in Deep Inverse Reinforcement Learning for Predicting Pedestrian Motion Over Long Time Horizons.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Predicting the Future: A Jointly Learnt Model for Action Anticipation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Investigating Domain Sensitivity of DNN Embeddings for Speaker Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2019

Vision-Based Mouth Motion Analysis in Epilepsy: A 3D Perspective.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Unified 2D and 3D Hand Pose Estimation from a Single Visible or X-ray Image.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Forecasting Future Action Sequences with Neural Memory Networks.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Interactive Sports Analytics: An Intelligent Interface for Utilizing Trajectories for Interactive Sports Play Retrieval and Analytics.
ACM Trans. Comput. Hum. Interact., 2018

Super-resolution for biometrics: A comprehensive survey.
Pattern Recognit., 2018

Soft + Hardwired attention: An LSTM framework for human trajectory prediction and abnormal event detection.
Neural Networks, 2018

Tree Memory Networks for modelling long-term temporal dependencies.
Neurocomputing, 2018

Human-level face verification with intra-personal factor analysis and deep face representation.
IET Biom., 2018

Deep spatio-temporal feature fusion with compact bilinear pooling for multimodal emotion recognition.
Comput. Vis. Image Underst., 2018

Improving PLDA speaker verification performance using domain mismatch compensation techniques.
Comput. Speech Lang., 2018

Semantic Correspondence: A Hierarchical Approach.
CoRR, 2018

Iris Recognition With Off-the-Shelf CNN Features: A Deep Learning Perspective.
IEEE Access, 2018

A Deep Four-Stream Siamese Convolutional Neural Network with Joint Verification and Identification Loss for Person Re-Detection.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Task Specific Visual Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Tracking by Prediction: A Deep Generative Model for Mutli-person Localisation and Tracking.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Investigating Deep Neural Networks for Speaker Diarization in the DIHARD Challenge.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Pedestrian Trajectory Prediction with Structured Memory Hierarchies.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2018

Domain-invariant I-vector Feature Extraction for PLDA Speaker Verification.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance.
Proceedings of the Interspeech 2018, 2018

Elastic LiDAR Fusion: Dense Map-Centric Continuous-Time SLAM.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Meta Transfer Learning for Facial Emotion Recognition.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Non-rigid Reconstruction with a Single Moving RGB-D Camera.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Deep Match Tracker: Classifying when Dissimilar, Similarity Matching when Not.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Hierarchical Relational Attention for Video Question Answering.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Calibrating Cameras in Poor-Conditioned Pitch-Based Sports Games.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Motion Analysis for Epileptic Seizure Classification.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Deep Classification of Epileptic Signals.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Skeleton Driven Non-Rigid Motion Tracking and 3D Reconstruction.
Proceedings of the 2018 Digital Image Computing: Techniques and Applications, 2018

Deep Decision Trees for Discriminative Dictionary Learning With Adversarial Multi-Agent Trajectories.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Rethinking Planar Homography Estimation Using Perspective Fields.
Proceedings of the Computer Vision - ACCV 2018, 2018

Image2Mesh: A Learning Framework for Single Image 3D Reconstruction.
Proceedings of the Computer Vision - ACCV 2018, 2018

Learning Free-Form Deformations for 3D Object Reconstruction.
Proceedings of the Computer Vision - ACCV 2018, 2018

Multi-level Sequence GAN for Group Activity Recognition.
Proceedings of the Computer Vision - ACCV 2018, 2018

GD-GAN: Generative Adversarial Networks for Trajectory Prediction and Group Detection in Crowds.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Locating People in Surveillance Video Using Soft Biometric Traits.
Proceedings of the Handbook of Biometrics for Forensic Science, 2017

Long range iris recognition: A survey.
Pattern Recognit., 2017

A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems.
Int. J. Speech Technol., 2017

Fine-grained action recognition of boxing punches from depth imagery.
Comput. Vis. Image Underst., 2017

Cross database audio visual speech adaptation for phonetic spoken term detection.
Comput. Speech Lang., 2017

Fine-Grained Retrieval of Sports Plays using Tree-Based Alignment of Trajectories.
CoRR, 2017

Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection.
CoRR, 2017

Joint Max Margin and Semantic Features for Continuous Event Detection in Complex Scenes.
CoRR, 2017

Deep Spatio-Temporal Features for Multimodal Emotion Recognition.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Deep Context Modeling for Semantic Segmentation.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Two Stream LSTM: A Deep Fusion Framework for Human Action Recognition.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

From Affine Rank Minimization Solution to Sparse Modeling.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification.
Proceedings of the Interspeech 2017, 2017

Gate connected convolutional neural network for object tracking.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Facial analysis in the wild with LSTM networks.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Single image depth prediction using super-column super-pixel features.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Deep discovery of facial motions using a shallow embedding layer.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A cascaded long short-term memory (LSTM) driven generic visual question answering (VQA).
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Probabilistic Surfel Fusion for Dense LiDAR Mapping.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Going Deeper: Autonomous Steering with Neural Memory Networks.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Using Synthetic Data to Improve Facial Expression Analysis with 3D Convolutional Networks.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Deep features-based expression-invariant tied factor analysis for emotion recognition.
Proceedings of the 2017 IEEE International Joint Conference on Biometrics, 2017

Fast, Dense Feature SDM on an iPhone.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Compact Model Representation for 3D Reconstruction.
Proceedings of the 2017 International Conference on 3D Vision, 2017

2016
Forecasting the Next Shot Location in Tennis Using Fine-Grained Spatiotemporal Tracking Data.
IEEE Trans. Knowl. Data Eng., 2016

Discovering Team Structures in Soccer from Spatiotemporal Data.
IEEE Trans. Knowl. Data Eng., 2016

Feature mapping using far-field microphones for distant speech recognition.
Speech Commun., 2016

Detecting rare events using Kullback-Leibler divergence: A weakly supervised approach.
Expert Syst. Appl., 2016

Recent Advances in Camera Planning for Large Area Surveillance: A Comprehensive Review.
ACM Comput. Surv., 2016

A study of speaker clustering for speaker attribution in large telephone conversation datasets.
Comput. Speech Lang., 2016

Automatic Event Detection for Signal-based Surveillance.
CoRR, 2016

Improving Short Utterance PLDA Speaker Verification using SUV Modelling and Utterance Partitioning Approach.
CoRR, 2016

DNN based Speaker Recognition on Short Utterances.
CoRR, 2016

Domain adaptation based Speaker Recognition on Short Utterances.
CoRR, 2016

Discovery of facial motions using deep machine perception.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification.
Proceedings of the Interspeech 2016, 2016

Speakers In The Wild (SITW): The QUT Speaker Recognition System.
Proceedings of the Interspeech 2016, 2016

A robust UAV landing site detection system using mid-level discriminative patches.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Deeper and wider fully convolutional network coupled with conditional random fields for scene labeling.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Vertical Axis Detection for Sport Video Analytics.
Proceedings of the 2016 International Conference on Digital Image Computing: Techniques and Applications, 2016

Complex Event Detection Using Joint Max Margin and Semantic Features.
Proceedings of the 2016 International Conference on Digital Image Computing: Techniques and Applications, 2016

2015
Score-Level Multibiometric Fusion Based on Dempster-Shafer Theory Incorporating Uncertainty Factors.
IEEE Trans. Hum. Mach. Syst., 2015

An Efficient and Robust System for Multiperson Event Detection in Real-World Indoor Surveillance Scenes.
IEEE Trans. Circuits Syst. Video Technol., 2015

Searching for people using semantic soft biometric descriptions.
Pattern Recognit. Lett., 2015

Automatic surveillance in transportation hubs: No longer just about catching the bad guy.
Expert Syst. Appl., 2015

An evaluation of crowd counting methods, features and regression models.
Comput. Vis. Image Underst., 2015

Acoustic Adaptation in Cross Database Audio Visual SHMM Training for Phonetic Spoken Term Detection.
Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia, 2015

Predicting Serves in Tennis using Style Priors.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Dataset-invariant covariance normalization for out-domain PLDA speaker verification.
Proceedings of the INTERSPEECH 2015, 2015

Investigating in-domain data requirements for PLDA training.
Proceedings of the INTERSPEECH 2015, 2015

Improving PLDA speaker verification using WMFD and linear-weighted approaches in limited microphone data conditions.
Proceedings of the INTERSPEECH 2015, 2015

Incorporating visual information for spoken term detection.
Proceedings of the INTERSPEECH 2015, 2015

Cross database training of audio-visual hidden Markov models for phone recognition.
Proceedings of the INTERSPEECH 2015, 2015

Channel selection in the short-time modulation domain for distant speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

Complete-linkage clustering for voice activity detection in audio and visual speech.
Proceedings of the INTERSPEECH 2015, 2015

The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition.
Proceedings of the INTERSPEECH 2015, 2015

Class-specific sparse codes for representing activities.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Improving deep convolutional neural networks with unsupervised feature learning.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Combat sports analytics: Boxing punch classification using overhead depth imagery.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Predicting Ball Ownership in Basketball from a Monocular View Using Only Player Trajectories.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Detecting rare events using Kullback-Leibler divergence.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improving out-domain PLDA speaker verification using unsupervised inter-dataset variability compensation approach.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A cluster-voting approach for speaker diarization and linking of Australian broadcast news recordings.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Searching for semantic person queries using channel representations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Closed-Form Solutions for Low-Rank Non-Rigid Reconstruction.
Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

Robust Automatic Face Clustering in News Video.
Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

Learning Temporal Alignment Uncertainty for Efficient Event Detection.
Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

Large scale monitoring of crowds and building utilisation: A new database and distributed approach.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

2014
Optimal Camera Planning Under Versatile User Constraints in Multi-Camera Image Processing Systems.
IEEE Trans. Image Process., 2014

Improving short utterance i-vector speaker verification using utterance variance modelling and compensation techniques.
Speech Commun., 2014

Real-time video event detection in crowded scenes using MPEG derived features: A multiple instance learning approach.
Pattern Recognit. Lett., 2014

Scene invariant multi camera crowd counting.
Pattern Recognit. Lett., 2014

I-vector based speaker recognition using advanced channel compensation techniques.
Comput. Speech Lang., 2014

Learning detectors quickly using structured covariance matrices.
CoRR, 2014

Understanding and analyzing a large collection of archived swimming videos.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Predicting movie ratings from audience behaviors.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Local inter-session variability modelling for object classification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Activity recognition using binary tree SVM.
Proceedings of the IEEE Workshop on Statistical Signal Processing, 2014

Social signal processing for pain monitoring using a hidden conditional random field.
Proceedings of the IEEE Workshop on Statistical Signal Processing, 2014

SAIVT-ADMRG @ MediaEval 2014 Social Event Detection.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

An iterative speaker re-diarization scheme for improving speaker-based entity extraction in multimedia archives.
Proceedings of the INTERSPEECH 2014, 2014

Multiple Instance Dictionary Learning for Activity Representation.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Locating People in Video from Semantic Descriptions: A New Database and Approach.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Identifying Team Style in Soccer Using Formations Learned from Spatiotemporal Tracking Data.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Large-Scale Analysis of Soccer Matches Using Spatiotemporal Tracking Data.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Improving PLDA speaker verification with limited development data.
Proceedings of the IEEE International Conference on Acoustics, 2014

Topic dependent language modelling for spoken term detection.
Proceedings of the 22nd European Signal Processing Conference, 2014

A speaker rediarization scheme for improving diarization in large two-speaker telephone datasets.
Proceedings of the 22nd European Signal Processing Conference, 2014

Supervised Latent Dirichlet Allocation Models for Efficient Activity Representation.
Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

Automatic UAV Forced Landing Site Detection Using Machine Learning.
Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

An MRF based abnormal event detection approach using motion and appearance features.
Proceedings of the 11th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2014

Forecasting Events Using an Augmented Hidden Conditional Random Field.
Proceedings of the Computer Vision - ACCV 2014, 2014

Learning Detectors Quickly with Stationary Statistics.
Proceedings of the Computer Vision - ACCV 2014, 2014

Unsupervised Temporal Ensemble Alignment for Rapid Annotation.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
Fourier Lucas-Kanade Algorithm.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Evaluation of two-view geometry methods with automatic ground-truth generation.
Image Vis. Comput., 2013

Feature-domain super-resolution for iris recognition.
Comput. Vis. Image Underst., 2013

Eigenvoice modelling for cross likelihood ratio based speaker clustering: A Bayesian approach.
Comput. Speech Lang., 2013

Multiple cameras for audio-visual speech recognition in an automotive environment.
Comput. Speech Lang., 2013

Liveness detection based on 3D face shape analysis.
Proceedings of the 1st International Workshop on Biometrics and Forensics, 2013

Improving the PLDA based speaker verification in limited microphone data conditions.
Proceedings of the INTERSPEECH 2013, 2013

Improving short utterance based i-vector speaker recognition using source and utterance-duration normalization techniques.
Proceedings of the INTERSPEECH 2013, 2013

Speaker Attribution of Australian Broadcast News Data.
Proceedings of the First Workshop on Speech, 2013

Rank Minimization across Appearance and Shape for AAM Ensemble Fitting.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Deformable face ensemble alignment with robust grouped-L1 anchors.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Large-Scale Analysis of Formations in Soccer.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Predicting Shot Locations in Tennis Using Spatiotemporal Data.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Semi-Binary Based Video Features for Activity Representation.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Swimmer Localization from a Moving Camera.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

An Evaluation of Different Features and Learning Models for Anomalous Event Detection.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Person Re-Identification Using Group Information.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Quality Based Frame Selection for Face Clustering in News Video.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Histogram of Weighted Local Directions for Gait Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

Recognising Team Activities from Noisy Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

Visual front-end wars: Viola-Jones face detector vs Fourier Lucas-Kanade.
Proceedings of the Auditory-Visual Speech Processing, 2013

2012
Scene Invariant Crowd Counting and Crowd Occupancy Analysis.
Proceedings of the Video Analytics for Business Intelligence, 2012

Identifying Customer Behaviour and Dwell Time Using Soft Biometrics.
Proceedings of the Video Analytics for Business Intelligence, 2012

In the Pursuit of Effective Affective Computing: The Relationship Between Features and Registration.
IEEE Trans. Syst. Man Cybern. Part B, 2012

A Mask-Based Approach for the Geometric Calibration of Thermal-Infrared Cameras.
IEEE Trans. Instrum. Meas., 2012

Evaluation of image resolution and super-resolution on face recognition performance.
J. Vis. Commun. Image Represent., 2012

Hessian-Based Affine Adaptation of Salient Local Image Features.
J. Math. Imaging Vis., 2012

Self-calibration of wireless cameras with restricted degrees of freedom.
Comput. Vis. Image Underst., 2012

SAIVT-QUT@TRECVid 2012: Interactive Surveillance Event Detection.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

PLDA based speaker recognition on short utterances.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

PLDA based speaker verification with weighted LDA techniques.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Efficient real-time face detection for high resolution surveillance applications.
Proceedings of the 6th International Conference on Signal Processing and Communication Systems, 2012

Quality based frame selection for video face recognition.
Proceedings of the 6th International Conference on Signal Processing and Communication Systems, 2012

Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Hand-held monocular SLAM in thermal-infrared.
Proceedings of the 12th International Conference on Control Automation Robotics & Vision, 2012

Efficient Articulated Trajectory Reconstruction Using Dynamic Programming and Filters.
Proceedings of the Computer Vision - ECCV 2012, 2012

On the Statistical Determination of Optimal Camera Configurations in Large Scale Surveillance Networks.
Proceedings of the Computer Vision - ECCV 2012, 2012

Anchored Deformable Face Ensemble Alignment.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Spatio Temporal Feature Evaluation for Action Recognition.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

The Backfilled GEI - A Cross-Capture Modality Gait Feature for Frontal and Side-View Gait Recognition.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

Anomalous Event Detection Using a Semi-Two Dimensional Hidden Markov Model.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

Can You Describe Him for Me? A Technique for Semantic Person Search in Video.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

A Database for Person Re-Identification in Multi-Camera Surveillance Networks.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

Feature-domain super-resolution framework for Gabor-based face and iris recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Improved facial expression recognition via uni-hyperplane classification.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Activity Analysis in Complicated Scenes Using DFT Coefficients of Particle Trajectories.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Unusual Scene Detection Using Distributed Behaviour Model and Sparse Representation.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Use of brain computer interface to drive: preliminary results.
Proceedings of the International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 2012

2011
Automatically Detecting Pain in Video Through Facial Action Units.
IEEE Trans. Syst. Man Cybern. Part B, 2011

Quality-Driven Super-Resolution for Less Constrained Iris Recognition at a Distance and on the Move.
IEEE Trans. Inf. Forensics Secur., 2011

Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection.
IEEE Trans. Speech Audio Process., 2011

The Delta-Phase Spectrum With Application to Voice Activity Detection and Speaker Recognition.
IEEE Trans. Speech Audio Process., 2011

Clustered Blind Beamforming From Ad-Hoc Microphone Arrays.
IEEE Trans. Speech Audio Process., 2011

The use of phase in complex spectrum subtraction for robust speech recognition.
Comput. Speech Lang., 2011

Sparse Temporal Representations for Facial Expression Recognition.
Proceedings of the Advances in Image and Video Technology - 5th Pacific Rim Symposium, 2011

Cross Likelihood Ratio Based Speaker Clustering Using Eigenvoice Models.
Proceedings of the INTERSPEECH 2011, 2011

Can Audio-Visual Speech Recognition Outperform Acoustically Enhanced Speech Recognition in Automotive Environment?
Proceedings of the INTERSPEECH 2011, 2011

i-vector Based Speaker Recognition on Short Utterances.
Proceedings of the INTERSPEECH 2011, 2011

Extending the Task of Diarization to Speaker Attribution.
Proceedings of the INTERSPEECH 2011, 2011

Fourier Active Appearance Models.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Gait energy volumes and frontal gait recognition using depth images.
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011

Person-independent facial expression detection using Constrained Local Models.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Activity Modelling in Crowded Environments: A Soft-Decision Approach.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Unusual Event Detection in Crowded Scenes Using Bag of LBPs in Spatio-Temporal Patches.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

An Exploration of Feature Detector Performance in the Thermal-Infrared Modality.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Graph Rigidity for Near-Coplanar Structure from Motion.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Compressive Sensing for Gait Recognition.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Scene Invariant Crowd Counting.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Visual Voice Activity Detection Using Frontal versus Profile Views.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Negative Determinant of Hessian Features.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Practical Improvements to Simultaneous Computation of Multi-view Geometry and Radial Lens Distortion.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Evaluating Automatic Road Detection across a Large Aerial Imagery Collection.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

3D ellipsoid fitting for multi-view gait recognition.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Textures of optical flow for real-time anomaly detection in crowds.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Determining operational measures from multi-camera surveillance systems using soft biometrics.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010
A Comparison of Session Variability Compensation Approaches for Speaker Verification.
IEEE Trans. Inf. Forensics Secur., 2010

Making Confident Speaker Verification Decisions With Minimal Speech.
IEEE Trans. Speech Audio Process., 2010

Data-Driven Background Dataset Selection for SVM-Based Speaker Verification.
IEEE Trans. Speech Audio Process., 2010

Dynamic visual features for audio-visual speaker verification.
Comput. Speech Lang., 2010

Multi-spectral fusion for surveillance systems.
Comput. Electr. Eng., 2010

Robust mean super-resolution for less cooperative NIR iris recognition at a distance and on the move.
Proceedings of the 2010 Symposium on Information and Communication Technology, 2010

Experiments in SVM-based Speaker Verification Using Short Utterances.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

On the Use of Factor Analysis with Restricted Target Data in Speaker Verification.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Bayes Factor based speaker clustering for speaker diarization.
Proceedings of the 10th International Conference on Information Sciences, 2010

Fusing shrinking and expanding active contour models for robust iris segmentation.
Proceedings of the 10th International Conference on Information Sciences, 2010

Lip detection for audio-visual speech recognition in-car environment.
Proceedings of the 10th International Conference on Information Sciences, 2010

QUT Speaker Identity Verification system for EVALITA 2009.
Proceedings of the 10th International Conference on Information Sciences, 2010

Eigengaze - covert behavioral biometric exploiting visual attention characteristics.
Proceedings of the 10th International Conference on Information Sciences, 2010

Bayes factor based speaker segmentation for speaker diarization.
Proceedings of the INTERSPEECH 2010, 2010

Noise robust voice activity detection using features extracted from the time-domain autocorrelation function.
Proceedings of the INTERSPEECH 2010, 2010

The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms.
Proceedings of the INTERSPEECH 2010, 2010

Optimising Figure of Merit for phonetic spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2010

Exploiting multiple feature sets in data-driven impostor dataset selection for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Clustering of ad-hoc microphone arrays for robust blind beamforming.
Proceedings of the IEEE International Conference on Acoustics, 2010

Noise robust voice activity detection using normal probability testing and time-domain histogram analysis.
Proceedings of the IEEE International Conference on Acoustics, 2010

Accurate Silhouettes for Surveillance - Improved Motion Segmentation Using Graph Cuts.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2010

Crowd Counting Using Group Tracking and Local Features.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

Multi-Modal Object Tracking using Dynamic Performance Metrics.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

Cascading appearance-based features for visual voice activity detection.
Proceedings of the Auditory-Visual Speech Processing, 2010

Exploring visual features through Gabor representations for facial expression detection.
Proceedings of the Auditory-Visual Speech Processing, 2010

2009
Efficient constrained local model fitting for non-rigid face alignment.
Image Vis. Comput., 2009

The effect of language models on phonetic decoding for spoken term detection.
Proceedings of the third workshop on Searching spontaneous conversational speech, 2009

Within-session variability modelling for factor analysis speaker verification.
Proceedings of the INTERSPEECH 2009, 2009

Improved GMM-based speaker verification using SVM-driven impostor dataset selection.
Proceedings of the INTERSPEECH 2009, 2009

Assessment of speech dialog systems using multi-modal cognitive load analysis and driving performance metrics.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009

PhD forum: Multiple camera management using wide base-line matching.
Proceedings of the Third ACM/IEEE International Conference on Distributed Smart Cameras, 2009

Least-squares congealing for large numbers of images.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Minimising Speaker Verification Utterance Length through Confidence Based Early Verification Decisions.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

Data-Driven Impostor Selection for T-Norm Score Normalisation and the Background Dataset in SVM-Based Speaker Verification.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

Scatter Difference NAP for SVM Speaker Recognition.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

Spoken term detection using fast phonetic decoding.
Proceedings of the IEEE International Conference on Acoustics, 2009

Improved SVM speaker verification through data-driven background dataset collection.
Proceedings of the IEEE International Conference on Acoustics, 2009

The Australian English Speech Corpus for In-Car Speech processing.
Proceedings of the IEEE International Conference on Acoustics, 2009

Crowd Counting Using Multiple Local Features.
Proceedings of the DICTA 2009, 2009

Dense Correspondence Extraction in Difficult Uncalibrated Scenarios.
Proceedings of the DICTA 2009, 2009

Improved Simultaneous Computation of Motion Detection and Optical Flow for Object Tracking.
Proceedings of the DICTA 2009, 2009

Soft-Biometrics: Unconstrained Authentication in a Surveillance Environment.
Proceedings of the DICTA 2009, 2009

Automatically detecting action units from faces of pain: Comparing shape and appearance features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

Affine Adaptation of Local Image Features Using the Hessian Matrix.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Dynamic Performance Measures for Object Tracking Systems.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Automatically detecting pain using facial actions.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008
3D face verification using a free-parts approach.
Pattern Recognit. Lett., 2008

Explicit modelling of session variability for speaker verification.
Comput. Speech Lang., 2008

Factor analysis modelling for speaker verification with short utterances.
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

Discriminant NAP for SVM speaker recognition.
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

Comparing object alignment algorithms with appearance variation: Forward-additive vs inverse-composition.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Factor analysis subspace estimation for speaker verification with short utterances.
Proceedings of the INTERSPEECH 2008, 2008

Continuous pose-invariant lipreading.
Proceedings of the INTERSPEECH 2008, 2008

Cascading appearance-based features for visual speaker verification.
Proceedings of the INTERSPEECH 2008, 2008

Dealing with uncertainty in microphone placement in a microphone array speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2008

Normalisation and Recognition of 3D Face Data Using Robust Hausdorff Metric.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

Improved GrabCut Segmentation via GMM Optimisation.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

Least squares congealing for unsupervised alignment of images.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Patch-based analysis of visual speech from multiple views.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

Improving pain recognition through better utilisation of temporal information.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

Fused HMM adaptation of synchronous HMMs for audio-visual speaker verification.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007
Multiscale Representation for 3-D Face Recognition.
IEEE Trans. Inf. Forensics Secur., 2007

Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting.
IEEE Trans. Speech Audio Process., 2007

An adaptive optical flow technique for person tracking systems.
Pattern Recognit. Lett., 2007

Automatic Tracking, Super-Resolution and Recognition of Human Faces from Surveillance Video.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2007), 2007

Robust Real Time Multi-Layer Foreground Segmentation.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2007), 2007

A phonetic search approach to the 2006 NIST spoken term detection evaluation.
Proceedings of the INTERSPEECH 2007, 2007

A comparison of session variability compensation techniques for SVM-based speaker recognition.
Proceedings of the INTERSPEECH 2007, 2007

A unified approach to multi-pose audio-visual ASR.
Proceedings of the INTERSPEECH 2007, 2007

Fused HMM-adaptation of multi-stream HMMs for audio-visual speech recognition.
Proceedings of the INTERSPEECH 2007, 2007

SVM Speaker Verification Using Session Variability Modelling and GMM Supervectors.
Proceedings of the Advances in Biometrics, International Conference, 2007

Super-Resolved Faces for Improved Face Recognition from Surveillance Video.
Proceedings of the Advances in Biometrics, International Conference, 2007

Robust 3D Face Recognition from Expression Categorisation.
Proceedings of the Advances in Biometrics, International Conference, 2007

An extended pose-invariant lipreading system.
Proceedings of the Auditory-Visual Speech Processing 2007, 2007

Weighting and normalisation of synchronous HMMs for audio-visual speech recognition.
Proceedings of the Auditory-Visual Speech Processing 2007, 2007

2006
Gaze tracking for region of interest coding in JPEG 2000.
Signal Process. Image Commun., 2006

A syllable-scale framework for language identification.
Comput. Speech Lang., 2006

What Is the Average Human Face?
Proceedings of the Advances in Image and Video Technology, First Pacific Rim Symposium, 2006

Towards Improved Assessment of Phonotactic Information for Automatic Language Identification.
Proceedings of the Odyssey 2006, 2006

Application Specific Bounds on Detection Cost using Game Theory.
Proceedings of the Odyssey 2006, 2006

Speaker Verification using Hidden Markov Models in a Multilingual Text-constrained Framework.
Proceedings of the Odyssey 2006, 2006

Experiments in Session Variability Modelling for Speaker Verification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Feature Modelling of PCA Difference Vectors for 2D and 3D Face Recognition.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

Human Face Reconstruction Using Bayesian Deformable Models.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

The Role of Motion Models in Super-Resolving Surveillance Video for Face Recognition.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

Multi-view Intelligent Vehicle Surveillance System.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

A Multi-Class Tracker Using a Scalable Condensation Filter.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

Combined 2D/3D Face Recognition Using Log-Gabor Templates.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

On the Performance and Use of Speaker Recognition Systems for Surveillance.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

2005
Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition.
IEEE Trans. Multim., 2005

Texture for Script Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

Real-Time Adaptive Foreground/Background Segmentation.
EURASIP J. Adv. Signal Process., 2005

Face recognition from super-resolved images.
Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005

Texture classification using Gabor energy features and higher order spectral features: a comparative study.
Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005

Tracking people in 3D using position, size and shape.
Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005

Comparing audio and visual information for speech processing.
Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005

Modelling session variability in text-independent speaker verification.
Proceedings of the INTERSPEECH 2005, 2005

Data-driven clustering for blind feature mapping in speaker verification.
Proceedings of the INTERSPEECH 2005, 2005

Gaussian mixture modelling of broad phonetic and syllabic events for text-independent speaker verification.
Proceedings of the INTERSPEECH 2005, 2005

Dynamic Match Phone-Lattice Searches For Very Fast And Accurate Unrestricted Vocabulary Keyword Spotting.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Cross-language Acoustic Model Refinement for the Indonesian Language.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Using a Free-Parts Representation for Visual Speech Recognition.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2005

Adaptive Optical Flow for Person Tracking.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2005

Object Recognition Using Stereo Vision and Higher Order Spectra.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2005

Gabor Filter Bank Representation for 3D Face Recognition.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2005

Problems associated with current area-based visual speech feature extraction techniques.
Proceedings of the Auditory-Visual Speech Processing 2005, 2005

Audio-visual speaker identification using the CUAVE database.
Proceedings of the Auditory-Visual Speech Processing 2005, 2005

2004
Importance prioritisation in JPEG 2000 for improved interpretability.
Signal Process. Image Commun., 2004

Bayes factor scoring of GMMs for speaker verification.
Proceedings of the ODYSSEY 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Pitch and energy trajectory modelling in a syllable length temporal framework for language identification.
Proceedings of the ODYSSEY 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Improved phonetic and lexical speaker recognition through MAP adaptation.
Proceedings of the ODYSSEY 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Visual attention based ROI maps from gaze tracking data.
Proceedings of the 2004 International Conference on Image Processing, 2004

Techniques for improving stereo depth maps of faces.
Proceedings of the 2004 International Conference on Image Processing, 2004

An Application of Fractal Image-Set Coding in Facial Recognition.
Proceedings of the Biometric Authentication, First International Conference, 2004

Speaker Identification Using Higher Order Spectral Phase Features and their Effectiveness vis-a-vis Mel-Cepstral Features.
Proceedings of the Biometric Authentication, First International Conference, 2004

Logarithmic quantisation of wavelet coefficients for improved texture classification performance.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Multi-Spectral Stereo Image Matching using Mutual Information.
Proceedings of the 2nd International Symposium on 3D Data Processing, 2004

Face Recognition from 3D Data using Iterative Closest Point Algorithm and Gaussian Mixture Models.
Proceedings of the 2nd International Symposium on 3D Data Processing, 2004

2003
Interpretability performance assessment of JPEG2000 and part 1 compliant region of interest coding.
IEEE Trans. Consumer Electron., 2003

Improved Facial-Feature Detection for AVSP via Unsupervised Clustering and Discriminant Analysis.
EURASIP J. Adv. Signal Process., 2003

Interpolative coding of speech parameters using hierarchical temporal decomposition.
Digit. Signal Process., 2003

Importance prioritization coding in JPEG2000 for interpretability with application to surveillance imagery.
Proceedings of the Visual Communications and Image Processing 2003, 2003

Multilingual phone clustering for recognition of spontaneous Indonesian speech utilising pronunciation modelling techniques.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Dependence of GMM adaptation on feature post-processing for speaker recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Isolated word verification using cohort word-level verification.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Cross-lingual pronunciation modelling for Indonesian speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Three approaches to multilingual phone recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Real-time adaptive background segmentation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Robust Face Localisation Using Motion, Colour and Fusion.
Proceedings of the Seventh International Conference on Digital Image Computing: Techniques and Applications, 2003

2002
Adaptive mouth segmentation using chromatic features.
Pattern Recognit. Lett., 2002

Near-field Adaptive Beamformer for Robust Speech Recognition.
Digit. Signal Process., 2002

Methods to improve Gaussian mixture model based language identification system.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Chromatic colour spaces for skin detection using GMMs.
Proceedings of the IEEE International Conference on Acoustics, 2002

Progressive coding in JPEG2000 - improving content recognition performance using ROIs and importance maps.
Proceedings of the 11th European Signal Processing Conference, 2002

2001
Multi-Channel Sub-Band Speech Recognition.
EURASIP J. Adv. Signal Process., 2001

Adaptive Fusion of Speech and Lip Information for Robust Speaker Identification.
Digit. Signal Process., 2001

Feature warping for robust speaker verification.
Proceedings of the 2001: A Speaker Odyssey, 2001

Robust speaker recognition using microphone arrays.
Proceedings of the 2001: A Speaker Odyssey, 2001

Importance coding of surveillance imagery for interpretability using quadtree dynamic importance maps.
Proceedings of the Sixth International Symposium on Signal Processing and its Applications, 2001

Improving visual noise insensitivity in small vocabulary audio visual speech recognition applications.
Proceedings of the Sixth International Symposium on Signal Processing and its Applications, 2001

Robustness to expression variations in fractal-based face recognition.
Proceedings of the Sixth International Symposium on Signal Processing and its Applications, 2001

An investigation of HMM classifier combination strategies for improved audio-visual speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Application of the trended hidden Markov model to speech synthesis.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Importance coding of still imagery based on importance maps of visually interpretable regions.
Proceedings of the 2001 International Conference on Image Processing, 2001

A suitability metric for mouth tracking through chromatic segmentation.
Proceedings of the 2001 International Conference on Image Processing, 2001

Face recognition using fractal codes.
Proceedings of the 2001 International Conference on Image Processing, 2001

Microphone array sub-band speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001

Trainable speech synthesis with trended hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Frequency offset correction for HF radio speech reception.
IEEE Trans. Ind. Electron., 2000

Improving Speech Recognition Accuracy for Small Vocabulary Applications in Adverse Environments.
Int. J. Speech Technol., 2000

Vector Quantization Based Gaussian Modeling for Speaker Verification.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Initialized Eigenlip Estimator for Fast Lip Tracking Using Linear Regression.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

The use of temporal speech and lip information for multi-modal speaker identification via multi-stream HMMs.
Proceedings of the IEEE International Conference on Acoustics, 2000

Hybrid coding of mixed signals for digital covert audio surveillance.
Proceedings of the IEEE International Conference on Acoustics, 2000

Improving the performance of a small microphone array at low frequencies using critical band and LPC codebooks.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Adaptive Vector Quantization for Speech Spectrum Coding.
Digit. Signal Process., 1999

Fast Search Methods for Spectral Quantization.
Digit. Signal Process., 1999

Enhancing automatic speaker identification using phoneme clustering and frame based parameter and frame size selection.
Proceedings of the ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications, 1999

Digital coding of covert audio for monitoring and storage.
Proceedings of the ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications, 1999

Chromatic lip tracking using a connectivity based fuzzy thresholding technique.
Proceedings of the ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications, 1999

Speech compaction using vector quantisation and hidden Markov models.
Proceedings of the ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications, 1999

Modelling output probability distributions for enhancing speaker recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

The Use of Speech and Lip Modalities for Robust Speaker Verification under Adverse Conditions.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Robust speaker verification via fusion of speech and lip modalities.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Improving speaker identification performance in reverberant conditions using lip information.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Modeling of output probability distribution to improve small vocabulary speech recognition in adverse environments.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A comparison of fusion techniques in mel-cepstral based speaker identification.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Speech enhancement using critical band spectral subtraction.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

On the convergence of Gaussian mixture models: improvements through vector quantization.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Hierarchical temporal decomposition: a novel approach to efficient compression of spectral characteristics of speech.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

An approach to statistical lip modelling for speaker identification via chromatic feature extraction.
Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998

A syntactic approach to automatic lip feature extraction for speaker identification.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Two novel lossless algorithms to exploit index redundancy in VQ speech compression.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Multichannel speech separation by eigendecomposition and its application to co-talker interference removal.
IEEE Trans. Speech Audio Process., 1997

Automatic gender identification under adverse conditions.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Robust enhancement of reverberant speech using iterative noise removal.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speech compression with preservation of speaker identity.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Telephone based speaker recognition using multiple binary classifier and Gaussian mixture models.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Speech separation by simulating the cocktail party effect with a neural network controlled Wiener filter.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
A two stage fuzzy decision classifier for speaker identification.
Speech Commun., 1996

Enhancing The Multiple Binary Classifier Model.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

A Comparison of Three Discriminant Models for Automatic Speaker Verification.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Robust Speech Coding for the Preservation of Speaker Identity.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Comparison of Four Distance Measures for Long Time Text-Independent Speaker Identification.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Improving The Effectiveness of Existing Noise Reduction Techniques Using Neural Networks.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Enhancement Methods for Reverberant Speech.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Intelligibility Measurement of Processed Reverberant Speech.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Academic Strategy Planning For A University Research Centre.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Speech Enhancement By Simulation Of Cocktail Party Effect With Neural Network Controlled Iterative Filter.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

An Intelligent Microphone Array for Speech Enhancement.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

A New Approach To Teaching Signal Processing At Undergraduate Level.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Speaker recognition in reverberant enclosures.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

A comparison of Gaussian mixture and multiple binary classifier models for speaker verification.
Proceedings of the Australian New Zealand Conference on Intelligent Information Systems, 1996

1995
Speech enhancement by eigen decomposition with two-channel observations.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Speech-seeking microphone array with multi-stage processing.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
The design and development of an undergraduate signal processing laboratory.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1987
Implementation of state-space digital filter structures using block floating-point arithmetic.
Proceedings of the IEEE International Conference on Acoustics, 1987

