Mubarak Shah

Orcid: 0000-0001-6172-5572

Affiliations:
  • University of Central Florida, Orlando, USA


According to our database1, Mubarak Shah authored at least 543 papers between 1984 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Rethinking Data Heterogeneity in Federated Learning: Introducing a New Notion and Standard Benchmarks.
IEEE Trans. Artif. Intell., March, 2024

Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

CT-VOS: Cutout prediction and tagging for self-supervised video object segmentation.
Comput. Vis. Image Underst., January, 2024

Deep Learning-based Human Pose Estimation: A Survey.
ACM Comput. Surv., January, 2024

Low-Rank and Sparse Decomposition for Low-Query Decision-Based Adversarial Attacks.
IEEE Trans. Inf. Forensics Secur., 2024

Towards Temporally Consistent Referring Video Object Segmentation.
CoRR, 2024

Composed Video Retrieval via Enriched Context and Discriminative Embeddings.
CoRR, 2024

VidLA: Video-Language Alignment at Scale.
CoRR, 2024

AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation.
CoRR, 2024

FSViewFusion: Few-Shots View Generation of Novel Objects.
CoRR, 2024

CodaMal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes.
CoRR, 2024

DVANet: Disentangling View and Action Features for Multi-View Action Recognition.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

No More Shortcuts: Realizing the Potential of Temporal Self-Supervision.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Guest Editorial Introduction to the Special Section on Transformer Models in Vision.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Diffusion Models in Vision: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

SSMTL++: Revisiting self-supervised multi-task learning for video anomaly detection.
Comput. Vis. Image Underst., March, 2023

Dense Video Captioning With Early Linguistic Information Fusion.
IEEE Trans. Multim., 2023

Language Model Agnostic Gray-Box Adversarial Attack on Image Captioning.
IEEE Trans. Inf. Forensics Secur., 2023

MutualNet: Adaptive ConvNet via Mutual Learning From Different Model Configurations.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Self-Supervised Learning for Videos: A Survey.
ACM Comput. Surv., 2023

Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception?
CoRR, 2023

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models.
CoRR, 2023

Videoprompter: an ensemble of foundational models for zero-shot video understanding.
CoRR, 2023

Egocentric RGB+Depth Action Recognition in Industry-Like Settings.
CoRR, 2023

Ensemble Modeling for Multimodal Visual Action Recognition.
CoRR, 2023

Reverse Stable Diffusion: What prompt was used to generate this image?
CoRR, 2023

Foundational Models Defining a New Era in Vision: A Survey and Outlook.
CoRR, 2023

Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors.
CoRR, 2023

Exploiting the Brain's Network Structure for Automatic Identification of ADHD Subjects.
CoRR, 2023

R<sup>2</sup>Former: Unified Retrieval and Reranking Transformer for Place Recognition.
CoRR, 2023

Video Instance Segmentation in an Open-World.
CoRR, 2023

GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

EventTransAct: A Video Transformer-Based Framework for Event-Camera Based Action Recognition.
IROS, 2023

TransVisDrone: Spatio-Temporal Transformer for Vision-based Drone-to-Drone Detection in Aerial Videos.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

3DMODT: Attention-Guided Affinities for Joint Detection & Tracking in 3D Point Clouds.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Re-calibrating Feature Attributions for Model Interpretation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dual Student Networks for Data-Free Model Stealing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Xi-Net: Transformer based Seismic Waveform Reconstructor.
Proceedings of the IEEE International Conference on Image Processing, 2023

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

When Do Curricula Work in Federated Learning?
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Preserving Modality Structure Improves Multi-Modal Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CDFSL-V: Cross-Domain Few-Shot Learning for Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diffusion Action Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

$R^{2}$ Former: Unified Retrieval and Reranking Transformer for Place Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Situation Hyper-Graphs for Video Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Person Image Synthesis via Denoising Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Class Prototypes based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Efficient Distribution Similarity Identification in Clustered Federated Learning via Principal Angles between Client Data Subspaces.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Contrastive Self-Supervised Learning Leads to Higher Adversarial Susceptibility.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Transformers in Vision: A Survey.
ACM Comput. Surv., January, 2022

Introduction to the Special Issue on Fine-Grained Visual Recognition and Re-Identification.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Cross-Domain Modality Fusion for Dense Video Captioning.
IEEE Trans. Artif. Intell., 2022

A Background-Agnostic Framework With Adversarial Training for Abnormal Event Detection in Video.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

TCLR: Temporal contrastive learning for video representation.
Comput. Vis. Image Underst., 2022

Lightning Fast Video Anomaly Detection via Adversarial Knowledge Distillation.
CoRR, 2022

Query Efficient Cross-Dataset Transferable Black-Box Attack on Action Recognition.
CoRR, 2022

Adversarial Pretraining of Self-Supervised Deep Networks: Past, Present and Future.
CoRR, 2022

On Higher Adversarial Susceptibility of Contrastive Self-Supervised Learning.
CoRR, 2022

Learning with Capsules: A Survey.
CoRR, 2022

EBM Life Cycle: MCMC Strategies for Synthesis, Defense, and Density Modeling.
CoRR, 2022

Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging.
CoRR, 2022

Video Action Detection: Analysing Limitations and Challenges.
CoRR, 2022

Transferable 3D Adversarial Textures using End-to-end Optimization.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

GabriellaV2: Towards better generalization in surveillance videos for Action Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Self Supervised Learning for Multiple Object Tracking in 3D Point Clouds.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Video Generation from Text Employing Latent Path Construction for Temporal Modeling.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Tag-Based Attention Guided Bottom-Up Approach for Video Instance Segmentation.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Detecting Compromised Architecture/Weights of a Deep Model.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Self-Joint Supervised Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

DeepSAR: Vessel Detection in SAR Imagery with Noisy Labels.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

GAMa: Cross-View Video Geo-Localization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Realistic Semi-supervised Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Weakly Supervised Grounding for VQA in Vision-Language Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OW-DETR: Open-world Detection Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SPAct: Self-supervised Privacy Preservation for Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

PSTR: End-to-End One-Step Person Search With Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Adversarial Learning for Personalized Tag Recommendation.
IEEE Trans. Multim., 2021

Odyssey: Creation, Analysis and Detection of Trojan Models.
IEEE Trans. Inf. Forensics Secur., 2021

Exploiting structured high-level knowledge for domain-specific visual classification.
Pattern Recognit., 2021

Norm-Preservation: Why Residual Networks Can Become Extremely Deep?
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Deep Affinity Network for Multiple Object Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

On Symbiosis of Attribute Prediction and Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Human action recognition in drone videos using a few aerial training examples.
Comput. Vis. Image Underst., 2021

Geometric Feature Learning for 3D Meshes.
CoRR, 2021

Routing with Self-Attention for Multimodal Capsule Networks.
CoRR, 2021

"Knights": First Place Submission for VIPriors21 Action Recognition Challenge at ICCV 2021.
CoRR, 2021

Threat of Adversarial Attacks on Deep Learning in Computer Vision: Survey II.
CoRR, 2021

TinyAction Challenge: Recognizing Real-world Low-resolution Activities in Videos.
CoRR, 2021

Controlled Caption Generation for Images Through Adversarial Attacks.
CoRR, 2021

Florida Wildlife Camera Trap Dataset.
CoRR, 2021

PC-DAN: Point Cloud based Deep Affinity Network for 3D Multi-Object Tracking (Accepted as an extended abstract in JRDB-ACT Workshop at CVPR21).
CoRR, 2021

LSDAT: Low-Rank and Sparse Decomposition for Decision-based Adversarial Attack.
CoRR, 2021

Cassandra: Detecting Trojaned Networks From Adversarial Perturbations.
IEEE Access, 2021

Advances in Adversarial Attacks and Defenses in Computer Vision: A Survey.
IEEE Access, 2021

Reformulating Zero-shot Action Recognition for Multi-label Actions.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Unsupervised Discriminative Embedding For Sub-Action Learning in Complex Activities.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Novel View Video Prediction using a Dual Representation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Face Image Retrieval with Attribute Manipulation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Video Geo-Localization Employing Geo-Temporal Feature Learning and GPS Trajectory Smoothing.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Discriminative Region-based Multi-Label Zero-Shot Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Handwriting Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Out-of-Distribution Detection Using Union of 1-Dimensional Subspaces.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Modeling Multi-Label Action Dependencies for Temporal Action Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Anomaly Detection in Video via Self-Supervised and Multi-Task Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PLM: Partial Label Masking for Imbalanced Multi-Label Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Dogfight: Detecting Drones From Drones Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Meta-learning the Learning Trends Shared Across Tasks.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Self-supervised Knowledge Distillation for Few-shot Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Photography and Exploration of Tourist Locations Based on Optimal Foraging Theory.
IEEE Trans. Circuits Syst. Video Technol., 2020

Training Faster by Separating Modes of Variation in Batch-Normalized Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Adversarial Framework for Unsupervised Learning of Motion Dynamics in Videos.
Int. J. Comput. Vis., 2020

Video Description: A Survey of Methods, Datasets, and Evaluation Metrics.
ACM Comput. Surv., 2020

Deep Learning-Based Human Pose Estimation: A Survey.
CoRR, 2020

Correct block-design experiments mitigate temporal correlation bias in EEG classification.
CoRR, 2020

A Scene-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video.
CoRR, 2020

Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events.
CoRR, 2020

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Surveillance Videos.
CoRR, 2020

UCF-System: Activity Detection in Untrimmed Videos.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Text Synopsis Generation for Egocentric Videos.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

RescueNet: Joint Building Segmentation and Damage Assessment from Satellite Imagery.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

TinyVIRAT: Low-resolution Video Action Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Deep Photo Cropper And Enhancer.
Proceedings of the IEEE International Conference on Image Processing, 2020

MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Count- and Similarity-Aware R-CNN for Pedestrian Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Multi-view Action Recognition Using Cross-View Video Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking.
Proceedings of the Computer Vision - ECCV 2020, 2020





iTAML: An Incremental Task-Agnostic Meta-learning Approach.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Visual-Textual Capsule Routing for Text-Based Video Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Select to Better Learn: Fast and Accurate Deep Learning Using Data Selection From Nonlinear Manifolds.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Uncertainty Estimation and Sample Selection for Crowd Counting.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

SubSpace Capsule Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Large-Scale Image Geo-Localization Using Dominant Sets.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

On Detection, Data Association and Segmentation for Multi-Target Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Online Localization and Prediction of Actions and Interactions.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Multi-target Tracking in Multiple Non-overlapping Cameras Using Fast-Constrained Dominant Sets.
Int. J. Comput. Vis., 2019

Holistic object detection and image understanding.
Comput. Vis. Image Underst., 2019

Crowd Transformer Network.
CoRR, 2019

UCF's 30-year REU site in computer vision.
Commun. ACM, 2019

An Online System for Real-Time Activity Detection in Untrimmed Surveillance Videos.
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

Unsupervised Meta-Learning for Few-Shot Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Bridging the Domain Gap for Ground-to-Aerial Image Matching.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Deep Constrained Dominant Sets for Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Iterative Projection and Matching: Finding Structure-Preserving Representatives and Its Application to Computer Vision.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Pay Attention! - Robustifying a Deep Visuomotor Policy Through Task-Focused Visual Attention.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

An Efficient 3D CNN for Action/Object Segmentation in Video.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Learning a Multi-Concept Video Retrieval Model with Multiple Latent Variables.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Learning a Deep Model for Human Action Recognition from Novel Viewpoints.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Binary Quadratic Programing for Online Tracking of Hundreds of People in Extremely Crowded Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

EgoReID: Person re-identification in Egocentric Videos Acquired by Mobile Devices with First-Person Point-of-View.
CoRR, 2018

Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries.
CoRR, 2018

Unsupervised Meta-Learning For Few-Shot Image and Video Classification.
CoRR, 2018

Time-Aware and View-Aware Video Rendering for Unsupervised Representation Learning.
CoRR, 2018

Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features.
CoRR, 2018

Pay attention! - Robustifying a Deep Visuomotor Policy through Task-Focused Attention.
CoRR, 2018

Enhancing camera surveillance using computer vision: a research note.
CoRR, 2018

Task-Agnostic Meta-Learning for Few-shot Learning.
CoRR, 2018

VOS-GAN: Adversarial Learning of Visual-Temporal Dynamics for Unsupervised Dense Prediction in Videos.
CoRR, 2018

Action and Object Detection for TRECVID.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

VideoCapsuleNet: A Simplified Network for Action Detection.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

ThoughtViz: Visualizing Human Thoughts Using Generative Adversarial Network.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Generating Synthetic Video Sequences by Explicitly Modeling Object Motion.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Visual Text Correction.
Proceedings of the Computer Vision - ECCV 2018, 2018

Composition Loss for Counting, Density Map Estimation and Localization in Dense Crowds.
Proceedings of the Computer Vision - ECCV 2018, 2018

Real-World Anomaly Detection in Surveillance Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

ClusterNet: Detecting Small Objects in Large Scenes by Exploiting Spatio-Temporal Information.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Human Semantic Parsing for Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Improved scene identification and object detection on egocentric vision of daily activities.
Comput. Vis. Image Underst., 2017

Unsupervised action proposal ranking through proposal recombination.
Comput. Vis. Image Underst., 2017

Automatic action annotation in weakly labeled videos.
Comput. Vis. Image Underst., 2017

The THUMOS challenge on action recognition for videos "in the wild".
Comput. Vis. Image Underst., 2017

An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos.
CoRR, 2017

Multi-Target Tracking in Multiple Non-Overlapping Cameras using Constrained Dominant Sets.
CoRR, 2017

Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network.
CoRR, 2017

Fully Convolutional Deep Neural Networks for Persistent Multi-Frame Multi-Object Detection in Wide Area Aerial Videos.
CoRR, 2017

<i>Brain2Image</i>: Converting Brain Signals into Images.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Semi Supervised Semantic Segmentation Using Generative Adversarial Network.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Unsupervised Action Discovery and Localization in Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Generative Adversarial Networks Conditioned by Brain Signals.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Cross-View Image Matching for Geo-Localization in Urban Environments.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep Learning Human Mind for Automated Visual Classification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Improving Facial Attribute Prediction Using Semantic Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Real-Time Temporal Action Localization in Untrimmed Videos by Sub-Action Discovery.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Introduction to Large-Scale Visual Geo-localization.
Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Image Computing, 2016

Visual Saliency Detection Using Group Lasso Regularization in Videos of Natural Scenes.
Int. J. Comput. Vis., 2016

A Framework for Human Pose Estimation in Videos.
CoRR, 2016

On Duality Of Multiple Target Tracking and Segmentation.
CoRR, 2016

Scene Labeling Through Knowledge-Based Rules Employing Constrained Integer Linear Programing.
CoRR, 2016

Video Fill in the Blank with Merging LSTMs.
CoRR, 2016

Autonomous navigation for low-altitude UAVs in urban areas.
CoRR, 2016

Covariance of Motion and Appearance Featuresfor Spatio Temporal Recognition Tasks.
CoRR, 2016

Re-identification of Humans in Crowds using Personal, Social and Environmental Constraints.
CoRR, 2016

Query-Focused Extractive Video Summarization.
Proceedings of the Computer Vision - ECCV 2016, 2016

Human Re-identification in Crowd Videos Using Personal, Social and Environmental Constraints.
Proceedings of the Computer Vision - ECCV 2016, 2016

Fast Zero-Shot Image Tagging.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

What If We Do Not have Multiple Videos of the Same Action? - Video Action Localization Using Web Images.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Scene Labeling Using Sparse Precision Matrix.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Predicting the Where and What of Actors and Actions through Online Action Localization.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Detecting Humans in Dense Crowds Using Locally-Consistent Scale Prior and Global Occlusion Reasoning.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Complex event recognition using constrained low-rank representation.
Image Vis. Comput., 2015

Understanding Trajectory Behavior: A Motion Pattern Approach.
CoRR, 2015

UCF-CRCV at TRECVID 2015: Semantic Indexing.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Semantic Image Search From Multiple Query Images.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

How to Take a Good Selfie?
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Tracking When the Camera Looks Away.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Human Pose Estimation in Videos.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Action Localization in Videos through Context Walk.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Target Identity-aware Network Flow for online multiple target tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

GMMCP tracker: Globally optimal Generalized Maximum Multi Clique problem for multiple object tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Geo-semantic segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015


2014
Classification of Cinematographic Shots Using Lie Algebra and its Application to Complex Event Recognition.
IEEE Trans. Multim., 2014

Image Geo-Localization Based on MultipleNearest Neighbor Feature Matching UsingGeneralized Graphs.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Visual Tracking: An Experimental Survey.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Tracking in dense crowds using prominence and neighborhood motion concurrence.
Image Vis. Comput., 2014

UCF-CRCV at TRECVID 2014: Semantic Indexing.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

SRI-Sarnoff AURORA System at TRECVID 2014 Multimedia Event Detection and Recounting.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Learning discriminative features and metrics for measuring action similarity.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Complex event recognition by latent temporal models of concepts.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Video Object Co-segmentation by Regulated Maximum Weight Cliques.
Proceedings of the Computer Vision - ECCV 2014, 2014

DaMN - Discriminative and Mutually Nearest: Exploiting Pairwise Category Proximity for Video Action Recognition.
Proceedings of the Computer Vision - ECCV 2014, 2014

GIS-Assisted Object Detection and Geospatial Localization.
Proceedings of the Computer Vision - ECCV 2014, 2014

GPS-Tag Refinement Using Random Walks with an Adaptive Damping Factor.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

NMF-KNN: Image Annotation Using Weighted Multi-view Non-negative Matrix Factorization.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Who Do I Look Like? Determining Parent-Offspring Resemblance via Gated Autoencoders.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Improving Semantic Concept Detection through the Dictionary of Visually-Distinct Elements.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Recognition of Complex Events: Exploiting Temporal Dynamics between Underlying Concepts.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Video Classification Using Semantic Concept Co-occurrences.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Multimodal Analysis for Identification and Segmentation of Moving-Sounding Objects.
IEEE Trans. Multim., 2013

Discovering Motion Primitives for Unsupervised Grouping and One-Shot Learning of Human Actions, Gestures, and Expressions.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Simultaneous Video Stabilization and Moving Object Detection in Turbulence.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Classifying web videos using a global video descriptor.
Mach. Vis. Appl., 2013

Recognizing 50 human action categories of web videos.
Mach. Vis. Appl., 2013

Multi-agent event recognition by preservation of spatiotemporal relationships between probabilistic models.
Image Vis. Comput., 2013

High-level event recognition in unconstrained videos.
Int. J. Multim. Inf. Retr., 2013

Multiframe Many-Many Point Correspondence for Vehicle Tracking in High Density Wide Area Aerial Videos.
Int. J. Comput. Vis., 2013

Shadow Casting Out Of Plane (SCOOP) Candidates for Human and Vehicle Detection in Aerial Imagery.
Int. J. Comput. Vis., 2013

Face Verification Using Boosted Cross-Image Features.
CoRR, 2013

BBN VISER TRECVID 2013 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

SRI-Sarnoff AURORA System at TRECVID 2013 Multimedia Event Detection and Recounting.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

UCF-CRCV at TRECVID 2013: Semantic Indexing.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Visual business recognition: a multimodal approach.
Proceedings of the ACM Multimedia Conference, 2013

Towards a comprehensive computational model foraesthetic assessment of videos.
Proceedings of the ACM Multimedia Conference, 2013

Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Spatiotemporal Deformable Part Models for Action Detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Improving an Object Detector and Extracting Regions Using Superpixels.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Face Recognition in Movie Trailers via Mean Sequence Sparse Representation-Based Classification.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Multi-source Multi-scale Counting in Extremely Dense Crowd Images.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Crowd Flow Segmentation Using Lagrangian Particle Dynamics.
Proceedings of the Modeling, Simulation and Visual Analysis of Crowds, 2013

Modeling, Simulation and Visual Analysis of Crowds: A Multidisciplinary Perspective.
Proceedings of the Modeling, Simulation and Visual Analysis of Crowds, 2013

2012
Identifying Behaviors in Crowd Scenes Using Stability Analysis for Dynamical Systems.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Learning semantic features for action recognition via diffusion maps.
Comput. Vis. Image Underst., 2012

UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
CoRR, 2012

BBNVISER : BBN VISER TRECVID 2012 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

SRI-Sarnoff AURORA System at TRECVID 2012 Multimedia Event Detection and Recounting.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Macro-class Selection for Hierarchical k-NN Classification of Inertial Sensor Data.
Proceedings of the PECCS 2012, 2012

ADHD classification using bag of words approach on network features.
Proceedings of the Medical Imaging 2012: Image Processing, 2012

Confidence guided enhancing brain tumor segmentation in multi-parametric MRI.
Proceedings of the 9th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2012

GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs.
Proceedings of the Computer Vision - ECCV 2012, 2012

Complex Events Detection Using Data-Driven Concepts.
Proceedings of the Computer Vision - ECCV 2012, 2012

(MP)2T: Multiple People Multiple Parts Tracker.
Proceedings of the Computer Vision - ECCV 2012, 2012

Recognizing Complex Events Using Large Margin Joint Low-Level Event Model.
Proceedings of the Computer Vision - ECCV 2012, 2012

Statistical Inference of Motion in the Invisible.
Proceedings of the Computer Vision - ECCV 2012, 2012

Detection of Independently Moving Objects in Non-planar Scenes via Multi-Frame Monocular Epipolar Constraint.
Proceedings of the Computer Vision - ECCV 2012, 2012

City scale geo-spatial trajectory estimation of a moving camera.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Part-based multiple-person tracking with partial occlusion handling.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
A holistic approach to aesthetic enhancement of photographs.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Visual crowd surveillance through a hydrodynamics lens.
Commun. ACM, 2011



Horizon constraint for unambiguous UAV navigation in planar scenes.
Proceedings of the IEEE International Conference on Robotics and Automation, 2011

Street View Challenge: Identification of Commercial Entities in Street View Imagery.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

Keynote Abstracts.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Action recognition in videos acquired by a moving camera using motion decomposition of Lagrangian particle trajectories.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Similarity invariant classification of events by KL divergence minimization.
Proceedings of the IEEE International Conference on Computer Vision, 2011

A two-stage reconstruction approach for seeing through water.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011


Cross-view action recognition via view knowledge transfer.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

A probabilistic representation for efficient large scale visual recognition tasks.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

AVSS 2011 demo session: A large-scale benchmark dataset for event recognition in surveillance video.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010
Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Networked UAVs as aerial sensor network for disaster management applications.
Elektrotech. Informationstechnik, 2010

Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Visual crowd surveillance is like hydrodynamics.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

A framework for photo-quality assessment and enhancement based on visual aesthetics.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Segmentation of the Infarct and Peri-infarct Zones in Cardiac MR Images.
Proceedings of the Medical Imaging and Augmented Reality - 5th International Workshop, 2010

Accurate Image Localization Based on Google Maps Street View.
Proceedings of the Computer Vision, 2010

Geometric Constraints for Human Detection in Aerial Imagery.
Proceedings of the Computer Vision - ECCV 2010, 2010

Detection and Tracking of Large Number of Targets in Wide Area Surveillance.
Proceedings of the Computer Vision, 2010

A Streakline Representation of Flow in Crowded Scenes.
Proceedings of the Computer Vision, 2010

Chaotic invariants of Lagrangian particle trajectories for anomaly detection in crowded scenes.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Scene understanding by statistical modeling of motion patterns.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Human identity recognition in aerial images.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Modeling Interaction for Segmentation of Neighboring Structures.
IEEE Trans. Inf. Technol. Biomed., 2009

Automatic Tracking of <i>Escherichia Coli</i> in Phase-Contrast Microscopy Video.
IEEE Trans. Biomed. Eng., 2009

Probabilistic Modeling of Scene Dynamics for Applications in Visual Surveillance.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Tracking Multiple Occluding People by Localizing on Multiple Scene Planes.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Guest Editors' Introduction to the Special Section on Award Winning Papers from the IEEE CS Conference on Computer Vision and Pattern Recognition (CVPR).
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Matching Trajectories of Anatomical Landmarks Under Viewpoint, Anthropometric and Temporal Transforms.
Int. J. Comput. Vis., 2009

UCF @ TRECVID 2009 : High-Level Feature Extraction.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Event recognition from photo collections via PageRank.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Video Scene Understanding Using Multi-scale Analysis.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Incremental action recognition using feature-tree.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Time series prediction by chaotic modeling of nonlinear dynamical systems.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Action recognition in unconstrained amateur videos.
Proceedings of the IEEE International Conference on Acoustics, 2009

Abnormal crowd behavior detection using social force model.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Learning semantic visual vocabularies using diffusion distance.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Recognizing realistic actions from videos .
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Object Association Across Multiple Cameras.
Proceedings of the Multi-Camera Networks, 2009

2008
Automatic Segmentation of High-Throughput RNAi Fluorescent Cellular Images.
IEEE Trans. Inf. Technol. Biomed., 2008

Image Diffusion Using Saliency Bilateral Filter.
IEEE Trans. Inf. Technol. Biomed., 2008

Shape matching and modeling using skeletal context.
Pattern Recognit., 2008

Trajectory Association across Multiple Airborne Cameras.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Real-time edge-enhanced dynamic correlation and predictive open-loop car-following control for robust tracking.
Mach. Vis. Appl., 2008

MinGPU: a minimum GPU library for computer vision.
J. Real Time Image Process., 2008

A differential geometric approach to representing the human actions.
Comput. Vis. Image Underst., 2008

Modeling inter-camera space-time and appearance relationships for tracking across non-overlapping views.
Comput. Vis. Image Underst., 2008

Content based video matching using spatiotemporal volumes.
Comput. Vis. Image Underst., 2008

University of Central Florida at TRECVID 2008 Content Based Copy Detection and Surveillance Event Detection.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Visual surveillance in maritime port facilities.
Proceedings of the Visual Information Processing XVII, 2008

Automatic Tracking of Escherichia Coli Bacteria.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2008

Landing a UAV on a runway using image registration.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Unfolding warping for object recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Automatic video annotation with adaptive number of key words.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Learning motion patterns in crowded scenes using motion flow field.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Detecting global motion patterns in complex videos.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Geometric constraints on 2D action models for tracking human body.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Action recognition using spatio-temporal regularity based features.
Proceedings of the IEEE International Conference on Acoustics, 2008

Floor Fields for Tracking in High Density Crowd Scenes.
Proceedings of the Computer Vision, 2008

Learning 4D action feature models for arbitrary view action recognition.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Learning human actions via information maximization.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Recognizing human actions using multiple features.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Reconstructing non-stationary articulated objects in monocular video using silhouette information.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Learning object motion patterns for anomaly detection and improved object detection.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Utilizing semantic word similarity measures for video retrieval.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Collaborative microdrones: applications and research challenges.
Proceedings of the 2nd International Conference on Autonomic Computing and Communication Systems, 2008

Automated Multi-Camera Surveillance: Algorithms and Practice.
The International Series in Video Computing 10, Springer, ISBN: 978-0-387-78881-4, 2008

2007
Spatio-Temporal Regularity Flow (SPREF): Its Estimation and Applications.
IEEE Trans. Circuits Syst. Video Technol., 2007

Resolving hand over face occlusion.
Image Vis. Comput., 2007

Automated Visual Surveillance in Realistic Scenarios.
IEEE Multim., 2007

Learning, detection and representation of multi-agent events in videos.
Artif. Intell., 2007

University of Central Florida at TRECVID 2007 Semantic Video Classification and Automatic Search.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

A 3-dimensional sift descriptor and its application to action recognition.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Detecting and segmenting humans in crowded scenes.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Automatically Tuning Background Subtraction Parameters using Particle Swarm Optimization.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Foreground Segmentation in Surveillance Scenes Containing a Door.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Improving Semantic Concept Detection and Retrieval using Contextual Estimates.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

3D Model based Object Class Detection in An Arbitrary View.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Scene Modeling Using Co-Clustering.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

A Homographic Framework for the Fusion of Multi-view Silhouettes.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Chaotic Invariants for Human Action Recognition.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Trajectory Association across Non-overlapping Moving Cameras in Planar Scenes.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

On the Direct Estimation of the Fundamental Matrix.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

On the Spacetime Geometry of Galilean Cameras.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Motion and Appearance Contexts for Tracking and Re-Acquiring Targets in Aerial Videos.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Person and Vehicle Tracking in Surveillance Video.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Person Tracking in UAV Video.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

A Vision-Based System for a UGV to Handle a Road Intersection.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Video Scene Segmentation Using Markov Chain Monte Carlo.
IEEE Trans. Multim., 2006

Determining scale and sea state from water video.
IEEE Trans. Image Process., 2006

Video Compression Using Spatiotemporal Regularity Flow.
IEEE Trans. Image Process., 2006

Computer Vision for Nanoscale Imaging.
Mach. Vis. Appl., 2006

Matching actions in presence of camera motion.
Comput. Vis. Image Underst., 2006

Integrating multiple levels of zoom to enable activity analysis.
Comput. Vis. Image Underst., 2006

Self-calibration from turn-table sequences in presence of zoom and focus.
Comput. Vis. Image Underst., 2006

Object tracking: A survey.
ACM Comput. Surv., 2006

University of Central Florida at TRECVID 2006 High-Level Feature Extraction and Video Search.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Visual attention detection in video sequences using spatiotemporal cues.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Model generation for video-based object recognition.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Image Diffusion Using Saliency Bilateral Filter.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2006

Estimating Geospatial Trajectory of a Moving Camera.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Automatic Query Expansion for News Video Retrieval.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Where was the Picture Taken: Image Localization in Route Panoramas Using Epipolar Geometry.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Tracking of Human Body Joints using Anthropometry.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Multiview Approach to Tracking People in Crowded Scenes Using a Planar Homography Constraint.
Proceedings of the Computer Vision, 2006

Segmentation of Neighboring Structures by Modeling Their Interaction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

Shape from Dynamic Texture for Planes.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Multiple Vehicle Tracking in Surveillance Videos.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006

2005
Single view compositing with shadows.
Vis. Comput., 2005

Detection and representation of scenes in videos.
IEEE Trans. Multim., 2005

On the use of computable features for film classification.
IEEE Trans. Circuits Syst. Video Technol., 2005

Motion Layer Extraction in the Presence of Occlusion Using Graph Cuts.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

Bayesian Modeling of Dynamic Scenes for Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

A Noniterative Greedy Algorithm for Multiframe Point Correspondence.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

Layer-based video registration.
Mach. Vis. Appl., 2005

Motion Layer Based Object Removal in Videos.
Proceedings of the 7th IEEE Workshop on Applications of Computer Vision / IEEE Workshop on Motion and Video Computing (WACV/MOTION 2005), 2005

Adaptive Region-Based Video Registration.
Proceedings of the 7th IEEE Workshop on Applications of Computer Vision / IEEE Workshop on Motion and Video Computing (WACV/MOTION 2005), 2005

Creating Realistic Shadows of Composited Objects.
Proceedings of the 7th IEEE Workshop on Applications of Computer Vision / IEEE Workshop on Motion and Video Computing (WACV/MOTION 2005), 2005

Video Understanding and Content-Based Retrieval.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

A Framework for Intelligent Sensor Network with Video Camera for Structural Health Monitoring of Bridges.
Proceedings of the 3rd IEEE Conference on Pervasive Computing and Communications Workshops (PerCom 2005 Workshops), 2005

Determining structure in continuously recorded videos.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Tracking news stories across different sources.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Recognizing human actions.
Proceedings of the Third ACM International Workshop on Video Surveillance & Sensor Networks, 2005

Detecting group activities using rigidity of formation.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

An object-based video coding framework for video sequences obtained from static cameras.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Segmentation of Neighboring Organs in Medical Image with Model Competition.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2005

Automatic Segmentation of Home Videos.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

An integrated approach for generic object detection using kernel PCA and boosting.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Video compression using structural flow.
Proceedings of the 2005 International Conference on Image Processing, 2005

A Multi-level Framework for Video Shot Structuring.
Proceedings of the Image Analysis and Recognition, Second International Conference, 2005

A General Framework for Temporal Video Scene Segmentation.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Recognizing Human Actions in Videos Acquired by Uncalibrated Moving Cameras.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

TemporalBoost for Event Recognition.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Exploring the Space of a Human Action.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Object Tracking across Multiple Independently Moving Aerial Cameras.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

A Supervised Learning Framework for Generic Object Detection in Images.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Actions Sketch: A Novel Action Representation.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Accurate Motion Layer Segmentation and Matting.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Bayesian Object Detection in Dynamic Scenes.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Appearance Modeling for Tracking in Multiple Non-Overlapping Cameras.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Online Detection and Classification of Moving Objects Using Progressively Improving Detectors.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Camera Calibration and Light Source Estimation from Images with Shadows.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Automatic Detection of Heads in Colored Images.
Proceedings of the Second Canadian Conference on Computer and Robot Vision (CRV 2005), 2005

A Computer Vision System for Monitoring Medication Intake.
Proceedings of the Second Canadian Conference on Computer and Robot Vision (CRV 2005), 2005

Story Segmentation in News Videos Using Visual and Text Cues.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

Multiple Agent Event Detection and Representation in Videos.
Proceedings of the Proceedings, 2005

2004
Contour-Based Object Tracking with Occlusion Handling in Video Acquired Using Mobile Cameras.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Editorial.
Mach. Vis. Appl., 2004

Tri-view morphing.
Comput. Vis. Image Underst., 2004

University of Central Florida at TRECVID 2004.
Proceedings of the 2004 TREC Video Retrieval Evaluation, 2004

Conversation Detection in Feature Films Using Finite State Machines.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Multi Feature Path Modeling for Video Surveillance.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Ontology and Taxonomy Collaborated Framework for Meeting Classification.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

On the use of Anthropometry in the Invariant Analysis of Human Actions.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Compressed Spatio-temporal Descriptors for Video Matching and Retrieval.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Water video analysis.
Proceedings of the 2004 International Conference on Image Processing, 2004

Estimation of the radiometric response functions of a color camera from differently illuminated images.
Proceedings of the 2004 International Conference on Image Processing, 2004

A probabilistic framework for object recognition in video.
Proceedings of the 2004 International Conference on Image Processing, 2004

Region Completion in a Single Image.
Proceedings of the 25th Annual Conference of the European Association for Computer Graphics, 2004

Motion Layer Extraction in the Presence of Occlusion Using Graph Cut.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Integrating and Employing Multiple Levels of Zoom for Activity Recognition.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

A Framework for Semantic Classification of Scenes Using Finite State Machines.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004

CASEE: A Hierarchical Event Representation for the Analysis of Videos.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

2003
Determining driver visual attention with one camera.
IEEE Trans. Intell. Transp. Syst., 2003

Review of computer vision education.
IEEE Trans. Educ., 2003

Consistent Labeling of Tracked Objects in Multiple Cameras with Overlapping Fields of View.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Understanding human behavior from motion imagery.
Mach. Vis. Appl., 2003

Target tracking in airborne forward looking infrared imagery.
Image Vis. Comput., 2003

From Images to Video: View Morphing of Three Images.
Proceedings of the 8th International Fall Workshop on Vision, Modeling, and Visualization, 2003

University of Central Florida at TRECVID 2003.
Proceedings of the 2003 TREC Video Retrieval Evaluation, 2003

Invariance in motion analysis of videos.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Automatic Recognition of a Baby Gesture.
Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2003), 2003

KNIGHT™: a real time surveillance system for multiple and non-overlapping cameras.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Two-Frame Wide Baseline Matching.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

A Non-Iterative Greedy Algorithm for Multi-frame Point Correspondence.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

View-invariant Alignment and Matching of Video Sequences.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Tracking Across Multiple Cameras With Disjoint Views.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Scene Detection In Hollywood Movies and TV Shows.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Person-on-person Violence Detection in Moving and Stationary Camera Videos.
Proceedings of the Sixth IASTED International Conference on Computer Graphics and Imaging, 2003

2002
Flame recognition in video.
Pattern Recognit. Lett., 2002

Guest Introduction: The Changing Shape of Computer Vision in the Twenty-First Century.
Int. J. Comput. Vis., 2002

View-Invariant Representation and Recognition of Actions.
Int. J. Comput. Vis., 2002

Estimation of Rigid and Non-Rigid Facial Motion Using Anatomical Face Model.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Movie Genre Classification By Exploiting Audio-Visual Features Of Previews.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Automatic Visual Recognition of Armed Robbery.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Person-on-Person Violence Detection in Video Data.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

View Interpolation for Dynamic Scenes.
Proceedings of the 23rd Annual Conference of the European Association for Computer Graphics, 2002

Tracking and Object Classification for Automated Surveillance.
Proceedings of the Computer Vision, 2002

Estimation of Arbitrary Albedo and Shape from Shading for Symmetric Objects.
Proceedings of the British Machine Vision Conference 2002, 2002

2001
Mentoring undergraduates in computer vision research.
IEEE Trans. Educ., 2001

Monitoring human behavior from video taken in an office environment.
Image Vis. Comput., 2001

A Computer Vision Framework for Analyzing Overhead and Computer Projections from Video of Lectures.
Int. J. Comput. Their Appl., 2001

Human Tracking in Multiple Cameras.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

A Framework for Segmentation of Talk and Game Shows.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

View-Invariant Representation and Learning of Human Action.
Proceedings of the IEEE Workshop on Detection and Recognition of Events in Video, 2001

View-Invariance in Action Recognition.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Object Based Segmentation of Video Using Color, Motion and Spatial Information.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Human identification through body measurements.
Proceedings of the ISCA 16th International Conference Computers and Their Applications, 2001

2000
Monitoring Head/Eye Motion for Driver Alertness with One Camera.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Camera Handoff: Tracking in Multiple Uncalibrated Stationary Cameras.
Proceedings of the Workshop on Human Motion, 2000

A Virtual 3D Blackboard: 3D Finger Tracking Using a Single Camera.
Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2000), 2000

Autonomous Video Registration Using Sensor Model Parameter Adjustments.
Proceedings of the 29th Applied Image Pattern Recognition Workshop (AIPR 2000), 2000

Computer Vision Framework for Analyzing Projections from Video of Lectures.
Proceedings of the ISCA 9th International Conference on Intelligent Systems, 2000

1999
Shape from intensity gradient.
IEEE Trans. Syst. Man Cybern. Part A, 1999

Learning affine transformations.
Pattern Recognit., 1999

Shape from Shading: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., 1999

Toward 3-D Gesture Recognition.
Int. J. Pattern Recognit. Artif. Intell., 1999

1998
From Shape from Shading to Object Recognition.
Int. J. Pattern Recognit. Artif. Intell., 1998

Indexing Based on Algebraic Functions of Views.
Comput. Vis. Image Underst., 1998

Recognizing human actions in a static room.
Proceedings of the Proceedings Fourth IEEE Workshop on Applications of Computer Vision, 1998

Using Algebraic Functions of Views for Indexing-Based Object Recognition.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

1997
Recovering 3D Motion of Multiple Objects Using Adaptive Hough Transform.
IEEE Trans. Pattern Anal. Mach. Intell., 1997

Iterative shape recovery from multiple images.
Image Vis. Comput., 1997

1996
Motion estimation and segmentation.
Mach. Vis. Appl., 1996

Photomotion.
Comput. Vis. Image Underst., 1996

Learning affine transformations of the plane for model-based object recognition.
Proceedings of the 13th International Conference on Pattern Recognition, 1996

1995
Integration of shape from shading and stereo.
Pattern Recognit., 1995

Motion-based recognition a survey.
Image Vis. Comput., 1995

1994
Cyclic motion detection for motion based recognition.
Pattern Recognit., 1994

Shape from shading using linear approximation.
Image Vis. Comput., 1994

Estimating 3D motion and shape of multiple objects using Hough transform.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

Motion Segmentation and Estimation.
Proceedings of the Proceedings 1994 International Conference on Image Processing, 1994

Recognizing Hand Gestures.
Proceedings of the Computer Vision, 1994

Analysis of shape from shading techniques.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1994

Height recovery from intensity gradient.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1994

A survey of motion analysis from moving light displays.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1994

1993
Motion trajectories.
IEEE Trans. Syst. Man Cybern., 1993

Matching motion trajectories using scale-space.
Pattern Recognit., 1993

Edge Characterization Using Normalized Edge Detector.
CVGIP Graph. Model. Image Process., 1993

Shape from photomotion.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1993

Integration of shape from X modules: combining stereo and shading.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1993

1992
Interpretation of Motion Trajectories using Focus of Expansion.
IEEE Trans. Pattern Anal. Mach. Intell., 1992

A Fast algorithm for active contours and curvature estimation.
CVGIP Image Underst., 1992

Generation and segmentation of motion trajectories.
Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992

Recognition using motion and shape.
Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992

A fast linear shape from shading.
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1992

1991
Establishing motion correspondence.
CVGIP Image Underst., 1991

1990
Edge contours using multiple scales.
Comput. Vis. Graph. Image Process., 1990

Multi-sensor fusion: a perspective.
Proceedings of the 1990 IEEE International Conference on Robotics and Automation, 1990

Normalized edge detector.
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990

A fast algorithm for active contours.
Proceedings of the Third International Conference on Computer Vision, 1990

1989
Optimal corner detector.
Comput. Vis. Graph. Image Process., 1989

The trajectory primal sketch: a multi-scale scheme for representing motion characteristics.
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1989

1988
A VLSI architecture for computing scale space.
Comput. Vis. Graph. Image Process., 1988

1986
Pulse and staircase edge models.
Comput. Vis. Graph. Image Process., 1986

1984
Detecting time-varying corners.
Comput. Vis. Graph. Image Process., 1984


  Loading...