Niluthpol Chowdhury Mithun

Abhinav Rajvanshi

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025

Efficient Domain-Adaptive Multi-Task Dense Prediction with Vision Foundation Models.

[BibT_eX]

[DOI]

Beomseok Kang

Mikhail Sizintsev

CoRR, September, 2025

Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis.

[BibT_eX]

[DOI]

CoRR, April, 2025

Graph2Nav: 3D Object-Relation Graph Generation to Robot Navigation.

[BibT_eX]

[DOI]

Tixiao Shan

Abhinav Rajvanshi

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

2024

Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement.

[BibT_eX]

[DOI]

Xingchen Zhao

Abhinav Rajvanshi

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023

Cross-View Visual Geo-Localization for Outdoor Augmented Reality.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference Virtual Reality and 3D User Interfaces, 2023

C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation.

[BibT_eX]

[DOI]

Nazmul Karim

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

SIGNAV: Semantically-Informed GPS-Denied Navigation and Mapping in Visually-Degraded Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Striking the Right Balance: Recall Loss for Semantic Segmentation.

[BibT_eX]

[DOI]

Junjiao Tian

Zsolt Kira

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

GraphMapper: Efficient Visual Navigation by Scene Graph Generation.

[BibT_eX]

[DOI]

Proceedings of the 26th International Conference on Pattern Recognition, 2022

Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments.

[BibT_eX]

[DOI]

Muhammad Zubair Irshad

Proceedings of the 26th International Conference on Pattern Recognition, 2022

Text-Based Temporal Localization of Novel Events.

[BibT_eX]

[DOI]

Sudipta Paul

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Long-Range Augmented Reality with Dynamic Occlusion Rendering.

[BibT_eX]

[DOI]

Mikhail Sizintsev

IEEE Trans. Vis. Comput. Graph., 2021

Text-Based Localization of Moments in a Video Corpus.

[BibT_eX]

[DOI]

Sudipta Paul

IEEE Trans. Image Process., 2021

SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments.

[BibT_eX]

[DOI]

Muhammad Zubair Irshad

CoRR, 2021

MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation.

[BibT_eX]

[DOI]

Kowshik Thopalli

CoRR, 2021

MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation.

[BibT_eX]

[DOI]

Kowshik Thopalli

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

2020

Construction of Diverse Image Datasets From Web Collections With Limited Labeling.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Webly Supervised Image-Text Embedding with Noisy Tag Refinement.

[BibT_eX]

[DOI]

Ravdeep Pasricha

Evangelos E. Papalexakis

Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019

Learning Robust Visual-Semantic Retrieval Models with Limited Supervision

[BibT_eX]

[DOI]

PhD thesis, 2019

Joint embeddings with multimodal cues for video-text retrieval.

[BibT_eX]

[DOI]

Juncheng Li

Florian Metze

Int. J. Multim. Inf. Retr., 2019

Weakly Supervised Video Moment Retrieval From Text Queries.

[BibT_eX]

[DOI]

Sujoy Paul

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Skip Connection Architecture for Localization of Image Manipulations.

[BibT_eX]

[DOI]

Ghazal Mazaheri

Jawadul H. Bappy

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018

Learning Long-Term Invariant Features for Vision-Based Localization.

[BibT_eX]

[DOI]

Cody Simons

Robert Casey

Stefan Hilligardt

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

UCR-VCG @ TRECVID 2018: Video to Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval.

[BibT_eX]

[DOI]

Evangelos E. Papalexakis

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval.

[BibT_eX]

[DOI]

Juncheng Li

Florian Metze

Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018

ODDS: real-time object detection using depth sensors on embedded GPUs.

[BibT_eX]

[DOI]

Sirajum Munir

Karen Guo

Charles Shelton

Proceedings of the 17th ACM/IEEE International Conference on Information Processing in Sensor Networks, 2018

Deep Learning Based Identity Verification in Renaissance Portraits.

[BibT_eX]

[DOI]

Akash Gupta

Conrad Rudolph

Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

2017

Diversity-Aware Multi-Video Summarization.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

CMU-UCR-BOSCH @ TRECVID 2017: VIDEO TO TEXT RETRIEVAL.

[BibT_eX]

[DOI]

Juncheng B. Li

Florian Metze

Samarjit Das

Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

2016

Video-based tracking of vehicles using multiple time-spatial images.

[BibT_eX]

[DOI]

Tamanna Howlader

S. M. Mahbubur Rahman

Expert Syst. Appl., 2016

Generating Diverse Image Datasets with Limited Labeling.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

OSNI: Searching for Needles in a Haystack of Social Network Data.

[BibT_eX]

[DOI]

Dorian Jean Perkins

Moloud Shahbazi

Vassilis J. Tsotras

Proceedings of the 19th International Conference on Extending Database Technology, 2016

2012

Detection and Classification of Vehicles From Video Using Multiple Time-Spatial Images.

[BibT_eX]

[DOI]