Niluthpol Chowdhury Mithun

Orcid: 0000-0003-3611-4141

According to our database, Niluthpol Chowdhury Mithun authored at least 31 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Cross-View Visual Geo-Localization for Outdoor Augmented Reality.
Proceedings of the IEEE Conference Virtual Reality and 3D User Interfaces, 2023

C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
SIGNAV: Semantically-Informed GPS-Denied Navigation and Mapping in Visually-Degraded Environments.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Striking the Right Balance: Recall Loss for Semantic Segmentation.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

GraphMapper: Efficient Visual Navigation by Scene Graph Generation.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Text-Based Temporal Localization of Novel Events.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Long-Range Augmented Reality with Dynamic Occlusion Rendering.
IEEE Trans. Vis. Comput. Graph., 2021

Text-Based Localization of Moments in a Video Corpus.
IEEE Trans. Image Process., 2021

SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments.
CoRR, 2021

MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation.
CoRR, 2021

MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

2020
Construction of Diverse Image Datasets From Web Collections With Limited Labeling.
IEEE Trans. Circuits Syst. Video Technol., 2020

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Webly Supervised Image-Text Embedding with Noisy Tag Refinement.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
Joint embeddings with multimodal cues for video-text retrieval.
Int. J. Multim. Inf. Retr., 2019

Weakly Supervised Video Moment Retrieval From Text Queries.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Skip Connection Architecture for Localization of Image Manipulations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Learning Long-Term Invariant Features for Vision-Based Localization.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

UCR-VCG @ TRECVID 2018: Video to Text Retrieval.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval.
Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018

ODDS: real-time object detection using depth sensors on embedded GPUs.
Proceedings of the 17th ACM/IEEE International Conference on Information Processing in Sensor Networks, 2018

Deep Learning Based Identity Verification in Renaissance Portraits.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

2017
Diversity-Aware Multi-Video Summarization.
IEEE Trans. Image Process., 2017

CMU-UCR-BOSCH @ TRECVID 2017: Video to Text Retrieval.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

2016
Video-based tracking of vehicles using multiple time-spatial images.
Expert Syst. Appl., 2016

Generating Diverse Image Datasets with Limited Labeling.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

OSNI: Searching for Needles in a Haystack of Social Network Data.
Proceedings of the 19th International Conference on Extending Database Technology, 2016

2012
Detection and Classification of Vehicles From Video Using Multiple Time-Spatial Images.
IEEE Trans. Intell. Transp. Syst., 2012

