Pankaj Wasnik

CoRR, March, 2026

Face Time Traveller: Travel Through Ages Without Losing Identity.

CoRR, February, 2026

EW-DETR: Evolving World Object Detection via Incremental Low-Rank DEtection TRansformer.

CoRR, February, 2026

Windowed SummaryMixing: An Efficient Fine-Tuning of Self-Supervised Learning Models for Low-resource Speech Recognition.

Aditya Srinivas Menon

Raj Prakash Gohil

CoRR, February, 2026

Listen like a Teacher: Mitigating Whisper Hallucinations Using Adaptive Layer Attention and Knowledge Distillation.

Aditya Srinivas Menon

Aman Gaurav

Raj Prakash Gohil

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

In-Domain African Languages Translation Using LLMs and Multi-armed Bandits.

CoRR, May, 2025

AdaPrefix++: Integrating Adapters, Prefixes and Hypernetwork for Continual Learning.

Sayanta Adhikari

Dupati Srikar Chandra

P. K. Srijith

Naoyuki Onoe

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion.

Chowdam Venkata Kumar

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

LASPA: Language Agnostic Speaker Disentanglement with Prefix-Tuned Cross-Attention.

Aditya Srinivas Menon

Raj Prakash Gohil

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion.

Ishan D. Biyani

Nirmesh J. Shah

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization.

Raj Gothi

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Precise Event Spotting in Sports Videos: Solving Long-Range Dependency and Class Imbalance.

Sanchayan Santra

Vishal M. Chudasama

Vineeth N. Balasubramanian

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Graph-Assisted Culturally Adaptable Idiomatic Translation for Indic languages.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Enhancing Entertainment Translation for Indian Languages Using Adaptive Context, Style and LLMs.

Pratik Rakesh Singh

Mohammadi Zaki

Ashishkumar Prabhakar Gudmalwar

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion.

Ashishkumar Gudmalwar

CoRR, 2024

Beyond Few-shot Object Detection: A Detailed Survey.

Vishal M. Chudasama

Hiran Sarkar

Vineeth N. Balasubramanian

Jayateja Kalla

CoRR, 2024

DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing.

Neha Sahipjohn

CoRR, 2024

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech.

CoRR, 2024

Efficient infusion of self-supervised representations in Automatic Speech Recognition.

Darshan Prabhu

Sai Ganesh Mirishkar

CoRR, 2024

Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning.

Shivam Ratnakant Mhaskar

Nirmesh J. Shah

Mohammadi Zaki

Vineeth N. Balasubramanian

CoRR, 2024

Open-Set Object Detection By Aligning Known Class Representations.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning.

Shivam Mhaskar

Mohammadi Zaki

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing.

Neha Sahipjohn

Ashishkumar Gudmalwar

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech.

Ashishkumar Gudmalwar

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Revisiting Class Imbalance for End-to-end Semi-Supervised Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fiducial Focus Augmentation for Facial Landmark Detection.

Vineeth Balasubramanian

Proceedings of the 34th British Machine Vision Conference, 2023

2022

M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2020

Seamless Payment System Using Face And Low-Energy Bluetooth.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Smartphone Multi-modal Biometric Authentication: Database and Evaluation.

CoRR, 2019

Custom silicone Face Masks: Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection.

Proceedings of the 7th International Workshop on Biometrics and Forensics, 2019

Using Demographic Features for the Prediction of Basic Human Values Underlying Stakeholder Motivation.

Adam Szekeres

Einar Arthur Snekkenes

Proceedings of the 21st International Conference on Enterprise Information Systems, 2019

2018

Presentation Attack Detection for Smartphone Based Fingerphoto Recognition Using Second Order Local Structures.

Proceedings of the 14th International Conference on Signal-Image Technology & Internet-Based Systems, 2018

Hessian-based robust ray-tracing of implicit surfaces on GPU.

Jag Mohan Singh

Proceedings of the SIGGRAPH Asia 2018 Technical Briefs, Tokyo, Japan, December 04-07, 2018

Subjective Logic Based Score Level Fusion: Combining Faces and Fingerprints.

Proceedings of the 21st International Conference on Information Fusion, 2018

Fusion of Multi-Scale Local Phase Quantization Features for Face Presentation Attack Detection.

Sushma Venkatesh

Martin Stokkenes

Proceedings of the 21st International Conference on Information Fusion, 2018

An Empirical Evaluation of Deep Architectures on Generalization of Smartphone-based Face Image Quality Assessment.

Proceedings of the 9th IEEE International Conference on Biometrics Theory, 2018

Improved Fingerphoto Verification System Using Multi-scale Second Order Local Structures.

Martin Stokkenes

Proceedings of the 2018 International Conference of the Biometrics Special Interest Group, 2018

Fake Face Detection Methods: Can They Be Generalized?

Ali Khodabakhsh

Proceedings of the 2018 International Conference of the Biometrics Special Interest Group, 2018

2017

Assessing face image quality for smartphone based face recognition system.

Proceedings of the 5th International Workshop on Biometrics and Forensics, 2017

Robust face presentation attack detection on smartphones: An approach based on variable focus.

Proceedings of the 2017 IEEE International Joint Conference on Biometrics, 2017

Factors Influencing the Participation of Information Security Professionals in Electronic Communities of Practice.

Vivek Agrawal

Einar Arthur Snekkenes

Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, 2017

Fusing Biometric Scores Using Subjective Logic for Gait Recognition on Smartphone.

Kristina Schäfer

Proceedings of the International Conference of the Biometrics Special Interest Group, 2017

2016

Presentation Attack Detection in Face Biometric Systems Using Raw Sensor Data from Smartphones.

Proceedings of the 12th International Conference on Signal-Image Technology & Internet-Based Systems, 2016

Eye region based multibiometric fusion to mitigate the effects of body weight variations in face recognition.
