Pankaj Wasnik

According to our database1, Pankaj Wasnik authored at least 48 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech?
CoRR, March, 2026

Face Time Traveller : Travel Through Ages Without Losing Identity.
CoRR, February, 2026

EW-DETR: Evolving World Object Detection via Incremental Low-Rank DEtection TRansformer.
CoRR, February, 2026

Windowed SummaryMixing: An Efficient Fine-Tuning of Self-Supervised Learning Models for Low-resource Speech Recognition.
CoRR, February, 2026

Listen like a Teacher: Mitigating Whisper Hallucinations Using Adaptive Layer Attention and Knowledge Distillation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
In-Domain African Languages Translation Using LLMs and Multi-armed Bandits.
CoRR, May, 2025

AdaPrefix++: Integrating Adapters, Prefixes and Hypernetwork for Continual Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

LASPA: Language Agnostic Speaker Disentanglement with Prefix-Tuned Cross-Attention.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Precise Event Spotting in Sports Videos: Solving Long-Range Dependency and Class Imbalance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Graph-Assisted Culturally Adaptable Idiomatic Translation for Indic languages.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Enhancing Entertainment Translation for Indian Languages Using Adaptive Context, Style and LLMs.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion.
CoRR, 2024

Beyond Few-shot Object Detection: A Detailed Survey.
CoRR, 2024

DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing.
CoRR, 2024

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech.
CoRR, 2024

Efficient infusion of self-supervised representations in Automatic Speech Recognition.
CoRR, 2024

Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning.
CoRR, 2024

Open-Set Object Detection By Aligning Known Class Representations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Revisiting Class Imbalance for End-to-end Semi-Supervised Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fiducial Focus Augmentation for Facial Landmark Detection.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2020
Seamless Payment System Using Face And Low-Energy Bluetooth.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Smartphone Multi-modal Biometric Authentication: Database and Evaluation.
CoRR, 2019

Custom silicone Face Masks: Vulnerability of Commercial Face Recognition Systems & Presentation Attack Detection.
Proceedings of the 7th International Workshop on Biometrics and Forensics, 2019

Using Demographic Features for the Prediction of Basic Human Values Underlying Stakeholder Motivation.
Proceedings of the 21st International Conference on Enterprise Information Systems, 2019

2018
Presentation Attack Detection for Smartphone Based Fingerphoto Recognition Using Second Order Local Structures.
Proceedings of the 14th International Conference on Signal-Image Technology & Internet-Based Systems, 2018

Hessian-based robust ray-tracing of implicit surfaces on GPU.
Proceedings of the SIGGRAPH Asia 2018 Technical Briefs, Tokyo, Japan, December 04-07, 2018, 2018

Subjective Logic Based Score Level Fusion: Combining Faces and Fingerprints.
Proceedings of the 21st International Conference on Information Fusion, 2018

Fusion of Multi-Scale Local Phase Quantization Features for Face Presentation Attack Detection.
Proceedings of the 21st International Conference on Information Fusion, 2018

An Empirical Evaluation of Deep Architectures on Generalization of Smartphone-based Face Image Quality Assessment.
Proceedings of the 9th IEEE International Conference on Biometrics Theory, 2018

Improved Fingerphoto Verification System Using Multi-scale Second Order Local Structures.
Proceedings of the 2018 International Conference of the Biometrics Special Interest Group, 2018

Fake Face Detection Methods: Can They Be Generalized?
Proceedings of the 2018 International Conference of the Biometrics Special Interest Group, 2018

2017
Assessing face image quality for smartphone based face recognition system.
Proceedings of the 5th International Workshop on Biometrics and Forensics, 2017

Robust face presentation attack detection on smartphones : An approach based on variable focus.
Proceedings of the 2017 IEEE International Joint Conference on Biometrics, 2017

Factors Influencing the Participation of Information Security Professionals in Electronic Communities of Practice.
Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, 2017

Fusing Biometric Scores Using Subjective Logic for Gait Recognition on Smartphone.
Proceedings of the International Conference of the Biometrics Special Interest Group, 2017

2016
Presentation Attack Detection in Face Biometric Systems Using Raw Sensor Data from Smartphones.
Proceedings of the 12th International Conference on Signal-Image Technology & Internet-Based Systems, 2016

Eye region based multibiometric fusion to mitigate the effects of body weight variations in face recognition.
Proceedings of the 19th International Conference on Information Fusion, 2016


  Loading...