Girish

According to our database1, Girish authored at least 23 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Bridging Attribution and Open-Set Detection using Graph-Augmented Instance Learning in Synthetic Speech.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

DIVINE : Coordinating Multimodal Disentangled Representations for Oro-Facial Neurological Disorder Assessment.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Investigating Polyglot Speech Foundation Models for Learning Collective Emotion from Crowds.
CoRR, September, 2025

Towards Neural Audio Codec Source Parsing.
CoRR, June, 2025

Towards Machine Unlearning for Paralinguistic Speech Processing.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

HYFuse: Aligning Heterogeneous Speech Pre-Trained Representations in Hyperbolic Space for Speech Emotion Recognition.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Towards Source Attribution of Singing Voice Deepfake with Multimodal Foundation Models.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

SNIFR : Boosting Fine-Grained Child Harmful Content Detection Through Audio-Visual Alignment with Cascaded Cross-Transformer.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

PARROT: Synergizing Mamba and Attention-based SSL Pre-Trained Models via Parallel Branch Hadamard Optimal Transport for Speech Emotion Recognition.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Curved Worlds, Clear Boundaries: Generalizing Speech Deepfake Detection using Hyperbolic and Spherical Geometry Spaces.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Towards Attribution of Generators and Emotional Manipulation in Cross-Lingual Synthetic Speech using Geometric Learning.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Source Tracing of Synthetic Speech Systems Through Paralinguistic Pre-Trained Representations.
Proceedings of the 33rd European Signal Processing Conference, 2025

ARE Mamba-Based Audio Foundation Models the Best Fit for Non-Verbal Emotion Recognition?
Proceedings of the 33rd European Signal Processing Conference, 2025

Investigating Polyglot Speech Foundation Models for Learning Collective Emotion from Crowds.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

Beyond Speech and More: Investigating the Emergent Ability of Speech Pre-Trained Models for Classifying Physiological Time-Series Signals.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

Rethinking Cross-Corpus Speech Emotion Recognition Benchmarking: are Paralinguistic Pre-Trained Representations Sufficient?
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

Are Multimodal Foundation Models All That Is Needed for Emofake Detection?
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024
Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection.
CoRR, 2024

NeuRO: an application for code-switched autism detection in children.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
MLlab4CS at SemEval-2023 Task 2: Named Entity Recognition in Low-resource Language Bangla Using Multilingual Language Models.
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023


  Loading...