Orchid Chetia Phukan

Orcid: 0000-0002-2542-8084

According to our database1, Orchid Chetia Phukan authored at least 41 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Towards Neural Audio Codec Source Parsing.
CoRR, June, 2025

SNIFR : Boosting Fine-Grained Child Harmful Content Detection Through Audio-Visual Alignment with Cascaded Cross-Transformer.
CoRR, June, 2025

Towards Source Attribution of Singing Voice Deepfake with Multimodal Foundation Models.
CoRR, June, 2025

Are Mamba-based Audio Foundation Models the Best Fit for Non-Verbal Emotion Recognition?
CoRR, June, 2025

Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction.
CoRR, June, 2025

Towards Machine Unlearning for Paralinguistic Speech Processing.
CoRR, June, 2025

Source Tracing of Synthetic Speech Systems Through Paralinguistic Pre-Trained Representations.
CoRR, June, 2025

Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism.
CoRR, June, 2025

PARROT: Synergizing Mamba and Attention-based SSL Pre-Trained Models via Parallel Branch Hadamard Optimal Transport for Speech Emotion Recognition.
CoRR, June, 2025

Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Attention based hybrid deep learning model for wearable based stress recognition.
Eng. Appl. Artif. Intell., January, 2024

Hybrid deep learning model for wearable sensor-based stress recognition for Internet of Medical Things (IoMT) system.
Int. J. Commun. Syst., 2024

Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks.
CoRR, 2024

SeQuiFi: Mitigating Catastrophic Forgetting in Speech Emotion Recognition with Sequential Class-Finetuning.
CoRR, 2024

Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection.
CoRR, 2024

Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection.
CoRR, 2024

Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models.
CoRR, 2024

How Paralingual are Paralinguistic Representations? A Case Study in Speech Emotion Recognition.
CoRR, 2024

A Lightweight Feature Fusion Architecture For Resource-Constrained Crowd Counting.
CoRR, 2024

Heterogeneity over Homogeneity: Investigating Multilingual Speech Pre-Trained Models for Detecting Audio Deepfake.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

AVR: synergizing foundation models for audio-visual humor detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Towards Multilingual Audio-Visual Question Answering.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Are Paralinguistic Representations all that is needed for Speech Emotion Recognition?
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

ComFeAT: combination of neural and spectral features for improved depression detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

VoxMed: one-step respiratory disease classifier using digital stethoscope sounds.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

PERSONA: an application for emotion recognition, gender recognition and age estimation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

The reasonable effectiveness of speaker embeddings for violence detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

ASGIR: audio spectrogram transformer guided classification and information retrieval for birds.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

NeuRO: an application for code-switched autism detection in children.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

SONIC: Synergizing Vision Foundation Models for Stress Recognition from ECG Signals.
Proceedings of the 32nd European Signal Processing Conference, 2024

Whispers of Trauma: Leveraging Social Media for Assessing Mental Health in Victims of Childhood Sexual Abuse.
Proceedings of the Social Networks Analysis and Mining - 16th International Conference, 2024

2023
Stress recognition with multi-modal sensing using bootstrapped ensemble deep learning model.
Expert Syst. J. Knowl. Eng., July, 2023

From Simulations to Reality: Enhancing Multi-Robot Exploration for Urban Search and Rescue.
CoRR, 2023

Trauma lurking in the shadows: A Reddit case study of mental health issues in online posts about Childhood Sexual Abuse.
CoRR, 2023

Roulette-Wheel Selection-Based PSO Algorithm for Solving the Vehicle Routing Problem with Time Windows.
CoRR, 2023

A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition.
CoRR, 2023

"Can We Detect Substance Use Disorder?": Knowledge and Time Aware Classification on Social Media from Darkweb.
CoRR, 2023

Transforming the Embeddings: A Lightweight Technique for Speech Emotion Recognition Tasks.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2023

2022
CNN-LSTM Based Stress Recognition Using Wearables.
Proceedings of the Joint Proceedings of the Second International Workshop on Multilingual Semantic Web and Second International Workshop on Deep Learning for Question Answering and First International Workshop on Semantic Reasoning and Representation in IoT, 2022


  Loading...