Krishna Somandepalli

Oliver Siy

Brendan Jou

Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2024

2023

A study of bias mitigation strategies for speaker recognition.

[BibT_eX]

[DOI]

Raghuveer Peri

Comput. Speech Lang., April, 2023

Cross Modal Video Representations for Weakly Supervised Active Speaker Localization.

[BibT_eX]

[DOI]

Rahul Sharma

IEEE Trans. Multim., 2023

MovieCLIP: Visual Scene Recognition in Movies.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

MM-AU: Towards Multimodal Understanding of Advertisement Videos.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

LanSER: Language-Model Supported Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Heterogeneous Graph Learning for Acoustic Event Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

A Dataset for Audio-Visual Sound Event Detection in Movies.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Contextually-Rich Human Affect Perception Using Multimodal Scene Information.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Robust Character Labeling in Movie Videos: Data Resources and Self-Supervised Feature Adaptation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Self-Supervised Graphs for Audio Representation Learning With Limited Labeled Data.

[BibT_eX]

[DOI]

Amir Shirian

Tanaya Guha

IEEE J. Sel. Top. Signal Process., 2022

Studying Large-Scale Behavioral Differences in Auschwitz-Birkenau with Simulation of Gendered Narratives.

[BibT_eX]

[DOI]

Digit. Humanit. Q., 2022

Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers.

[BibT_eX]

[DOI]

CoRR, 2022

Visually-aware Acoustic Event Detection using Heterogeneous Graphs.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Federated Learning for Affective Computing Tasks.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, 2022

2021

Generalized Multiview Shared Subspace Learning Using View Bootstrapping.

[BibT_eX]

[DOI]

IEEE Trans. Signal Process., 2021

Computational Media Intelligence: Human-Centered Machine Analysis of Media.

[BibT_eX]

[DOI]

Proc. IEEE, 2021

Understanding of Emotion Perception from Art.

[BibT_eX]

[DOI]

CoRR, 2021

Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis.

[BibT_eX]

[DOI]

Sabyasachee Baruah

CoRR, 2021

Loss Function Approaches for Multi-label Music Tagging.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Content-Based Multimedia Indexing, 2021

A Computational Tool to Study Vocal Participation of Women in UN-ITU Meetings.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Content-Based Multimedia Indexing, 2021

2020

Multi-Face: Self-supervised Multiview Adaptation for Robust Face Clustering in Videos.

[BibT_eX]

[DOI]

CoRR, 2020

Victim or Perpetrator? Analysis of Violent Characters Portrayals from Movie Scripts.

[BibT_eX]

[DOI]

CoRR, 2020

Generalized Multi-view Shared Subspace Learning using View Bootstrapping.

[BibT_eX]

[DOI]

CoRR, 2020

Crossmodal learning for audio-visual speech event localization.

[BibT_eX]

[DOI]

Rahul Sharma

CoRR, 2020

An Empirical Analysis of Information Encoded in Disentangled Neural Speaker Representations.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

MediaEval 2020 Emotion and Theme Recognition in Music Task: Loss Function Approaches for Multi-label Music Tagging.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2020 Workshop, 2020

Robust Speaker Recognition Using Unsupervised Adversarial Invariance.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Vocal Tract Articulatory Contour Detection in Real-Time Magnetic Resonance Images Using Spatio-Temporal Context.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Joint Estimation and Analysis of Risk Behavior Ratings in Movie Scripts.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

Multimodal Representation Learning using Deep Multiset Canonical Correlation.

[BibT_eX]

[DOI]

Ruchir Travadi

CoRR, 2019

Multiview Shared Subspace Learning Across Speakers and Speech Commands.

[BibT_eX]

[DOI]

Arindam Jati

Panayiotis G. Georgiou

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Identifying Therapist and Client Personae for Therapeutic Alliance Estimation.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Toward Visual Voice Activity Detection for Unconstrained Videos.

[BibT_eX]

[DOI]

Rahul Sharma

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Reinforcing Self-expressive Representation with Constraint Propagation for Face Clustering in Movies.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Speaker Agnostic Foreground Speech Detection from Audio Recordings in Workplace Settings from Wearable Recorders.

[BibT_eX]

[DOI]

Amrutha Nadarajan

Proceedings of the IEEE International Conference on Acoustics, 2019

Robust Speech Activity Detection in Movie Audio: Data Resources and Experimental Evaluation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Violence Rating Prediction from Movie Scripts.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Unsupervised Discovery of Character Dictionaries in Animation Movies.

[BibT_eX]

[DOI]

Tanaya Guha

IEEE Trans. Multim., 2018

Improving Gender Identification in Movie Audio Using Cross-Domain Data.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Multimodal Representation of Advertisements Using Segment-level Autoencoders.

[BibT_eX]

[DOI]

Victor R. Martinez

Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

2017

The Neural Correlates of Emotional Lability in Children with Autism Spectrum Disorder.

[BibT_eX]

[DOI]

Brain Connect., 2017

Semantic Edge Detection for Tracking Vocal Tract Air-Tissue Boundaries in Real-Time Magnetic Resonance Images.

[BibT_eX]

[DOI]

Asterios Toutios

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Online Affect Tracking with Multimodal Kalman Filters.

[BibT_eX]

[DOI]

Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, 2016

Articulatory Synthesis Based on Real-Time Magnetic Resonance Imaging Data.

[BibT_eX]

[DOI]