Naveen Kumar

Prashanth Gurunath Shivakumar

Maike Paetzel-Prüsmann

Proceedings of the Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

2023

Transformer-Based Neural Augmentation of Robot Simulation Representations.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., June, 2023

2021

Computational Media Intelligence: Human-Centered Machine Analysis of Media.

[BibT_eX]

[DOI]

Proc. IEEE, 2021

RNN Based Incremental Online Spoken Language Understanding.

[BibT_eX]

[DOI]

Prashanth Gurunath Shivakumar

Shrikanth Narayanan

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

2020

ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019

Incremental Online Spoken Language Understanding.

[BibT_eX]

[DOI]

CoRR, 2019

Multimodal Representation Learning using Deep Multiset Canonical Correlation.

[BibT_eX]

[DOI]

Ruchir Travadi

CoRR, 2019

Multiview Shared Subspace Learning Across Speakers and Speech Commands.

[BibT_eX]

[DOI]

Arindam Jati

Shrikanth Narayanan

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech.

[BibT_eX]

[DOI]

Shrikanth Narayanan

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection.

[BibT_eX]

[DOI]

Arindam Jati

Ruxin Chen

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Unsupervised Discovery of Character Dictionaries in Animation Movies.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Multimodal Representation of Advertisements Using Segment-level Autoencoders.

[BibT_eX]

[DOI]

Victor R. Martinez

Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

A Deep Reinforcement Learning Framework for Identifying Funny Scenes in Movies.

[BibT_eX]

[DOI]

Haoqi Li

Ruxin Chen

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2016

Online rate adjustment for adaptive random access compressed sensing of time-varying fields.

[BibT_eX]

[DOI]

Fatemeh Fazel

Milica Stojanovic

EURASIP J. Adv. Signal Process., 2016

Active Target Localization using Low-Rank Matrix Completion and Unimodal Regression.

[BibT_eX]

[DOI]

CoRR, 2016

Novel affective features for multiscale prediction of emotion in music.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016

Robust Multichannel Gender Classification from Speech in Movie Audio.

[BibT_eX]

[DOI]

Md. Nasir

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Opening big in box office? Trailer content can help.

[BibT_eX]

[DOI]

Adarsh Tadimari

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Pathological speech processing: State-of-the-art, current challenges, and future directions.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A multimodal mixture-of-experts model for dynamic emotion prediction in movies.

[BibT_eX]

[DOI]

Ankit Goyal

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Automatic intelligibility classification of sentence-level pathological speech.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2015

Structured sparse methods for active ocean observation systems with communication constraints.

[BibT_eX]

[DOI]

Milica Stojanovic

Gaurav S. Sukhatme

IEEE Commun. Mag., 2015

A discriminative reliability-aware classification model with applications to intelligibility classification in pathological speech.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Gender Representation in Cinematic Content: A Multimodal Approach.

[BibT_eX]

[DOI]

Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Computationally deconstructing movie narratives: An informatics approach.

[BibT_eX]

[DOI]

Stacy L. Smith

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Affect prediction in music using boosted ensemble of filters.

[BibT_eX]

[DOI]

Rahul Gupta

Proceedings of the 23rd European Signal Processing Conference, 2015

2014

Detection of Musical Event Drop from Crowdsourced Annotations Using a Noisy Channel Model.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Affective Feature Design and Predicting Continuous Affective Dimensions from Music.

[BibT_eX]

[DOI]

Maarten Van Segbroeck

Jangwon Kim

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Active target detection with mobile agents.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Fusion of diverse denoising systems for robust automatic speech recognition.

[BibT_eX]

[DOI]

Maarten Van Segbroeck

Kartik Audhkhasi

Peter Drotár

Proceedings of the IEEE International Conference on Acoustics, 2014

Hull detection based on largest empty sector angle with application to analysis of realtime MR images.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Active target detection with navigation costs: A randomized benchmark.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual Allerton Conference on Communication, 2014

2012

Features for comparing tune similarity of songs across different languages.

[BibT_eX]

[DOI]

Andreas Tsiartas

Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Intelligibility classification of pathological speech using fusion of multiple high level descriptors.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Object classification in sidescan sonar images with sparse representation techniques.

[BibT_eX]

[DOI]

Qun Feng Tan

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Directional descriptors using zernike moment phases for object orientation estimation in underwater sonar images.

[BibT_eX]

[DOI]