Naveen Kumar

Orcid: 0000-0003-0604-2466

Affiliations:
  • Disney Research, Glendale, CA, USA
  • University of Southern California, Signal Analysis and Interpretation Lab, Los Angeles, CA, USA (former)


According to our database1, Naveen Kumar authored at least 36 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Name Pronunciation Extraction and Reuse in Human-Robot Conversations.
Proceedings of the Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

2023
Transformer-Based Neural Augmentation of Robot Simulation Representations.
IEEE Robotics Autom. Lett., June, 2023

2021
Computational Media Intelligence: Human-Centered Machine Analysis of Media.
Proc. IEEE, 2021

RNN Based Incremental Online Spoken Language Understanding.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

2020
ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019
Incremental Online Spoken Language Understanding.
CoRR, 2019

Multimodal Representation Learning using Deep Multiset Canonical Correlation.
CoRR, 2019

Multiview Shared Subspace Learning Across Speakers and Speech Commands.
Proceedings of the Interspeech 2019, 2019

Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech.
Proceedings of the Interspeech 2019, 2019

Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Unsupervised Discovery of Character Dictionaries in Animation Movies.
IEEE Trans. Multim., 2018

Multimodal Representation of Advertisements Using Segment-level Autoencoders.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

A Deep Reinforcement Learning Framework for Identifying Funny Scenes in Movies.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2016
Online rate adjustment for adaptive random access compressed sensing of time-varying fields.
EURASIP J. Adv. Signal Process., 2016

Active Target Localization using Low-Rank Matrix Completion and Unimodal Regression.
CoRR, 2016

Novel affective features for multiscale prediction of emotion in music.
Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016

Robust Multichannel Gender Classification from Speech in Movie Audio.
Proceedings of the Interspeech 2016, 2016

Opening big in box office? Trailer content can help.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Pathological speech processing: State-of-the-art, current challenges, and future directions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A multimodal mixture-of-experts model for dynamic emotion prediction in movies.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Automatic intelligibility classification of sentence-level pathological speech.
Comput. Speech Lang., 2015

Structured sparse methods for active ocean observation systems with communication constraints.
IEEE Commun. Mag., 2015

A discriminative reliability-aware classification model with applications to intelligibility classification in pathological speech.
Proceedings of the INTERSPEECH 2015, 2015

Gender Representation in Cinematic Content: A Multimodal Approach.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Computationally deconstructing movie narratives: An informatics approach.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Affect prediction in music using boosted ensemble of filters.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Detection of Musical Event Drop from Crowdsourced Annotations Using a Noisy Channel Model.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Affective Feature Design and Predicting Continuous Affective Dimensions from Music.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Active target detection with mobile agents.
Proceedings of the IEEE International Conference on Acoustics, 2014

Fusion of diverse denoising systems for robust automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Hull detection based on largest empty sector angle with application to analysis of realtime MR images.
Proceedings of the IEEE International Conference on Acoustics, 2014

Active target detection with navigation costs: A randomized benchmark.
Proceedings of the 52nd Annual Allerton Conference on Communication, 2014

2012
Features for comparing tune similarity of songs across different languages.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Intelligibility classification of pathological speech using fusion of multiple high level descriptors.
Proceedings of the INTERSPEECH 2012, 2012

Object classification in sidescan sonar images with sparse representation techniques.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Directional descriptors using zernike moment phases for object orientation estimation in underwater sonar images.
Proceedings of the IEEE International Conference on Acoustics, 2011


  Loading...