Malcolm Slaney

According to our database1, Malcolm Slaney authored at least 90 papers between 1994 and 2018.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Awards

IEEE Fellow

IEEE Fellow 2010, "For contributions to perceptual signal processing and tomographic imaging".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2018
Decoding the auditory brain with canonical component analysis.
NeuroImage, 2018

Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards mobile gaze-directed beamforming: a novel neuro-technology for hearing loss.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

2017
Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers.
CoRR, 2017

CNN architectures for large-scale audio classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
CNN Architectures for Large-Scale Audio Classification.
CoRR, 2016

2015
A Study of Multimodal Addressee Detection in Human-Human-Computer Interaction.
IEEE Trans. Multimedia, 2015

Multimodal addressee detection in multiparty dialogue systems.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Probabilistic features for connecting eye gaze to spoken language understanding.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Artificial neural network features for speaker diarization.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Eye gaze for understanding conversational speech.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The influence of pitch and noise on the discriminability of filterbank features.
Proceedings of the INTERSPEECH 2014, 2014

Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks.
Proceedings of the INTERSPEECH 2014, 2014

The Relation of Eye Gaze and Face Pose: Potential Impact on Speech Recognition.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Eye Gaze for Spoken Language Understanding in Multi-modal Conversational Interactions.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Gaze-enhanced speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Introduction to the special section on the 20th anniversary of the ACM international conference on multimedia.
TOMCCAP, 2013

Micro Stories and Mega Stories.
IEEE MultiMedia, 2013

Data driven suppression rule for speech enhancement.
Proceedings of the 2013 Information Theory and Applications Workshop, 2013

QBT-Extended: An Annotated Dataset of Melodically Contoured Tapped Queries.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Pitch-gesture modeling using subband autocorrelation change detection.
Proceedings of the INTERSPEECH 2013, 2013

Characteristic contours of syllabic-level units in laughter.
Proceedings of the INTERSPEECH 2013, 2013

2012
Optimal Parameters for Locality-Sensitive Hashing.
Proceedings of the IEEE, 2012

Web-Scale Multimedia Processing and Applications [Scanning the Issue].
Proceedings of the IEEE, 2012

Don't Click Here.
IEEE MultiMedia, 2012

Tell Me a Story.
IEEE MultiMedia, 2012

Collaborative Filtering and the Missing at Random Assumption
CoRR, 2012

Coulda, woulda, shoulda: 20 years of multimedia opportunities.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Learning Sparse Feature Representations for Music Annotation and Retrieval.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

A model of attention-driven scene analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Audio and Acoustic Signal Processing [In the Spotlight].
IEEE Signal Process. Mag., 2011

Academia Meets Industry at the Multimedia Grand Challenge.
IEEE MultiMedia, 2011

Precision-Recall Is Wrong for Multimedia.
IEEE MultiMedia, 2011

Web-Scale Multimedia Analysis: Does Content Matter?
IEEE MultiMedia, 2011

Identifying authoritative sources of multimedia content: mining specificity and expertise from large-scale multimedia databases.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

A Classification-Based Polyphonic Piano Transcription Approach Using Learned Feature Representations.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Recommender Systems, Missing Data and Statistical Model Estimation.
Proceedings of the IJCAI 2011, 2011

Using gaze patterns to study and predict reading struggles due to distraction.
Proceedings of the International Conference on Human Factors in Computing Systems, 2011

2010
Solving Demodulation as an Optimization Problem.
IEEE Trans. Audio, Speech & Language Processing, 2010

Scalable Audio-Content Analysis.
EURASIP J. Audio, Speech and Music Processing, 2010

Processing web-scale multimedia data.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Image classification using the web graph.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Multimodal retrieval and ranking: more than waveforms.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

The information content of demodulated speech.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Unsupervised image ranking.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

Periodicity Detection and Localization using Spike Timing from the AER EAR.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Reconciliation of human and machine speech recognition performance.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio.
IEEE Trans. Audio, Speech & Language Processing, 2008

Analysis of Minimum Distances in High-Dimensional Musical Spaces.
IEEE Trans. Audio, Speech & Language Processing, 2008

Content-Based Music Information Retrieval: Current Directions and Future Challenges.
Proceedings of the IEEE, 2008

Resolving tag ambiguity.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Learning a Metric for Music Similarity.
Proceedings of the ISMIR 2008, 2008

Comparing Local Feature Descriptors in pLSA-Based Image Models.
Proceedings of the Pattern Recognition, 2008

Continuous visual vocabulary modelsfor pLSA-based scene recognition.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

2007
Collaborative Filtering and the Missing at Random Assumption.
Proceedings of the UAI 2007, 2007

Similarity Based on Rating Data.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A Unified System for Chord Transcription and Key Extraction Using Hidden Markov Models.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

PLSA on Large Scale Image Databases.
Proceedings of the IEEE International Conference on Acoustics, 2007

Fast Recognition of Remixed Music Audio.
Proceedings of the IEEE International Conference on Acoustics, 2007

Varying Time Constants and Gain Adaptation in Feature Extraction for Speech Processing.
Proceedings of the IEEE International Conference on Acoustics, 2007

Image retrieval on large-scale image databases.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
Discrimination of speech from nonspeech based on multiscale spectro-temporal Modulations.
IEEE Trans. Audio, Speech & Language Processing, 2006

Automatic Chord Recognition from Audio Using a HMM with Supervised Learning.
Proceedings of the ISMIR 2006, 2006

Song Intersection by Approximate Nearest Neighbor Search.
Proceedings of the ISMIR 2006, 2006

A statistical model of timbre perception.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

The Importance of Sequences in Musical Similarity.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Being Literate with Large Document Collections: Observational Studies and Cost Structure Tradeoffs.
Proceedings of the 39th Hawaii International International Conference on Systems Science (HICSS-39 2006), 2006

2005
A timbre space for speech.
Proceedings of the INTERSPEECH 2005, 2005

Analytic Worksheets: A Framework to Support Human Analysis of Large Streaming Data Volumes.
Proceedings of the Human-Computer Interaction, 2005

Measuring Information Understanding in Large Document Collections.
Proceedings of the 38th Hawaii International Conference on System Sciences (HICSS-38 2005), 2005

2004
Low-power audio classification for ubiquitous sensor networks.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Speech discrimination based on multiscale spectro-temporal modulations.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
BabyEars: A recognition system for affective vocalizations.
Speech Communication, 2003

Modeling Multitasking Users.
Proceedings of the User Modeling 2003, 2003

2002
Mixtures of probability experts for audio retrieval and indexing.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Semantic-audio retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Multimedia edges: finding hierarchy in all dimensions.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Hierarchical segmentation using latent semantic indexing in scale space.
Proceedings of the IEEE International Conference on Acoustics, 2001

FastMPEG: time-scale modification of bit-compressed audio information.
Proceedings of the IEEE International Conference on Acoustics, 2001

Temporal Events in All Dimensions and Scales.
Proceedings of the IEEE Workshop on Detection and Recognition of Events in Video, 2001

2000
FaceSync: A Linear Operator for Measuring Synchronization of Video Facial Images and Audio Tracks.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

1998
Baby Ears: a recognition system for affective vocalizations.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

MACH1: nonuniform time-scale modification of speech.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Video Rewrite: driving visual speech with audio.
Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, 1997

Construction and evaluation of a robust multifeature speech/music discriminator.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Video rewrite: visual speech synthesis from video.
Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, 1997

1996
Automatic audio morphing.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1994
Pattern Playback in the 90s.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994

Auditory model inversion for sound separation.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994


  Loading...