Daniel P. W. Ellis

According to our database1, Daniel P. W. Ellis authored at least 164 papers between 1990 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepage:

On csauthors.net:

Bibliography

2019
Learning Sound Event Classifiers from Web Audio with Noisy Labels.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies.
Proceedings of the Interspeech 2018, 2018

Unsupervised Learning of Semantic Audio Representations.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Large-scale audio event discovery in one million YouTube videos.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

CNN architectures for large-scale audio classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Audio Set: An ontology and human-labeled dataset for audio events.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Extracting Ground-Truth Information from MIDI Files: A MIDIfesto.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Pruning subsequence search with attention-based embedding.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Optimizing DTW-based audio-to-MIDI alignment and matching.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Exploring Low Cost Laser Sensors to Identify Flying Insect Species - Evaluation of Machine Learning and Signal Processing Methods.
Journal of Intelligent and Robotic Systems, 2015

Large-Scale Content-Based Matching of MIDI and Audio Files.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Content-Aware Collaborative Music Recommendation Using Pre-trained Neural Networks.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Micbots: Collecting large realistic datasets for speech and audio research using mobile robots.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Melody Extraction from Polyphonic Music Signals: Approaches, applications, and challenges.
IEEE Signal Process. Mag., 2014

MIR_EVAL: A Transparent Implementation of Common MIR Metrics.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Analyzing Song Structure with Spectral Clustering.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Codebook-based Scalable Music Tagging with Poisson Matrix Factorization.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Detecting proximity from personal audio recordings.
Proceedings of the INTERSPEECH 2014, 2014

Speech enhancement by low-rank and convolutive dictionary spectrogram decomposition.
Proceedings of the INTERSPEECH 2014, 2014

Estimating timing and channel distortion across related signals.
Proceedings of the IEEE International Conference on Acoustics, 2014

Leveraging repetition for improved automatic lyric transcription in popular music.
Proceedings of the IEEE International Conference on Acoustics, 2014

Learning to segment songs with ordinal linear discriminant analysis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Better beat tracking through robust onset aggregation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Speech decoloration based on the product-of-filters model.
Proceedings of the IEEE International Conference on Acoustics, 2014

Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Music-Content-Adaptive Robust Principal Component Analysis for a Semantically Consistent Separation of Foreground and Background in Music Audio Signals.
Proceedings of the 17th International Conference on Digital Audio Effects, 2014

2013
Modeling nonlinear circuits with linearized dynamical models via kernel regression.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Speech enhancement by sparse, low-rank, and dictionary spectrogram decomposition.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

A Video Compression-Based Approach to Measure Music Structural Similarity.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Beta Process Sparse Nonnegative Matrix Factorization for Music.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

All for one: feature combination for highly channel-degraded speech activity detection.
Proceedings of the INTERSPEECH 2013, 2013

Applying Machine Learning and Audio Analysis Techniques to Insect Recognition in Intelligent Traps.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Subband autocorrelation features for video soundtrack classification.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Data-driven voice source waveform analysis and synthesis.
Speech Communication, 2012

The million song dataset challenge.
Proceedings of the 21st World Wide Web Conference, 2012

AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Making a scene: alignment of complete sets of clips based on pairwise audio match.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Inharmonic speech: a tool for the study of speech perception and separation.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Noise Robust Pitch Tracking by Subband Autocorrelation Classification.
Proceedings of the INTERSPEECH 2012, 2012

2011
Combining localization cues and source model constraints for binaural source separation.
Speech Communication, 2011

Introduction to the Special Issue on Music Signal Processing.
J. Sel. Topics Signal Processing, 2011

Signal Processing for Music Analysis.
J. Sel. Topics Signal Processing, 2011

Transcribing Multi-Instrument Polyphonic Music With Hierarchical Eigeninstruments.
J. Sel. Topics Signal Processing, 2011

General chair's introduction.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Spectral vs. spectro-temporal features for acoustic event detection.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Large-scale cover song recognition using hashed chroma landmarks.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Consumer video understanding: a benchmark database and an evaluation of human and machine performance.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

The Million Song Dataset.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors.
Proceedings of the INTERSPEECH 2011, 2011

Direct processing of mpeg audio using companding and BFP techniques.
Proceedings of the IEEE International Conference on Acoustics, 2011

Classifying soundtracks with audio texture features.
Proceedings of the IEEE International Conference on Acoustics, 2011

Soundtrack classification by transient events.
Proceedings of the IEEE International Conference on Acoustics, 2011

Evaluating music sequence models through missing data.
Proceedings of the IEEE International Conference on Acoustics, 2011

Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition.
Wiley, ISBN: 978-0-470-19536-9, 2011

2010
Audio-visual atoms for generic video concept classification.
TOMCCAP, 2010

Model-Based Expectation-Maximization Source Separation and Localization.
IEEE Trans. Audio, Speech & Language Processing, 2010

Evaluating Source Separation Algorithms With Reverberant Speech.
IEEE Trans. Audio, Speech & Language Processing, 2010

Audio-Based Semantic Concept Classification for Consumer Video.
IEEE Trans. Audio, Speech & Language Processing, 2010

Speech separation using speaker-adapted eigenvoice speech models.
Computer Speech & Language, 2010

Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

A Probabilistic Subspace Model for Multi-instrument Polyphonic Transcription.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Clustering Beat-Chroma Patterns in a Large Music Database.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Cover song detection: From high scores to general classification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Detecting local semantic concepts in environmental sounds using Markov model based clustering.
Proceedings of the IEEE International Conference on Acoustics, 2010

Audio fingerprinting to identify multiple videos of an event.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Quantitative Analysis of a Common Audio Similarity Measure.
IEEE Trans. Audio, Speech & Language Processing, 2009

Guided harmonic sinusoid estimation in a multi-pitch environment.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

The Ideal Interaural Parameter Mask: A bound on binaural separation systems.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Multi-voice polyphonic music transcription using eigeninstruments.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Improving MIDI-audio alignment with acoustic features.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Finding similar acoustic events using matching pursuit and locality-sensitive hashing.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Short-term audio-visual atoms for generic video concept classification.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling.
Proceedings of the INTERSPEECH 2009, 2009

Structured Prediction Models for Chord Transcription of Music Audio.
Proceedings of the International Conference on Machine Learning and Applications, 2009

Workshop summary: Sparse methods for music audio.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Handling Asynchrony in Audio-Score Alignment.
Proceedings of the 2009 International Computer Music Conference, 2009

A variational EM algorithm for learning eigenvoice parameters in mixed signals.
Proceedings of the IEEE International Conference on Acoustics, 2009

A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Active Learning for Interactive Multimedia Retrieval.
Proceedings of the IEEE, 2008

Multiple-Instance Learning for Music Information Retrieval.
Proceedings of the ISMIR 2008, 2008

Source separation based on binaural cues and source model constraints.
Proceedings of the INTERSPEECH 2008, 2008

Data-driven articulatory inversion incorporating articulator priors.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Preliminary intelligibility tests of a monaural speech segregation system.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Stylization of pitch with syllable-based linear segments.
Proceedings of the IEEE International Conference on Acoustics, 2008

Detecting music in ambient audio by long-window autocorrelation.
Proceedings of the IEEE International Conference on Acoustics, 2008

A tempo-insensitive distance measure for cover song identification based on chroma features.
Proceedings of the IEEE International Conference on Acoustics, 2008

Cross-correlation of beat-synchronous representations for music similarity.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Autoregressive Modeling of Temporal Envelopes.
IEEE Trans. Signal Processing, 2007

Using Broad Phonetic Group Experts for Improved Speech Recognition.
IEEE Trans. Audio, Speech & Language Processing, 2007

Melody Transcription From Music Audio: Approaches and Evaluation.
IEEE Trans. Audio, Speech & Language Processing, 2007

A Discriminative Model for Polyphonic Piano Transcription.
EURASIP J. Adv. Sig. Proc., 2007

Multimodal Segmentation of Lifelog Data.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications) - RIAO 2007, 8th International Conference, Carnegie Mellon University, Pittsburgh, PA, USA, May 30, 2007

Kodak's consumer video benchmark data set: concept definition and annotation.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Large-scale multimodal semantic concept detection for consumer video.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

A Web-Based Game for Collecting Music Metadata.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Evaluation of Distance Measures Between Gaussian Mixture Models of MFCCs.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Classifying Music Audio with Timbral and Chroma Features.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Fingerprinting to Identify Repeated Sound Events in Long-Duration Personal Audio Recordings.
Proceedings of the IEEE International Conference on Acoustics, 2007

Identifying 'Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
White Worms Don't Work.
;login:, 2006

Support vector machine active learning for music retrieval.
Multimedia Syst., 2006

Classification-based melody transcription.
Machine Learning, 2006

Accessing Minimal-Impact Personal Audio Archives.
IEEE MultiMedia, 2006

Extracting information from music audio.
Commun. ACM, 2006

An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Estimating single-channel source separation masks: relevance vector machine classifiers vs. pitch-based masking.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

A probability model for interaural phase difference.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Voice activity detection in personal audio recordings using autocorrelogram compensation.
Proceedings of the INTERSPEECH 2006, 2006

Estimating the Number of Marine Mammals Using Recordings of Clicks from One Microphone.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Model-Based Monaural Source Separation Using a Vector-Quantized Phase-Vocoder Representation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Decoding speech in the presence of other sources.
Speech Communication, 2005

A Classification Approach to Melody Transcription.
Proceedings of the ISMIR 2005, 2005

Song-Level Features and Support Vector Machines for Music Classification.
Proceedings of the ISMIR 2005, 2005

Clap detection and discrimination for rhythm therapy.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Speech Feature Smoothing for Robust ASR.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Deformable Spectrograms.
Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, 2005

Evaluating Speech Separation Systems.
Proceedings of the Speech Separation by Humans and Machines, 2005

2004
Reflections on Witty.
;login:, 2004

Introduction to the special issue on the recognition and organization of real-world sound.
Speech Communication, 2004

A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures.
Computer Music Journal, 2004

Automatic Record Reviews.
Proceedings of the ISMIR 2004, 2004

Eigenrhythms: Drum pattern basis sets for classification and generation.
Proceedings of the ISMIR 2004, 2004

Towards single-channel unsupervised source separation of speech mixtures: the layered harmonics/formants separation-tracking model.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Features for segmenting and classifying long-duration recordings of "personal" audio.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

LP-TRAP: linear predictive temporal patterns.
Proceedings of the INTERSPEECH 2004, 2004

Multiband audio modeling for single-channel acoustic source separation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Worms vs. perimeters: the case for hard-LANs.
Proceedings of the 12th Annual IEEE Symposium on High Performance Interconnects, 2004

2003
Worm anatomy and model.
Proceedings of the 2003 ACM Workshop on Rapid Malcode, 2003

Ground-truth transcriptions of real music from force-aligned MIDI syntheses.
Proceedings of the ISMIR 2003, 2003

Chord segmentation and recognition using EM-trained hidden markov models.
Proceedings of the ISMIR 2003, 2003

A large-scale evalutation of acoustic and subjective music similarity measures.
Proceedings of the ISMIR 2003, 2003

Using mutual information to design class-specific phone recognizers.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Anchor space for classification and similarity measurement of music.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Audio information access from meeting rooms.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

The ICSI Meeting Corpus.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Multi-channel source separation by factorial HMMs.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Sound texture modelling with linear prediction in both time and frequency domains.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Connectionist speech recognition of Broadcast News.
Speech Communication, 2002

The Quest for Ground Truth in Musical Artist Similarity.
Proceedings of the ISMIR 2002, 2002

Error visualization for tandem acoustic modeling on the Aurora task.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
The auditory organization of speech and other sources in listeners and computational models.
Speech Communication, 2001

The Meeting Project at ICSI.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Investigations into tandem acoustic modeling for the Aurora task.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Tandem acoustic modeling in large-vocabulary recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Using acoustic condition clustering to improve acoustic change detection on broadcast news.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Using mutual information to design feature combinations.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Decoding speech in the presence of other sound sources.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Feature extraction using non-linear transformation for robust speech recognition on the Aurora database.
Proceedings of the IEEE International Conference on Acoustics, 2000

Tandem connectionist feature extraction for conventional HMM systems.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures.
Speech Communication, 1999

The THISL SDR System At TREC-8.
Proceedings of The Eighth Text REtrieval Conference, 1999

Speech/music discrimination based on posterior probability features.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Multi-stream speech recognition: ready for prime time?
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Size matters: an empirical study of neural network training for large vocabulary continuous speech recognition.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1997
The weft: a representation for periodic sounds.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Prediction-driven computational auditory scene analysis.
PhD thesis, 1996

1994
Barefoot multimedia, or, All is not what it seems, Moriarty.
Proceedings of the Interactive Multimedia in University Education: Designing for Change in Teaching and Learning, 1994

A computer implementation of psychoacoustic grouping rules.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

1992
Timescale Modification and Wavelet Representations.
Proceedings of the 1992 International Computer Music Conference, 1992

1991
A Wavelet Based Sinusoid Model of Sound for Auditory Signal Separation.
Proceedings of the 1991 International Computer Music Conference, 1991

1990
Real-time CSound: Software Synthesis with Sensing and Control.
Proceedings of the 1990 International Computer Music Conference, 1990


  Loading...