Daniel P. W. Ellis

Proceedings of the 17th International Conference on Digital Audio Effects, 2014

2013

Modeling nonlinear circuits with linearized dynamical models via kernel regression.

Daniel J. Gillespie

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Speech enhancement by sparse, low-rank, and dictionary spectrogram decomposition.

Zhuo Chen

Gustavo Enrique De Almeida Prado Alves Batista

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

IBM Research and Columbia University TRECVID-2013 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), Surveillance Event Detection (SED), and Semantic Indexing (SIN) Systems.

Rogério Schmidt Feris

Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

A Video Compression-Based Approach to Measure Music Structural Similarity.

Diego Furtado Silva

Hélène Papadopoulos

Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Beta Process Sparse Nonnegative Matrix Factorization for Music.

Dawen Liang

Matthew D. Hoffman

Gustavo E. A. P. A. Batista

Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

All for one: feature combination for highly channel-degraded speech activity detection.

Proceedings of the INTERSPEECH 2013, 2013

Applying Machine Learning and Audio Analysis Techniques to Insect Recognition in Intelligent Traps.

Diego Furtado Silva

Vinícius M. A. de Souza

Eamonn J. Keogh

Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Subband autocorrelation features for video soundtrack classification.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

2012

Data-driven voice source waveform analysis and synthesis.

Speech Commun., 2012

The million song dataset challenge.

Brian McFee

Gert R. G. Lanckriet

Proceedings of the 21st World Wide Web Conference, 2012

IBM Research and Columbia University TRECVID-2012 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), and Semantic Indexing (SIN) Systems.

Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis.

Gerald Friedland

Florian Metze

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Making a scene: alignment of complete sets of clips based on pairwise audio match.

Proceedings of the International Conference on Multimedia Retrieval, 2012

Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude.

Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Inharmonic speech: a tool for the study of speech perception and separation.

Josh H. McDermott

Hideki Kawahara

Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Noise Robust Pitch Tracking by Subband Autocorrelation Classification.

Byung Suk Lee

Proceedings of the INTERSPEECH 2012, 2012

2011

Combining localization cues and source model constraints for binaural source separation.

Speech Commun., 2011

Introduction to the Special Issue on Music Signal Processing.

IEEE J. Sel. Top. Signal Process., 2011

Signal Processing for Music Analysis.

IEEE J. Sel. Top. Signal Process., 2011

Transcribing Multi-Instrument Polyphonic Music With Hierarchical Eigeninstruments.

IEEE J. Sel. Top. Signal Process., 2011

General chair's introduction.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Spectral vs. spectro-temporal features for acoustic event detection.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Large-scale cover song recognition using hashed chroma landmarks.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System.

Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Consumer video understanding: a benchmark database and an evaluation of human and machine performance.

Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

The Million Song Dataset.

Brian Whitman

Paul Lamere

Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors.

Fadi Biadsy

Julia Hirschberg

Proceedings of the INTERSPEECH 2011, 2011

Direct processing of MPEG audio using companding and BFP techniques.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

Classifying soundtracks with audio texture features.

Xiaohong Zeng

Josh H. McDermott

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

Soundtrack classification by transient events.

Alexander C. Loui

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

Evaluating music sequence models through missing data.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition.

Ben Gold

Nelson Morgan

Wiley, ISBN: 978-0-470-19536-9, 2011

2010

Audio-visual atoms for generic video concept classification.

ACM Trans. Multim. Comput. Commun. Appl., 2010

Model-Based Expectation-Maximization Source Separation and Localization.

IEEE Trans. Speech Audio Process., 2010

Evaluating Source Separation Algorithms With Reverberant Speech.

Barbara G. Shinn-Cunningham

Scott Bressler

IEEE Trans. Speech Audio Process., 2010

Audio-Based Semantic Concept Classification for Consumer Video.

IEEE Trans. Speech Audio Process., 2010

Speech separation using speaker-adapted eigenvoice speech models.

Comput. Speech Lang., 2010

Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching.

Subhabrata Bhattacharya

Mubarak Shah

Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

A Probabilistic Subspace Model for Multi-instrument Polyphonic Transcription.

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Clustering Beat-Chroma Patterns in a Large Music Database.

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Cover song detection: From high scores to general classification.

Suman V. Ravuri

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Detecting local semantic concepts in environmental sounds using Markov model based clustering.

Alexander C. Loui

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Audio fingerprinting to identify multiple videos of an event.

Mads Græsbøll Christensen

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

2009

Quantitative Analysis of a Common Audio Similarity Measure.

Jesper Højvang Jensen

Søren Holdt Jensen

IEEE Trans. Speech Audio Process., 2009

Guided harmonic sinusoid estimation in a multi-pitch environment.

Christine Smit

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

The Ideal Interaural Parameter Mask: A bound on binaural separation systems.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Multi-voice polyphonic music transcription using eigeninstruments.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Improving MIDI-audio alignment with acoustic features.

Johanna Devaney

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Finding similar acoustic events using matching pursuit and locality-sensitive hashing.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Short-term audio-visual atoms for generic video concept classification.

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling.

Proceedings of the INTERSPEECH 2009, 2009

Structured Prediction Models for Chord Transcription of Music Audio.

Adrian Weller

Tony Jebara

Proceedings of the International Conference on Machine Learning and Applications, 2009

Workshop summary: Sparse methods for music audio.

Douglas Eck

Philippe Hamel

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Handling Asynchrony in Audio-Score Alignment.

Johanna Devaney

Proceedings of the 2009 International Computer Music Conference, 2009

A variational EM algorithm for learning eigenvoice parameters in mixed signals.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2009

A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation.

Jesper Bünsow Boldt

Proceedings of the 17th European Signal Processing Conference, 2009

2008

Active Learning for Interactive Multimedia Retrieval.

Proc. IEEE, 2008

Multiple-Instance Learning for Music Information Retrieval.

Proceedings of the ISMIR 2008, 2008

Source separation based on binaural cues and source model constraints.

Proceedings of the INTERSPEECH 2008, 2008

Data-driven articulatory inversion incorporating articulator priors.

Adam C. Lammert

Barbara G. Shinn-Cunningham

Pierre L. Divenyi

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Preliminary intelligibility tests of a monaural speech segregation system.

DeLiang Wang

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Stylization of pitch with syllable-based linear segments.

Suman V. Ravuri

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2008

Detecting music in ambient audio by long-window autocorrelation.

Mads Græsbøll Christensen

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2008

A tempo-insensitive distance measure for cover song identification based on chroma features.

Jesper Højvang Jensen

Søren Holdt Jensen

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2008

Cross-correlation of beat-synchronous representations for music similarity.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2008

2007

Autoregressive Modeling of Temporal Envelopes.

IEEE Trans. Signal Process., 2007

Using Broad Phonetic Group Experts for Improved Speech Recognition.

Patricia Scanlon

Richard B. Reilly

IEEE Trans. Speech Audio Process., 2007

Melody Transcription From Music Audio: Approaches and Evaluation.

IEEE Trans. Speech Audio Process., 2007

A Discriminative Model for Polyphonic Piano Transcription.

EURASIP J. Adv. Signal Process., 2007

Multimodal Segmentation of Lifelog Data.

Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications) - RIAO 2007, 8th International Conference, Carnegie Mellon University, Pittsburgh, PA, USA, May 30, 2007

Kodak's consumer video benchmark data set: concept definition and annotation.

Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Large-scale multimodal semantic concept detection for consumer video.

Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

A Web-Based Game for Collecting Music Metadata.

Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Evaluation of Distance Measures Between Gaussian Mixture Models of MFCCs.

Jesper Højvang Jensen

Mads Græsbøll Christensen

Søren Holdt Jensen

Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Classifying Music Audio with Timbral and Chroma Features.

Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Fingerprinting to Identify Repeated Sound Events in Long-Duration Personal Audio Recordings.

James P. Ogle

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007

Identifying 'Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007

2006

White Worms Don't Work.

Nicholas Weaver

Support vector machine active learning for music retrieval.

Multim. Syst., 2006

Classification-based melody transcription.

Mach. Learn., 2006

Accessing Minimal-Impact Personal Audio Archives.

IEEE Multim., 2006

Extracting information from music audio.

Commun. ACM, 2006

An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments.

Tony Jebara

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Estimating single-channel source separation masks: relevance vector machine classifiers vs. pitch-based masking.

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

A probability model for interaural phase difference.

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Voice activity detection in personal audio recordings using autocorrelogram compensation.

Proceedings of the INTERSPEECH 2006, 2006

Estimating the Number of Marine Mammals Using Recordings of Clicks from One Microphone.

Xanadu Halkias

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Model-Based Monaural Source Separation Using a Vector-Quantized Phase-Vocoder Representation.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Pushing the envelope - aside [speech recognition].

IEEE Signal Process. Mag., 2005

Decoding speech in the presence of other sources.

Jon P. Barker

Martin P. Cooke

Speech Commun., 2005

A Classification Approach to Melody Transcription.

Proceedings of the ISMIR 2005, 2005

Song-Level Features and Support Vector Machines for Music Classification.

Proceedings of the ISMIR 2005, 2005

Clap detection and discrimination for rhythm therapy.

Nathan Lesser

Proceedings of the 2005 IEEE International Conference on Acoustics, Speech and Signal Processing, 2005

Speech Feature Smoothing for Robust ASR.

Chia-Ping Chen

Jeff A. Bilmes

Proceedings of the 2005 IEEE International Conference on Acoustics, Speech and Signal Processing, 2005

Deformable Spectrograms.

Manuel Reyes-Gomez

Nebojsa Jojic

Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, 2005

Evaluating Speech Separation Systems.

Proceedings of the Speech Separation by Humans and Machines, 2005

2004

Reflections on Witty.

Nicholas Weaver

Introduction to the special issue on the recognition and organization of real-world sound.

Martin P. Cooke

Speech Commun., 2004

A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures.

Comput. Music. J., 2004

Automatic Record Reviews.

Brian Whitman

Proceedings of the ISMIR 2004, 2004

Eigenrhythms: Drum pattern basis sets for classification and generation.

John Arroyo

Proceedings of the ISMIR 2004, 2004

Towards single-channel unsupervised source separation of speech mixtures: the layered harmonics/formants separation-tracking model.

Manuel Reyes-Gomez

Nebojsa Jojic

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Features for segmenting and classifying long-duration recordings of "personal" audio.

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns.

Hynek Hermansky

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

LP-TRAP: linear predictive temporal patterns.

Hynek Hermansky

Proceedings of the INTERSPEECH 2004, 2004

Multiband audio modeling for single-channel acoustic source separation.

Nebojsa Jojic

Proceedings of the 2004 IEEE International Conference on Acoustics, Speech and Signal Processing, 2004

Worms vs. perimeters: the case for hard-LANs.

Proceedings of the 12th Annual IEEE Symposium on High Performance Interconnects, 2004

2003

Worm anatomy and model.

Proceedings of the 2003 ACM Workshop on Rapid Malcode, 2003

Ground-truth transcriptions of real music from force-aligned MIDI syntheses.

Robert J. Turetsky

Proceedings of the ISMIR 2003, 2003

Chord segmentation and recognition using EM-trained hidden Markov models.

Alexander Sheh

Proceedings of the ISMIR 2003, 2003

A large-scale evaluation of acoustic and subjective music similarity measures.

Proceedings of the ISMIR 2003, 2003

Using mutual information to design class-specific phone recognizers.

Patricia Scanlon

Richard B. Reilly

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling.

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Anchor space for classification and similarity measurement of music.

Adam Berenzweig

Steve Lawrence

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Audio information access from meeting rooms.

Steve Renals

Proceedings of the 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, 2003

The ICSI Meeting Corpus.

Proceedings of the 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, 2003

Multi-channel source separation by factorial HMMs.

Bhiksha Raj

Proceedings of the 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, 2003

Sound texture modelling with linear prediction in both time and frequency domains.

Proceedings of the 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, 2003

2002

Connectionist speech recognition of Broadcast News.

Speech Commun., 2002

The Quest for Ground Truth in Musical Artist Similarity.

Proceedings of the ISMIR 2002, 2002

Error visualization for tandem acoustic modeling on the Aurora task.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2002

2001

The auditory organization of speech and other sources in listeners and computational models.

Martin Cooke

Speech Commun., 2001

The Meeting Project at ICSI.

Proceedings of the First International Conference on Human Language Technology Research, 2001

Investigations into tandem acoustic modeling for the Aurora task.

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Tandem acoustic modeling in large-vocabulary recognition.

Rita Singh

Sunil Sivadas

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2001

2000

Using acoustic condition clustering to improve acoustic change detection on broadcast news.

Javier Ferreiros López

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Using mutual information to design feature combinations.

Jeff A. Bilmes

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Decoding speech in the presence of other sound sources.

Jon Barker

Martin Cooke

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Feature extraction using non-linear transformation for robust speech recognition on the Aurora database.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2000

Tandem connectionist feature extraction for conventional HMM systems.

Hynek Hermansky

Sangita Sharma

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2000

1999

Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures.
