Frank Kurth

Orcid: 0000-0002-9992-083X

Affiliations:
  • Fraunhofer Institute for Communication, Information Processing and Ergonomics (FKIE), Wachtberg, Germany


According to our database1, Frank Kurth authored at least 72 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Why Do Angular Margin Losses Work Well for Semi-Supervised Anomalous Sound Detection?
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Projected Belief Networks With Discriminative Alignment for Acoustic Event Classification: Rivaling State of the Art CNNs.
CoRR, 2024

2022
SCALA-Speech: An Interactive System for Finding and Analyzing Speech Content in Audio Data.
Proceedings of the 52. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2022, Informatik in den Naturwissenschaften, 26., 2022

Using the Projected Belief Network at High Dimensions.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
Separation of Bird Calls and DOA estimation using a 4-Microphone Array.
Proceedings of the 29th European Signal Processing Conference, 2021

A Data Generation Framework for Acoustic Drone Detection Algorithms.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021

2020
Towards Robust Speech Interfaces for the ISS.
Proceedings of the IUI '20: 25th International Conference on Intelligent User Interfaces, 2020

2019
Efficient Phase-Based Acoustic Tracking of Drones using a Microphone Array.
Proceedings of the 27th European Signal Processing Conference, 2019

Open-Set Acoustic Scene Classification with Deep Convolutional Autoencoders.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
Robust Detection of Jittered Multiply Repeating Audio Events Using Iterated Time-Warped ACF.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Robust Speaker Identification by Fusing Classification Scores with a Neural Network.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
Characterization of discrete linear shift-invariant systems.
Proceedings of the 25th European Signal Processing Conference, 2017

Glottal mixture model (GLOMM) for speaker identification on telephone channels.
Proceedings of the 25th European Signal Processing Conference, 2017

2016
Robust Detection of Multiple Bioacoustic Events with Repetitive Structures.
Proceedings of the Interspeech 2016, 2016

Robust compressive shift retrieval in linear time.
Proceedings of the 24th European Signal Processing Conference, 2016

General Detection of Speech Signals in the Time-Frequency Plane.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Evaluation of Enhanced F0-Trajectories for Speech Detection and Classification in Acoustic Monitoring.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
Using enhanced F0-trajectories for multiple speaker detection in audio monitoring scenarios.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Robust F0 estimation in noisy speech signals using shift autocorrelation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Robust Detection and Pattern Extraction of Repeated Signal Components Using Subband Shift-ACF.
Proceedings of the 2014 IEEE International Conference on Cloud Engineering, 2014

Detection of Audio Events with Repetitive Structure Using Generalized Autocorrelations.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

2013
The shift-ACF: Detecting multiply repeated signal components.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Decompositions of 2D Feature Representations with Applications to Acoustic Event Detection.
Proceedings of the 43. Jahrestagung der Gesellschaft für Informatik, 2013

2012
A digital library framework for heterogeneous music collections: from document acquisition to cross-modal interaction.
Int. J. Digit. Libr., 2012

Probado Music: a Multimodal Online Music Library.
Proceedings of the Non-Cochlear Sound: Proceedings of the 38th International Computer Music Conference, 2012

Unsupervised Techniques for Audio Summarization in Acoustic Environment Monitoring.
Proceedings of the Future Security - 7th Security Research Conference, 2012

A system for audio summarization in acoustic monitoring scenarios.
Proceedings of the 20th European Signal Processing Conference, 2012

Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines.
Proceedings of the Multimodal Music Processing, 2012

Perceptual Hashing for the Identification of Telephone Speech.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

2011
SyncTS: Automatic Synchronization of Speech and Text Documents.
Proceedings of the AES International Conference Semantic Audio 2011, 2011

2010
Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring.
Pattern Recognit. Lett., 2010

Perceptual audio features for unsupervised key-phrase detection.
Proceedings of the IEEE International Conference on Acoustics, 2010

Cyclic tempogram - A mid-level tempo representation for musicsignals.
Proceedings of the IEEE International Conference on Acoustics, 2010

An Analysis of MFCC-like Parametric Audio Features for Keyphrase Spotting Applications.
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010

2009
A construction of compact MFCC-type features using short-time statistics for applications in audio segmentation.
Proceedings of the 17th European Signal Processing Conference, 2009

A Concept for Using Combined Multimodal Queries in Digital Music Libraries.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2009

2008
Efficient Index-Based Audio Matching.
IEEE Trans. Speech Audio Process., 2008

SyncPlayer - Multimodale Wiedergabe, Navigation und Suche in heterogenen digitalen Musikkollektionen.
Proceedings of the LWA 2008, 2008

Automatic Mapping of Scanned Sheet Music to Audio Recordings.
Proceedings of the ISMIR 2008, 2008

Multimodal presentation and browsing of music.
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

A Framework for Managing Multimodal Digitized Music Collections.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2008

2007
Towards Structural Analysis of Audio Recordings in the Presence of Musical Variations.
EURASIP J. Adv. Signal Process., 2007

Large data methods for multimedia.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Automated Synchronization of Scanned Sheet Music with Audio Recordings.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A Demonstration of the SyncPlayer System.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

The Probado Music Repository at the Bavarian State Library.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Lyrics-Based Audio Retrieval and Multimodal Navigation in Music Collections.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2007

PROBADO - A Generic Repository Integration Framework.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2007

2006
Aktuelle Aspekte des Music Information Retrieval.
Datenbank-Spektrum, 2006

An Efficient Multiscale Approach to Audio Synchronization.
Proceedings of the ISMIR 2006, 2006

The Cyclic Beat Spectrum: Tempo-Related Audio Features for Time-Scale Invariant Audio Identification.
Proceedings of the ISMIR 2006, 2006

Enhancing Similarity Matrices for Music Audio Analysis.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Audio Matching via Chroma-Based Statistical Features.
Proceedings of the ISMIR 2005, 2005

Syncplayer - An Advanced System for Multimodal Music Access.
Proceedings of the ISMIR 2005, 2005

Automatisierte Annotation von Audiodaten mittels Synchronisationstechniken.
Proceedings of the 35. Jahrestagung der Gesellschaft für Informatik, 2005

Musikinformatik.
Proceedings of the 35. Jahrestagung der Gesellschaft für Informatik, 2005

Ein verteiltes Medienarchiv für bioakustische Datenbestände.
Proceedings of the 35. Jahrestagung der Gesellschaft für Informatik, 2005

2004
A unified approach to content-based and fault-tolerant music recognition.
IEEE Trans. Multim., 2004

Towards an Efficient Algorithm for Automatic Score-to-Audio Synchronization.
Proceedings of the ISMIR 2004, 2004

A Prototypical Service for Real-Time Access to Local Context-Based Music Information.
Proceedings of the ISMIR 2004, 2004

Content-Based Retrieval in Digital Music Libraries.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2004

Score-PCM Music Synchronization Based on Extracted Score Parameters.
Proceedings of the Computer Music Modeling and Retrieval: Second International Symposium, 2004

2003
Automatic synchronization of music data in score-, MIDI- and PCM-format.
Proceedings of the ISMIR 2003, 2003

2002
A Unified Approach to Content-Based and Fault Tolerant Music Identification.
Proceedings of the Second International Conference on WEB Delivering of Music, 2002

A full-text retrieval approach to content-based audio identification.
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

A ranking technique for fast audio identification.
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

2000
MCML - Music Contents Markup Language.
Proceedings of the ISMIR 2000, 2000

Perceptually Transparent Attachment of Content-Based Data to Audio-Visual Documents.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

1999
Vermeidung von Generationseffekten in der Audiocodierung.
PhD thesis, 1999

Filter bank tree and M-band wavelet packet algorithms in audio signal processing.
IEEE Trans. Signal Process., 1999

Cosine Modulated Structures for Rational Multirate Filter Banks.
Proceedings of the Signal and Image Processing (SIP), 1999

Vermeidung von Generationseffekten in der Audiocodierung.
Proceedings of the Ausgezeichnete Informatikdissertationen 1999, 1999


  Loading...