Benjamin Elizalde

Huaming Wang

Proceedings of the IEEE International Conference on Acoustics, 2024

Prompting Audios Using Acoustic Properties for Emotion Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Training Audio Captioning Models without Audio.

[BibT_eX]

[DOI]

Dimitra Emmanouilidou

Rita Singh

Huaming Wang

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session.

[BibT_eX]

[DOI]

CoRR, 2023

Pengi: An Audio Language Model for Audio Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Audio Retrieval with WavText5K and CLAP Training.

[BibT_eX]

[DOI]

Huaming Wang

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-View Learning for Speech Emotion Recognition with Categorical Emotion, Categorical Sentiment, and Dimensional Scores.

[BibT_eX]

[DOI]

Daniel Tompkins

Dimitra Emmanouilidou

Proceedings of the IEEE International Conference on Acoustics, 2023

CLAP Learning Audio Concepts from Natural Language Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Describing emotions with acoustic property prompts for speech emotion recognition.

[BibT_eX]

[DOI]

CoRR, 2022

2021

COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge.

[BibT_eX]

[DOI]

Daniel Tompkins

CoRR, 2021

Identifying Actions for Sound Event Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

2020

Never-Ending Learning of Sounds.

[BibT_eX]

[DOI]

PhD thesis, 2020

Multi-Label Sound Event Retrieval Using A Deep Learning-Based Siamese Structure With A Pairwise Presence Matrix.

[BibT_eX]

[DOI]

Jianyu Fan

Eric Nichols

Daniel Tompkins

Ana Elisa Méndez Méndez

Philippe Pasquier

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Sound Event Detection in the DCASE 2017 Challenge.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio.

[BibT_eX]

[DOI]

Shuayb Zarar

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

AudioPairBank: towards a large-scale tag-pair-based audio content analysis.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2018

NELS - Never-Ending Learner of Sounds.

[BibT_eX]

[DOI]

CoRR, 2018

Content-Based Representations of Audio Using Siamese Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Acoustic Scene Classification Using Discrete Random Hashing for Laplacian Kernel Machines.

[BibT_eX]

[DOI]

Abelino Jimenez

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Framework for Evaluation of Sound Event Detection in Web Videos.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Audio Content Based Geotagging in Multimedia.

[BibT_eX]

[DOI]

Anurag Kumar

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An approach for self-training audio event detectors using web data.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features.

[BibT_eX]

[DOI]

Abelino Jimenez

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016

An Approach for Self-Training Audio Event Detectors Using Web Data.

[BibT_eX]

[DOI]

CoRR, 2016

AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis.

[BibT_eX]

[DOI]

CoRR, 2016

YFCC100M: the new data in multimedia research.

[BibT_eX]

[DOI]

Commun. ACM, 2016

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

City-Identification of Flickr Videos Using Semantic Acoustic Features.

[BibT_eX]

[DOI]

Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015

The New Data and New Challenges in Multimedia Research.

[BibT_eX]

[DOI]

CoRR, 2015

The YLI-MED Corpus: Characteristics, Procedures, and Plans.

[BibT_eX]

[DOI]

CoRR, 2015

Insights into Audio-Based Multimedia Event Classification with Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Kickstarting the Commons: The YFCC100M and the YLI Corpora.

[BibT_eX]

[DOI]

Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling.

[BibT_eX]

[DOI]

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

2014

The Placing Task: A Large-Scale Geo-Estimation Challenge for Social-Media Videos and Images.

[BibT_eX]

[DOI]

Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia, 2014

Audio-concept features and hidden Markov models for multimedia event detection.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014

Audio concept classification with Hierarchical Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 22nd European Signal Processing Conference, 2014

2013

An i-Vector Representation of Acoustic Environments for Audio-Based Video Event Detection on User Generated Content.

[BibT_eX]

[DOI]

Howard Lei

Gerald Friedland

Proceedings of the 2013 IEEE International Symposium on Multimedia, 2013

Audio Concept Ranking for Video Event Detection on User-Generated Content.

[BibT_eX]

[DOI]

Mirco Ravanelli

Gerald Friedland

Proceedings of the First Workshop on Speech, 2013

Lost in segmentation: Three approaches for speech/non-speech detection in consumer-produced videos.

[BibT_eX]

[DOI]