Benjamin Elizalde

Orcid: 0000-0001-6461-5790

According to our database, Benjamin Elizalde authored at least 42 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
PAM: Prompting Audio-Language Models for Audio Quality Assessment.
CoRR, 2024

NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription.
CoRR, 2024

2023
Prompting Audios Using Acoustic Properties For Emotion Representation.
CoRR, 2023

Training Audio Captioning Models without Audio.
CoRR, 2023

Natural Language Supervision for General-Purpose Audio Representations.
CoRR, 2023

Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session.
CoRR, 2023

Pengi: An Audio Language Model for Audio Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multi-View Learning for Speech Emotion Recognition with Categorical Emotion, Categorical Sentiment, and Dimensional Scores.
Proceedings of the IEEE International Conference on Acoustics, 2023

CLAP Learning Audio Concepts from Natural Language Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Describing emotions with acoustic property prompts for speech emotion recognition.
CoRR, 2022

Audio Retrieval with WavText5K and CLAP Training.
CoRR, 2022

2021
COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge.
CoRR, 2021

Identifying Actions for Sound Event Classification.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

2020
Multi-Label Sound Event Retrieval Using A Deep Learning-Based Siamese Structure With A Pairwise Presence Matrix.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Sound Event Detection in the DCASE 2017 Challenge.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
AudioPairBank: towards a large-scale tag-pair-based audio content analysis.
EURASIP J. Audio Speech Music. Process., 2018

NELS - Never-Ending Learner of Sounds.
CoRR, 2018

Content-Based Representations of Audio Using Siamese Neural Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Acoustic Scene Classification Using Discrete Random Hashing for Laplacian Kernel Machines.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Framework for Evaluation of Sound Event Detection in Web Videos.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Audio Content Based Geotagging in Multimedia.
Proceedings of the Interspeech 2017, 2017

An approach for self-training audio event detectors using web data.
Proceedings of the 25th European Signal Processing Conference, 2017

DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016
An Approach for Self-Training Audio Event Detectors Using Web Data.
CoRR, 2016

AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis.
CoRR, 2016

YFCC100M: the new data in multimedia research.
Commun. ACM, 2016

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

City-Identification of Flickr Videos Using Semantic Acoustic Features.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015
The New Data and New Challenges in Multimedia Research.
CoRR, 2015

The YLI-MED Corpus: Characteristics, Procedures, and Plans.
CoRR, 2015

Insights into Audio-Based Multimedia Event Classification with Neural Networks.
Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Kickstarting the Commons: The YFCC100M and the YLI Corpora.
Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

2014
The Placing Task: A Large-Scale Geo-Estimation Challenge for Social-Media Videos and Images.
Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia, 2014

Audio-concept features and hidden Markov models for multimedia event detection.
Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014

Audio concept classification with Hierarchical Deep Neural Networks.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
An i-Vector Representation of Acoustic Environments for Audio-Based Video Event Detection on User Generated Content.
Proceedings of the 2013 IEEE International Symposium on Multimedia, 2013

Audio Concept Ranking for Video Event Detection on User-Generated Content.
Proceedings of the First Workshop on Speech, 2013

Lost in segmentation: Three approaches for speech/non-speech detection in consumer-produced videos.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

2012
SRI-Sarnoff AURORA System at TRECVID 2012 Multimedia Event Detection and Recounting.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

