Xavier Anguera Miró

Orcid: 0000-0001-8659-3991

According to our database, Xavier Anguera Miró authored at least 96 papers between 1996 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
Teaching American English pronunciation using a TTS service.
Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

2017
The zero resource speech challenge 2017.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
The Zero Resource Speech Challenge 2015: Proposed Approaches and Results.
Proceedings of the SLTU-2016, 2016

Zero-Cost Speech Recognition Task at Mediaeval 2016.
Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

English Language Speech Assistant.
Proceedings of the Interspeech 2016, 2016

2015
Fast Single- and Cross-Show Speaker Diarization Using Binary Key Speaker Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Automatic Extraction of the Passing Strategies of Soccer Teams.
CoRR, 2015

Query by Example Search on Speech at Mediaeval 2015.
Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

The zero resource speech challenge 2015.
Proceedings of the INTERSPEECH 2015, 2015

Effect of gender and call duration on customer satisfaction in call center big data.
Proceedings of the INTERSPEECH 2015, 2015

Novel clustering selection criterion for fast binary key speaker diarization.
Proceedings of the INTERSPEECH 2015, 2015

Multimodal read-aloud ebooks for language learning.
Proceedings of the INTERSPEECH 2015, 2015

An information-theoretic metric of fingerprint effectiveness.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

MASK+: Data-driven regions selection for acoustic fingerprinting.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

QUESST2014: Evaluating Query-by-Example Speech Search in a zero-resource setting with real-life queries.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improved binary key speaker diarization system.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Language independent search in MediaEval's Spoken Web Search task.
Comput. Speech Lang., 2014

Query-by-example spoken term detection evaluation on low-resource languages.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Query by Example Search on Speech at Mediaeval 2014.
Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Query-by-example spoken term detection on multilingual unconstrained speech.
Proceedings of the INTERSPEECH 2014, 2014

Audio-to-text alignment for speech recognition with very limited resources.
Proceedings of the INTERSPEECH 2014, 2014

Inferring social relationships in a phone call from a single party's speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Sentiment retrieval on web reviews using spontaneous natural speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Phoneme-Lattice to Phoneme-Sequence Matching Algorithm Based on Dynamic Programming.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Flexible Stand-Alone Keyword Recognition Application Using Dynamic Time Warping.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Global Speaker Clustering towards Optimal Stopping Criterion in Binary Key Speaker Diarization.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

On the modeling of natural vocal emotion expressions through binary key.
Proceedings of the 22nd European Signal Processing Conference, 2014

Combining temporal and spectral information for Query-by-Example Spoken Term Detection.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion.
EURASIP J. Audio Speech Music. Process., 2013

The CMTECH Spoken Web Search System for MediaEval 2013.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

The Telefonica Research Spoken Web Search System for MediaEval 2013.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

The Spoken Web Search Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Information retrieval-based dynamic time warping.
Proceedings of the INTERSPEECH 2013, 2013

Two-Level Clustering towards Unsupervised Discovery of Acoustic Classes.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

A Riemannian Stopping Criterion for Unsupervised Phonetic Segmentation.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Memory efficient subsequence DTW for Query-by-Example Spoken Term Detection.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

The spoken web search task at MediaEval 2012.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speed improvements to Information Retrieval-based dynamic time warping using hierarchical K-Means clustering.
Proceedings of the IEEE International Conference on Acoustics, 2013

Perceptually inspired features for speaker likability classification.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Speaker Diarization: A Review of Recent Research.
IEEE Trans. Speech Audio Process., 2012

The ICSI RT-09 Speaker Diarization System.
IEEE Trans. Speech Audio Process., 2012

The Spoken Web Search Task.
Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Telefonica Research System for the Spoken Web Search task at Mediaeval 2012.
Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

MASK: Robust Local Features for Audio Fingerprinting.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Expert Talk for Time Machine Session: Dynamic Time Warping New Youth.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

The Spoken Web Search Task at MediaEval 2011.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Speaker independent discriminant feature extraction for acoustic pattern-matching.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Combining Features at Search Time: PRISMA at Video Copy Detection Task.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Telefonica Research at TRECVID 2011 Content-Based Copy Detection.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Multimodal fusion for video copy detection.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Telefonica System for the Spoken Web Search Task at Mediaeval 2011.
Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

Speaker Modeling Using Local Binary Decisions.
Proceedings of the INTERSPEECH 2011, 2011

Real-time synchronisation of multimedia streams in a mobile device.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Automatic synchronization of electronic and audio books via TTS alignment and silence filtering.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Closed-form expressions vs. BIC: A comparison for speaker clustering.
Proceedings of the IEEE International Conference on Acoustics, 2011

Fast speaker diarization based on binary keys.
Proceedings of the IEEE International Conference on Acoustics, 2011

Discriminant binary data representation for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Spoken WordCloud: Clustering recurrent patterns in speech.
Proceedings of the 9th International Workshop on Content-Based Multimedia Indexing, 2011

2010
Telefonica Research at TRECVID 2010 Content-Based Copy Detection.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Improvements to the equal-parameter BIC for speaker diarization.
Proceedings of the INTERSPEECH 2010, 2010

System output combination for improved speaker diarization.
Proceedings of the INTERSPEECH 2010, 2010

A novel speaker binary key derived from anchor models.
Proceedings of the INTERSPEECH 2010, 2010

Enriching music mood annotation by semantic association reasoning.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

MuViSync: Realtime music video alignment.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Partial sequence matching using an Unbounded Dynamic Time Warping algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Telefonica Research Content-Based Copy Detection TRECVID Submission.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

The role of tags and image aesthetics in social image search.
Proceedings of the first SIGMM workshop on Social media, 2009

Multimodal video copy detection applied to social media.
Proceedings of the first SIGMM workshop on Social media, 2009

Text versus speech: a comparison of tagging input modalities for camera phones.
Proceedings of the 11th Conference on Human-Computer Interaction with Mobile Devices and Services, 2009

Minivectors: an improved GMM-SVM approach for speaker verification.
Proceedings of the INTERSPEECH 2009, 2009

Audio-based automatic management of TV commercials.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Multimodal photo annotation and retrieval on a mobile phone.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

MAMI: multimodal annotations on a camera phone.
Proceedings of the 10th Conference on Human-Computer Interaction with Mobile Devices and Services, 2008

TV Advertisements Detection and Clustering Based on Acoustic Information.
Proceedings of the 2008 International Conferences on Computational Intelligence for Modelling, 2008

2007
Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information.
IEEE Trans. Computers, 2007

Acoustic Beamforming for Speaker Diarization of Meetings.
IEEE Trans. Speech Audio Process., 2007

Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2007

Model Complexity Selection and Cross-Validation EM Training for Robust Speaker Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2007

The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Speaker Diarization for Conference Room: The UPC RT07s Evaluation System.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
Robust speaker diarization for meetings.
PhD thesis, 2006

Hybrid Speech/non-speech detector applied to Speaker Diarization of Meetings.
Proceedings of the Odyssey 2006, 2006

Speaker Diarization for Multi-microphone Meetings Using Only Between-Channel Differences.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

The ICSI-SRI Spring 2006 Meeting Recognition System.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Robust Speaker Diarization for Meetings: ICSI RT06S Meetings Evaluation System.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Speaker diarization for multiple distant microphone meetings: mixing acoustic features and inter-channel time differences.
Proceedings of the INTERSPEECH 2006, 2006

Multi-stream speaker diarization systems for the meetings domain.
Proceedings of the INTERSPEECH 2006, 2006

Robust speaker diarization for meetings: ICSI RT06s evaluation system.
Proceedings of the INTERSPEECH 2006, 2006

Friends and enemies: a novel initialization for speaker diarization.
Proceedings of the INTERSPEECH 2006, 2006

Purity Algorithms for Speaker Diarization of Meetings Data.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

2004
Evolutive speaker segmentation using a repository system.
Proceedings of the INTERSPEECH 2004, 2004

1998
A VQ based speaker recognition system based in histogram distances. text independent and for noisy environments.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1996
Text independent speaker identification on noisy environments by means of self organizing maps.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

