Guillaume Gravier

Orcid: 0000-0002-2266-5682

Affiliations:
  • IRISA Rennes, France


According to our database1, Guillaume Gravier authored at least 180 papers between 1996 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Exposing propaganda: an analysis of stylistic cues comparing human annotations and machine classification.
CoRR, 2024

2023
Derrière les plongements de relations.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023 - Volume 1 : travaux de recherche originaux, 2023

Géométrie de l'auto-attention en classification : quand la géométrie remplace l'attention.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023 - Volume 1 : travaux de recherche originaux, 2023

Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation.
Proceedings of the Natural Language Processing and Information Systems, 2023

A Novel Method for Temporal Graph Classification based on Transitive Reduction.
Proceedings of the 10th IEEE International Conference on Data Science and Advanced Analytics, 2023

Filtering Safe Temporal Motifs in Dynamic Graphs for Dissemination Purposes.
Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2023

2022
Affect in Multimedia: Benchmarking Violent Scenes Detection.
IEEE Trans. Affect. Comput., 2022

Filtrage et régularisation pour améliorer la plausibilité des poids d'attention dans la tâche d'inférence en langue naturelle (Filtering and regularization to improve the plausibility of attention weights in NLI).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

Une étude statistique des plongements dans les modèles transformers pour le français (An empirical statistical study of embeddings in French transformers).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

2021
Understanding the phenomenology of reading through modelling.
Semantic Web, 2021

Hierarchical multi-label propagation using speaking face graphs for multimodal person discovery.
Multim. Tools Appl., 2021

A survey on training and evaluation of word embeddings.
Int. J. Data Sci. Anal., 2021

Unsupervised Tree Extraction in Embedding Spaces for Taxonomy Induction.
Proceedings of the WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence, Melbourne VIC Australia, December 14, 2021

Active Learning for Interactive Relation Extraction in a French Newspaper's Articles.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference.
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

2020
Relation, es-tu là ? Détection de relations par LSTM pour améliorer l'extraction de relations (Relation, are you there ? LSTM-based relation detection to improve knowledge extraction ).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

A Novel Path-Based Entity Relatedness Measure for Efficient Collective Entity Linking.
Proceedings of the Semantic Web - ISWC 2020, 2020

On the Correlation of Word Embedding Evaluation Metrics.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A correlation-based entity embedding approach for robust entity linking.
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

Rethinking deep active learning: Using unlabeled data at model training.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

HierarX : un outil pour la découverte de hiérarchies dans des espaces hyperboliques à partir de similarités.
Proceedings of the Extraction et Gestion des Connaissances, 2020

The Reading Experiences Ontology (REO): Reusing and Extending CIDOC CRM.
Proceedings of the 15th Annual International Conference of the Alliance of Digital Humanities Organizations, 2020

IRISA System for Entity Detection and Linking at CLEF HIPE 2020.
Proceedings of the Working Notes of CLEF 2020, 2020

2019
AI in the media and creative industries.
CoRR, 2019

Using Knowledge Base Semantics in Context-Aware Entity Linking.
Proceedings of the ACM Symposium on Document Engineering 2019, 2019

Modelling changes in diaries, correspondence and authors' libraries to support research on reading: the READ-IT approach.
Proceedings of the First International Workshop on Open Data and Ontologies for Cultural Heritage co-located with the 31st International Conference on Advanced Information Systems Engineering, 2019

2018
A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking.
IEEE Multim., 2018

A Study on Multimodal Video Hyperlinking with Visual Aggregation.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

2017
Content-based unsupervised segmentation of recurrent TV programs using grammatical inference.
Multim. Tools Appl., 2017

Special Issue on Content Based Multimedia Indexing.
Multim. Tools Appl., 2017

The Benchmarking Initiative for Multimedia Evaluation: MediaEval 2016.
IEEE Multim., 2017

IRISA at TrecVid 2017: Beyond Crossmodal and Multimodal Models for Video Hyperlinking.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

Exploiting Multimodality in Video Hyperlinking to Improve Target Diversity.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

NexGenTV: Providing Real-Time Insight during Political Debates in a Second Screen Application.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Generative Adversarial Networks for Multimodal Representation Learning in Video Hyperlinking.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Linking Multimedia Content for Efficient News Browsing.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

One-Step Time-Dependent Future Video Frame Prediction with a Convolutional Encoder-Decoder Neural Network.
Proceedings of the Image Analysis and Processing - ICIAP 2017, 2017

Language-based Construction of Explorable News Graphs for Journalists.
Proceedings of the 2017 Workshop: Natural Language Processing meets Journalism, 2017

Graphes typés pour l'exploration d'actualités.
Proceedings of the Actes des 13èmes journées francophones sur les Entrepôts de Données et l'Analyse en Ligne, 2017


Tag Propagation Approaches within Speaking Face Graphs for Multimodal Person Discovery.
Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, 2017

2016
Partial least squares for face hashing.
Neurocomputing, 2016

Reports on CBMI 16 and ICME 16.
IEEE Multim., 2016

IRISA at TrecVid2016: Crossmodality, Multimodality and Monomodality for Video Hyperlinking.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Évaluation dune nouvelle structuration thématique hiérarchique des textes dans un cadre de résumé automatique et de détection d'ancres au sein de vidéos (Evaluation of a novel hierarchical thematic structuring of texts in the framework of text summarization and anchor detection for video hyperlinking).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 2 : TALN (Articles longs), 2016

Shaping-Up Multimedia Analytics: Needs and Expectations of Media Professionals.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Multimodal and Crossmodal Representation Learning from Textual and Visual Features with Bidirectional Deep Neural Networks for Video Hyperlinking.
Proceedings of the 2016 ACM workshop on Vision and Language Integration Meets Multimedia Fusion, 2016

Bidirectional Joint Representation Learning with Symmetrical Deep Neural Networks for Multimodal and Crossmodal Applications.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

PUCMinas and IRISA at Multimodal Person Discovery.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

A Step Beyond Local Observations with a Dialog Aware Bidirectional GRU Network for Spoken Language Understanding.
Proceedings of the Interspeech 2016, 2016

Audio word similarity for clustering with zero resources based on iterative HMM classification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Near-duplicate video detection based on an approximate similarity self-join strategy.
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016

2015
Variability modelling for audio events detection in movies.
Multim. Tools Appl., 2015

VSD, a public dataset for the detection of violent scenes in movies: design, annotation, analysis and evaluation.
Multim. Tools Appl., 2015

IRISA at TrecVid 2015: Leveraging Multimodal LDA for Video Hyperlinking.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Vers une typologie de liens entre contenus journalistiques.
Proceedings of the Actes de la 22e conference sur le Traitement Automatique des Langues Naturelles. Articles courts, 2015

Hierarchical Topic Structuring: From Dense Segmentation to Topically Focused Fragments via Burst Analysis.
Proceedings of the Recent Advances in Natural Language Processing, 2015

Sequential Pattern Mining on Multimedia Data.
Proceedings of the 1st International Workshop on Advanced Analytics and Learning on Temporal Data, 2015

Content-Based Discovery of Multiple Structures from Episodes of Recurrent TV Programs Based on Grammatical Inference.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Hierarchical Topic Models for Language-based Video Hyperlinking.
Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia, 2015

Overview of the 2015 Workshop on Speech, Language and Audio in Multimedia.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

IRISA at MediaEval 2015: Search and Anchoring in Video Archives Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

SSIG and IRISA at Multimodal Person Discovery.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Recording and Analyzing Benchmarking Results: The Aims of the MediaEval Working Notes Proceedings.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Is it time to Switch to word embedding and recurrent neural networks for spoken language understanding?
Proceedings of the INTERSPEECH 2015, 2015

Learning to hash faces using large feature vectors.
Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015

2014
Multimodal Violence Detection in Hollywood Movies: State-of-the-Art and Benchmarking.
Proceedings of the Fusion in Computer Vision - Understanding Complex Visual Content, 2014

Classification-oriented structure learning in Bayesian networks for multimodal event detection in videos.
Multim. Tools Appl., 2014

Language independent search in MediaEval's Spoken Web Search task.
Comput. Speech Lang., 2014

IRISA and KUL at MediaEval 2014: Search and Hyperlinking Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Bridging the gap between speech technology and natural language processing: an evaluation toolbox for term discovery systems.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

The ETAPE speech processing evaluation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Investigating domain-independent nlp techniques for precise target selection in video hyperlinking.
Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014

Audio thumbnails for spoken content without transcription based on a maximum motif coverage criterion.
Proceedings of the INTERSPEECH 2014, 2014

Content-based inference of hierarchical structural grammar for recurrent TV programs using multiple sequence alignment.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

2013
Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding.
IEEE Trans. Speech Audio Process., 2013

A probabilistic segment model combining lexical cohesion and disruption for topic segmentation (Un modèle segmental probabiliste combinant cohésion lexicale et rupture lexicale pour la segmentation thématique) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2013

Sim-min-hash: an efficient matching technique for linking large image collections.
Proceedings of the ACM Multimedia Conference, 2013

Retrieving geo-location of videos with a divide & conquer hierarchical multimodal approach.
Proceedings of the International Conference on Multimedia Retrieval, 2013

Multimedia information seeking through search and hyperlinking.
Proceedings of the International Conference on Multimedia Retrieval, 2013

Technicolor/INRIA Team at the MediaEval 2013 Violent Scenes Detection Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

HITS and IRISA at MediaEval 2013: Search and Hyperlinking Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Searching for Near-Duplicate Video Sequences from a Scalable Sequence Aligner.
Proceedings of the 2013 IEEE International Symposium on Multimedia, 2013

A Framework for Integrating Heterogeneous Sporadic Knowledge Sources into Automatic Speech Recognition.
Proceedings of the First Workshop on Speech, 2013

MODIS: an audio motif discovery software.
Proceedings of the INTERSPEECH 2013, 2013

The spoken web search task at MediaEval 2012.
Proceedings of the IEEE International Conference on Acoustics, 2013

Leveraging Lexical Cohesion and Disruption for Topic Segmentation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Audio event detection in movies using multiple audio words and contextual Bayesian networks.
Proceedings of the 11th International Workshop on Content-Based Multimedia Indexing, 2013

Oriented pooling for dense and non-dense rotation-invariant features.
Proceedings of the British Machine Vision Conference, 2013

2012
Unsupervised Motif Acquisition in Speech via Seeded Discovery and Template Matching Combination.
IEEE Trans. Speech Audio Process., 2012

Enhancing lexical cohesion measure with confidence measures, semantic relations and language model interpolation for multimedia spoken content topic segmentation.
Comput. Speech Lang., 2012

Automates lexico-phonétiques pour l'indexation et la recherche de segments de parole (Lexical-phonetic automata for spoken utterance indexing and retrieval) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Towards a new speech event detection approach for landmark-based speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Improving Cluster Selection and Event Modeling in Unsupervised Mining for Automatic Audiovisual Video Structuring.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Texmix: an automatically generated news navigation portal.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene Detection Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

The Spoken Web Search Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

IRISA at MediaEval 2012: Search and Hyperlinking Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

The MediaEval 2012 Affect Task: Violent Scenes Detection.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

The ETAPE corpus for the evaluation of speech-based TV content processing in the French language.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Using broad phonetic classes to guide search in automatic speech recognition.
Proceedings of the INTERSPEECH 2012, 2012

Integrating Stress Information in Large Vocabulary Continuous Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Lexical-phonetic automata for spoken utterance indexing and retrieval.
Proceedings of the INTERSPEECH 2012, 2012

Unsupervised Mining of Multiple Audiovisually Consistent Clusters for Video Structure Analysis.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Multimodal information fusion and temporal integration for violence detection in movies.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

The Spoken Web Search Task at MediaEval 2011.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

BABAZ: A large scale audio search system for video copy detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Efficient Mining of Repetitions in Large-Scale TV Streams with Product Quantization Hashing.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

A Benchmarking Campaign for the Multimodal Detection of Violent Scenes in Movies.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

2011
Exploiting Speech for Automatic TV Delinearization: From Streams to Cross-Media Semantic Navigation.
EURASIP J. Image Video Process., 2011

A Scalable Video Search Engine Based on Audio Content Indexing and Topic Segmentation
CoRR, 2011

Technicolor and INRIA/IRISA at MediaEval 2011: learning temporal modality integration with Bayesian Networks.
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

Irisa MediaEval 2011 Spoken Web Search System.
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

The MediaEval 2011 Affect Task: Violent Scenes Detection in Hollywood movies.
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

Zero-Resource Audio-Only Spoken Term Detection Based on a Combination of Template Matching Techniques.
Proceedings of the INTERSPEECH 2011, 2011

A Study on Auditory Feature Spaces for Speech-Driven Lip Animation.
Proceedings of the INTERSPEECH 2011, 2011

Unsupervised mining of audiovisually consistent segments in videos with application to structure analysis.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Towards robust word discovery by self-similarity matrix comparison.
Proceedings of the IEEE International Conference on Acoustics, 2011

Automatically finding semantically consistent n-grams to add new words in LVCSR systems.
Proceedings of the IEEE International Conference on Acoustics, 2011

An efficient method for the unsupervised discovery of signalling motifs in large audio streams.
Proceedings of the 9th International Workshop on Content-Based Multimedia Indexing, 2011

2010
Morpho-syntactic post-processing of N-best lists for improved French automatic speech recognition.
Comput. Speech Lang., 2010

INRIA LEAR-TEXMEX: Video Copy Detection Task.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Utilisation de relations sémantiques pour améliorer la segmentation thématique de documents télévisuels.
Proceedings of the Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2010

Improving ASR-based topic segmentation of TV programs with confidence measures and semantic relations.
Proceedings of the INTERSPEECH 2010, 2010

CRF-based combination of contextual features to improve a posteriori word-level confidence measures.
Proceedings of the INTERSPEECH 2010, 2010

Reshaping automatic speech transcripts for robust high-level spoken document analysis.
Proceedings of the Fourth Workshop on Analytics for Noisy Unstructured Text Data, 2010

2009
Model-based similarity estimation of multidimensional temporal sequences.
Ann. des Télécommunications, 2009

Variability Tolerant Audio Motif Discovery.
Proceedings of the Advances in Multimedia Modeling, 2009

Can Automatic Speech Transcripts Be Used for Large Scale TV Stream Description and Structuring?
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009

Audio keyword extraction by unsupervised word discovery.
Proceedings of the INTERSPEECH 2009, 2009

Constraint selection for topic-based MDI adaptation of language models.
Proceedings of the INTERSPEECH 2009, 2009

The ester 2 evaluation campaign for the rich transcription of French radio broadcasts.
Proceedings of the INTERSPEECH 2009, 2009

Speaker adaptation by variable reference model subspace and application to large vocabulary speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Automatic alignment and phonetic studies: Comparing alignment systems for the analysis of the schwa.
Trait. Autom. des Langues, 2008

Audiovisual integration with Segment Models for tennis video parsing.
Comput. Vis. Image Underst., 2008

Un modèle multi-sources pour la segmentation en sujets de journaux radiophoniques.
Proceedings of the Actes de la 15ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2008

On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Morphosyntactic Resources for Automatic Speech Recognition.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Structure learning in a Bayesian network-based video indexing framework.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Generalized driven decoding for speech recognition system combination.
Proceedings of the IEEE International Conference on Acoustics, 2008

An unsupervised web-based topic language model adaptation method.
Proceedings of the IEEE International Conference on Acoustics, 2008

Toward the Integration of Natural Language Processing and Automatic Speech Recognition: Using Morpho-Syntax and Pragmatics for Transcription.
Proceedings of the Multimodal Processing and Interaction, Audio, Video, Text, 2008

Stochastic Models for Multimodal Video Analysis.
Proceedings of the Multimodal Processing and Interaction, Audio, Video, Text, 2008

2007
Towards Phonetically-Driven Hidden Markov Models: Can We Incorporate Phonetic Landmarks in HMM-Based ASR?
Proceedings of the Advances in Nonlinear Speech Processing, 2007

Rapid speaker adaptation by reference model interpolation.
Proceedings of the INTERSPEECH 2007, 2007

Morphosyntactic processing of n-best lists for improved recognition and confidence measure computation.
Proceedings of the INTERSPEECH 2007, 2007

Estimation de similarité entre séquences de descripteurs à l'aide de machines à vecteurs supports.
Proceedings of the 23èmes Journées Bases de Données Avancées, 2007

2006
Experiments in audio source separation with one sensor for robust speech recognition.
Speech Commun., 2006

Audiovisual integration for tennis broadcast structuring.
Multim. Tools Appl., 2006

Utilisation de la linguistique en reconnaissance de la parole : un état de l'art
CoRR, 2006

Are Morphosyntactic Taggers Suitable to Improve Automatic Transcription?
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Score oriented Viterbi search in sport video structuring using HMM and segment models.
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Fast Structuring of Large Television Streams Using Program Guides.
Proceedings of the Adaptive Multimedia Retrieval: User, 2006

2005
Experiments on speaker tracking and segmentation in radio broadcast news.
Proceedings of the INTERSPEECH 2005, 2005

The ESTER phase II evaluation campaign for the rich transcription of French broadcast news.
Proceedings of the INTERSPEECH 2005, 2005

A model space framework for efficient speaker detection.
Proceedings of the INTERSPEECH 2005, 2005

Multimodal Segmental-Based Modeling of Tennis Video Broadcasts.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004
A Tutorial on Text-Independent Speaker Verification.
EURASIP J. Adv. Signal Process., 2004

Enhancing the robustness of Bayesian methods for text-independent automatic speaker verification.
Proceedings of the ODYSSEY 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Tennis video abstraction from audio and visual cues.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

The ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Speaker diarization using bottom-up clustering based on a parameter-derived distance between adapted GMMs.
Proceedings of the INTERSPEECH 2004, 2004

Multiple events tracking in sound tracks.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

2003
Recent advances in the automatic recognition of audiovisual speech.
Proc. IEEE, 2003

Audio source separation with one sensor for robust speech recognition.
Proceedings of the ITRW on Non-Linear Speech Processing, 2003

HMM based structuring of tennis videos using visual and audio cues.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

2002
Maximum entropy and MCE based HMM stream weight estimation for audio-visual ASR.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Overview of the 2000-2001 ELISA Consortium research activities.
Proceedings of the 2001: A Speaker Odyssey, 2001

Integrating contextual phonological rules in a large vocabulary decoder.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
On the Use of Prior Knowledge in Normalization Schemes for Speaker Verification.
Digit. Signal Process., 2000

Résumés de thèse.
Ann. des Télécommunications, 2000

A further investigation on speech features for speaker characterization.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Speech modeling with state constrained Markov fields over frequency bands.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A Markov Random Field Model for Automatic Speech Recognition.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

A Markov random field based multi-band model.
Proceedings of the IEEE International Conference on Acoustics, 2000

1998
Toward Markov random field modeling of speech.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
Optimal state dependent spectral representation for HMM modeling : a new theoretical framework.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Model dependent spectral representations for speaker recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Combining methods to improve speaker verification decision.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996


  Loading...