Laurent Besacier

Orcid: 0000-0001-7411-9125

According to our database1, Laurent Besacier authored at least 282 papers between 1997 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech.
Comput. Speech Lang., 2024

Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond.
CoRR, 2024

mHuBERT-147: A Compact Multilingual HuBERT Model.
CoRR, 2024

ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models.
CoRR, 2024

2023
Encoding Sentence Position in Context-Aware Neural Machine Translation with Concatenation.
CoRR, 2023

2022
A Textless Metric for Speech-to-Speech Comparison.
CoRR, 2022

Focused Concatenation for Context-Aware Neural Machine Translation.
Proceedings of the Seventh Conference on Machine Translation, 2022

BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ASR-Generated Text for Language Model Pre-training Applied to Speech Tasks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

What Do Compressed Multilingual Machine Translation Models Forget?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Fashioning Local Designs from Generic Speech Technologies in an Australian Aboriginal Community.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Weakly Supervised Word Segmentation for Computational Language Documentation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Divide and Rule: Effective Pre-Training for Context-Aware Multi-Encoder Translation Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Learning From Failure: Data Capture in an Australian Aboriginal Community.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Joint source-target encoding with pervasive attention.
Mach. Transl., 2021

On the Evaluation of Machine Translation for Terminology Consistency.
CoRR, 2021

Spoken Term Detection Methods for Sparse Transcription in Very Low-resource Settings.
CoRR, 2021

Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings.
CoRR, 2021

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
CoRR, 2021

Divide and Rule: Training Context-Aware Multi-Encoder Translation Models with Little Resources.
CoRR, 2021

Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis.
CoRR, 2021

Findings of the WMT Shared Task on Machine Translation Using Terminologies.
Proceedings of the Sixth Conference on Machine Translation, 2021

Contribution d'informations syntaxiques aux capacités de généralisation compositionelle des modèles seq2seq convolutifs (Assessing the Contribution of Syntactic Information for Compositional Generalization of seq2seq Convolutional Networks).
Proceedings of the Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2021

Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

<i>LeBenchmark</i>: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Empirical Study of End-To-End Simultaneous Speech Translation Decoding Strategies.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multilingual Unsupervised Neural Machine Translation with Denoising Adapters.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Controlling Prosody in End-to-End TTS: A Case Study on Contrastive Focus Generation.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Lightweight Adapter Tuning for Multilingual Speech Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads?
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Speech Technology for Unwritten Languages.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Investigating alignment interpretability for low-resource NMT.
Mach. Transl., 2020

Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English.
CoRR, 2020

Modeling ASR Ambiguity for Dialogue State Tracking Using Word Confusion Networks.
CoRR, 2020

ConfNet2Seq - Full Length Answer Generation from Spoken Questions.
Proceedings of the Text, Speech, and Dialogue, 2020

FlauBERT : des modèles de langue contextualisés pré-entraînés pour le français (FlauBERT : Unsupervised Language Model Pre-training for French).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Pratiques d'évaluation en ASR et biais de performance (Evaluation methodology in ASR and performance bias).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Représentation du genre dans des données open source de parole (Gender representation in open source speech resources 1 With the rise of artificial intelligence (AI) and the growing use of deep-learning architectures, the question of ethics and transparency in AI systems has become a central concern within the research community).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Investigating Language Impact in Bilingual Approaches for Computational Language Documentation.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

FlauBERT: Unsupervised Language Model Pre-training for French.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Gender Representation in Open Source Speech Resources.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Modeling ASR Ambiguity for Neural Dialogue State Tracking.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Investigating Self-Supervised Pre-Training for End-to-End Speech Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Efficient Wait-k Models for Simultaneous Machine Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The Zero Resource Speech Challenge 2020: Discovering Discrete Subword and Word Units.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Data Efficient End-to-End Spoken Language Understanding Architecture.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Monolingual Adapters for Zero-Shot Neural Machine Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Enabling Interactive Transcription in an Indigenous Community.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages.
Nat. Lang. Eng., 2019

Character-based NMT with Transformer.
CoRR, 2019

ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task.
CoRR, 2019

How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages.
CoRR, 2019

Gender Representation in French Broadcast Corpora and Its Impact on ASR Performance.
Proceedings of the 1st International Workshop on AI for Smart TV Content Production, 2019

The LIG system for the English-Czech Text Translation Task of IWSLT 2019.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Controlling Utterance Length in NMT-based Word Segmentation with Attention.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

The Zero Resource Speech Challenge 2019: TTS Without T.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-Resource Settings.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Models of Visually Grounded Speech Signal Pay Attention to Nouns: A Bilingual Experiment on English and Japanese.
Proceedings of the IEEE International Conference on Acoustics, 2019

Naver Labs Europe's Systems for the Document-Level Generation and Translation Task at WNGT 2019.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Word Recognition, Competition, and Activation in a Model of Visually Grounded Speech.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

2018
Automatic quality estimation for speech translation using joint ASR and MT features.
Mach. Transl., 2018

A Small Griko-Italian Speech Translation Corpus.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Parallel Corpora in Mboshi (Bantu C25, Congo-Brazzaville).
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Unsupervised Word Segmentation from Speech with Attention.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Automatic Recognition of Affective Laughter in Spontaneous Dyadic Interactions from Audiovisual Signals.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bayesian Models for Unit Discovery on a Very Low Resource Language.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

ASR Performance Prediction on Unseen Broadcast Programs Using Convolutional Neural Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

End-to-End Automatic Speech Translation of Audiobooks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Analyzing Learned Representations of a Deep ASR Performance Prediction Model.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Unsupervised Learning of Word Segmentation: Does Tone Matter?
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2018

Token-level and sequence-level loss smoothing for RNN language models.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Exploring Textual and Speech information in Dialogue Act Classification with Speaker Domain Adaptation.
Proceedings of the Australasian Language Technology Association Workshop 2018, 2018

2017
Find the errors, get the better: Enhancing machine translation via word confidence estimation.
Nat. Lang. Eng., 2017

SPEECH-COCO: 600k Visually Grounded Spoken Captions Aligned to MSCOCO Data Set.
CoRR, 2017

UsingWord Embedding for Cross-Language Plagiarism Detection.
CoRR, 2017

LIG-CRIStAL System for the WMT17 Automatic Post-Editing Task.
CoRR, 2017

LIG-CRIStAL Submission for the WMT 2017 Automatic Post-Editing Task.
Proceedings of the Second Conference on Machine Translation, 2017

Traitement des Mots Hors Vocabulaire pour la Traduction Automatique de Document OCRisés en Arabe (This article presents a new system that automatically translates images of arabic documents).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles, 2017

CompiLIG at SemEval-2017 Task 1: Cross-Language Plagiarism Detection Methods for Semantic Textual Similarity.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Disentangling ASR and MT Errors in Speech Translation.
Proceedings of Machine Translation Summit XVI, Volume 1: Research Track, 2017

Machine Assisted Analysis of Vowel Length Contrasts in Wolof.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A Corpus for Amharic-English Speech Translation: The Case of Tourism Domain.
Proceedings of the Information and Communication Technology for Development for Africa, 2017

Amharic-English Speech Translation in Tourism Domain.
Proceedings of the Workshop on Speech-Centric Natural Language Processing, 2017

Using Word Embedding for Cross-Language Plagiarism Detection.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

The zero resource speech challenge 2017.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Unwritten languages demand attention too! Word discovery with encoder-decoder models.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Deep Investigation of Cross-Language Plagiarism Detection Methods.
Proceedings of the 10th Workshop on Building and Using Comparable Corpora, 2017

2016
Naming multi-modal clusters to identify persons in TV broadcast.
Multim. Tools Appl., 2016

A unified framework for translation and understanding allowing discriminative joint decoding for multilingual speech semantic interpretation.
Comput. Speech Lang., 2016

Automatic Quality Assessment for Speech Translation Using Joint ASR and MT Features.
CoRR, 2016

Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation.
CoRR, 2016

Projection Interlingue d'Étiquettes pour l'Annotation Sémantique Non Supervisée (Cross-lingual Annotation Projection for Unsupervised Semantic Tagging).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 2 : TALN (Articles longs), 2016

Word2Vec vs DBnary ou comment (ré)concilier représentations distribuées et réseaux lexico-sémantiques ? Le cas de l'évaluation en traduction automatique (Word2Vec vs DBnary or how to bring back together vector representations and lexical resources ? A case study for machine translation evaluation).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 2 : TALN (Articles longs), 2016

Automatic Speech Recognition for African Languages with Vowel Length Contrast.
Proceedings of the SLTU-2016, 2016

Parallel Speech Collection for Under-resourced Language Studies Using the Lig-Aikuma Mobile Device App.
Proceedings of the SLTU-2016, 2016


Deep complementary features for speaker identification in TV broadcast data.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Joint ASR and MT Features for Quality Estimation in Spoken Language Translation.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Better Evaluation of ASR in Speech Translation Context Using Word Embeddings.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Preliminary Experiments on Unsupervised Word Discovery in Mboshi.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speed Perturbation and Vowel Duration Modeling for ASR in Hausa and Wolof Languages.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Lig-Aikuma: A Mobile App to Collect Parallel Speech for Under-Resourced Language Studies.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

OCR-aided person annotation and label propagation for speaker modeling in TV shows.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

First Automatic Fongbe Continuous Speech Recognition System: Development of Acoustic Models and Language Models.
Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, 2016

Inducing Multilingual Text Analysis Tools Using Bidirectional Recurrent Neural Networks.
Proceedings of the COLING 2016, 2016

Word2Vec vs DBnary: Augmenting METEOR using Vector Representations or Lexical Resources?
Proceedings of the COLING 2016, 2016

2015
Unsupervised Speaker Identification in TV Broadcast Based on Written Names.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Towards accurate predictors of word quality for Machine Translation: Lessons learned on French-English and English-Spanish systems.
Data Knowl. Eng., 2015

Utilisation des réseaux de neurones récurrents pour la projection interlingue d'étiquettes morpho-syntaxiques à partir d'un corpus parallèle.
Proceedings of the Actes de la 22e conference sur le Traitement Automatique des Langues Naturelles. Articles courts, 2015

Utilisation de mesures de confiance pour améliorer le décodage en traduction de parole.
Proceedings of the Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2015

Merging of Native and Non-native Speech for Low-resource Accented ASR.
Proceedings of the Statistical Language and Speech Processing, 2015

Unsupervised and Lightly Supervised Part-of-Speech Tagging Using Recurrent Neural Networks.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015

METEOR for multiple target languages using DBnary.
Proceedings of Machine Translation Summit XV: Papers, 2015

LIG at MediaEval 2015 Multimodal Person Discovery in Broadcast TV Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

An open-source toolkit for word-level confidence estimation in machine translation.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Using resources from a closely-related language to develop ASR for a very under-resourced language: a case study for iban.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Collaborative annotation for person identification in TV shows.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Speech technologies for african languages: example of a multilingual calculator for education.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Automated Translation of a Literary Work: a Pilot Study.
Proceedings of the Fourth Workshop on Computational Linguistics for Literature, 2015

Spoken language translation graphs re-decoding using automatic quality assessment.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Phoneme-based English-Amharic Statistical Machine Translation.
Proceedings of the AFRICON 2015, Addis Ababa, Ethiopia, September 14-17, 2015, 2015

2014
Préface.
Trait. Autom. des Langues, 2014

Using different acoustic, lexical and language modeling units for ASR of an under-resourced language - Amharic.
Speech Commun., 2014

SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian.
Speech Commun., 2014

Automatic speech recognition for under-resourced languages: A survey.
Speech Commun., 2014

Introduction to the special issue on processing under-resourced languages.
Speech Commun., 2014

Nommage non supervisé des personnes dans les émissions de télévision. Utilisation des noms écrits, des noms prononcés ou des deux ?
Document Numérique, 2014

LIG System for Word Level QE task at WMT14.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

RecordMe: A Smartphone Application for Experimental Collections of Large Amount of Data Respecting Volunteer's Privacy.
Proceedings of the Ubiquitous Computing and Ambient Intelligence. Personalisation and User Adapted Services, 2014

Machine translation for litterature: a pilot study (Traduction automatisée d'une oeuvre littéraire: une étude pilote) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

Semi-supervised G2p bootstrapping and its application to ASR for a very under-resourced language: Iban.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Using closely-related language to build an ASR for a very under-resourced language: Iban.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Word confidence estimation for speech translation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Active selection with label propagation for minimizing human effort in speaker annotation of TV shows.
Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014

Acoustic model merging using acoustic models from multilingual speakers for automatic speech recognition.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

An efficient two-pass decoder for SMT using word confidence estimation.
Proceedings of the 17th Annual conference of the European Association for Machine Translation, 2014

Word Confidence Estimation for SMT N-best List Re-ranking.
Proceedings of the Workshop on Humans and Computer-assisted Translation, 2014

Automatic propagation of manual annotations for multimodal person identification in TV shows.
Proceedings of the 12th International Workshop on Content-Based Multimedia Indexing, 2014

Data selection for compact adapted SMT models.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

2013
Comparison and Combination of Lightly Supervised Approaches for Language Portability of a Spoken Language Understanding System.
IEEE Trans. Speech Audio Process., 2013

LIG System for WMT13 QE Task: Investigating the Usefulness of Features in Word Confidence Estimation for MT.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Driven Decoding for machine translation (Vers un décodage guidé pour la traduction automatique) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2013

Discriminative statistical approaches for multilingual speech understanding (Approches statistiques discriminantes pour l'interprétation sémantique multilingue de la parole) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2013

Generalization of Discriminative Approaches for Speech Language Understanding in a Multilingual Context.
Proceedings of the Statistical Language and Speech Processing, 2013

Statistical Error Correction Methods for Domain-Specific ASR Systems.
Proceedings of the Statistical Language and Speech Processing, 2013

Word Confidence Estimation and Its Integration in Sentence Quality Estimation for Machine Translation.
Proceedings of the Knowledge and Systems Engineering, 2013

Unsupervised naming of speakers in broadcast TV: using written names, pronounced names or both?
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Towards a Better Integration of Written Names for Unsupervised Speakers Identification in Videos.
Proceedings of the First Workshop on Speech, 2013


Nommage non-supervisé des personnes dans les émissions de télévision : une revue du potentiel de chaque modalité.
Proceedings of the CORIA 2013 - Conférence en Recherche d'Infomations et Applications, 2013

2012
Robustesse et portabilités multilingue et multi-domaines des systèmes de compréhension de la parole : les corpus du projet PortMedia (Robustness and portability of spoken language understanding systems among languages and domains : the PORTMEDIA project) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Développement de ressources en swahili pour un sytème de reconnaisance automatique de la parole (Developments of Swahili resources for an automatic speech recognition system) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Preliminary experiments on English-Amharic statistical machine translation.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Syllable-based and hybrid acoustic models for Amharic speech recognition.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Pronunciation learning system for the 32 vowel system of Nasa Yuwe language.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Developments of Swahili resources for an automatic speech recognition system.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Collection of a Large Database of French-English SMT Output Corrections.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Towards a better understanding of statistical post-edition usefulness.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The LIG English to French machine translation system for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Portability of Semantic Annotations for Fast Development of Dialogue Corpora.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

From Text Detection in Videos to Person Identification.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Analysis of Malay Speech Recognition for Different Speaker Origins.
Proceedings of the 2012 International Conference on Asian Language Processing, 2012

ASR domain adaptation methods for low-resourced languages: Application to Romanian language.
Proceedings of the 20th European Signal Processing Conference, 2012

Fusion of Speech, Faces and Text for Person Identification in TV Broadcast.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

2011
The LIGA (LIG/LIA) Machine Translation System for WMT 2011.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Comparaison et combinaison d'approches pour la portabilité vers une nouvelle langue d'un système de compréhension de l'oral (Comparison and combination of approaches for the portability to a new language of an oral comprehension system).
Proceedings of the Actes de la 18e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2011

LIG English-French spoken language translation system for IWSLT 2011.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Speech Modulation Features for Robust Nonnative Speech Accent Detection.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Quality Assessment of Crowdsourcing Transcriptions for African Languages.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Combination of stochastic understanding and machine translation systems for language portability of dialogue systems.
Proceedings of the IEEE International Conference on Acoustics, 2011

Mining Parallel Data from Comparable Corpora via Triangulation.
Proceedings of the International Conference on Asian Language Processing, 2011

Oracle-based Training for Phrase-based Statistical Machine Translation.
Proceedings of the 15th Annual conference of the European Association for Machine Translation, 2011

Text detection and recognition for person identification in videos.
Proceedings of the 9th International Workshop on Content-Based Multimedia Indexing, 2011

Investigating the role of machine translated text in ASR domain adaptation: Unsupervised and semi-supervised methods.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet.
Multim. Tools Appl., 2010

Recherche par le contenu dans des documents audiovisuels multilingues.
Document Numérique, 2010

The LIG Machine Translation System for WMT 2010.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Weak Translation Problems - a case study of Scriptural Translation.
Proceedings of the Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, 2010

Apprentissage non supervisé pour la traduction automatique : application à un couple de langues peu doté.
Proceedings of the Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2010

Voice aided input for phrase selection using a low level ASR approach - application to French and Khmer phrasebooks.
Proceedings of the 2nd Workshop on Spoken Language Technologies for Under-Resourced Languages, 2010

Autonomous acoustic model adaptation for multilingual meeting transcription involving high- and low-resourced languages.
Proceedings of the 2nd Workshop on Spoken Language Technologies for Under-Resourced Languages, 2010

Unsupervised SMT for a low-resourced language pair.
Proceedings of the 2nd Workshop on Spoken Language Technologies for Under-Resourced Languages, 2010

Automatic Identification of Arabic Dialects.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Improved Vietnamese-French parallel corpus mining using English language.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

LIG statistical machine translation systems for IWSLT 2010.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Unsupervised acoustic model adaptation for multi-origin non native ASR.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Investigating multiple approaches for SLU portability to a new language.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A fully unsupervised approach for mining parallel data from comparable corpora.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

2009
Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language.
IEEE Trans. Speech Audio Process., 2009

Mining a Comparable Text Corpus for a Vietnamese-French Statistical Machine Translation System.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Segmentation multiple d'un flux de données textuelles pour la modélisation statistique du langage.
Proceedings of the Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, 2009

Exploitation d'un corpus bilingue pour la création d'un système de traduction probabiliste Vietnamien - Français.
Proceedings of the Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2009

LIG approach for IWSLT09.
Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2009, 2009

LIG approach for IWSLT09 : using multiple morphological segmenters for spoken language translation of Arabic.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

Human translations guided language discovery for ASR systems.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Multiple text segmentation for statistical language modeling.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

The LIG multi-criteria system for video retrieval.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

A Hybrid Model for Urdu Hindi Transliteration.
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, 2009

2008
Which units for acoustic and language modeling for Khmer automatic speech recognition?
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Recent advances in automatic speech recognition for vietnamese.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

First Broadcast News Transcription System for Khmer Language.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

The LIG Arabic/English speech translation system at IWSLT08.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

Improving pronunciation modeling for non-native speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Feature adaptation of hearing-impaired lip shapes: the vowel case in the cued speech context.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Word/sub-word lattices decomposition and combination for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
The LIG Arabic/English speech translation system at IWSLT07.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007

Modeling context and language variation for non-native speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Automatic question detection: prosodic-lexical features and crosslingual experiments.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A HMM recognition of consonant-vowel syllables from lip contours: the cued speech case.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

On Efficient Coupling of ASR and SMT for Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Acoustic Model Interpolation for Non-Native Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Information Extraction From Sound for Medical Telemonitoring.
IEEE Trans. Inf. Technol. Biomed., 2006

Step-by-step and integrated approaches in broadcast news speaker diarization.
Comput. Speech Lang., 2006

Towards speech Translation of Non Written Languages.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

A French Non-Native Corpus for Automatic Speech Recognition.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

On the use of morphological analysis for dialectal Arabic speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Characterization of cued speech vowels from the inner lip contour.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Acoustic-Phonetic Unit Similarities For Context Dependent Acoustic Model Portability.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

ASR and Translation for Under-Resourced Languages.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Hand and Lip Desynchronization Analysis in French Cued Speech: Automatic Temporal Segmentation of Hand Flow.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
NIST RT'05S Evaluation: Pre-processing Techniques and Speaker Diarization on Multiple Microphone Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

A speaker independent "liveness" test for audio-visual biometrics.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Audio, video and audio-visual signatures for short video clip detection: experiments on Trecvid2003.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

First Steps in Fast Acoustic Modeling for a New Target Language: Application to Vietnamese.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Voice Biometrics over the Internet in the Framework of COST Action 275.
EURASIP J. Adv. Signal Process., 2004

CLIPS-LIS-LSR-LABRI Experiments in TRECVID 2004 (DRAFT).
Proceedings of the 2004 TREC Video Retrieval Evaluation, 2004

Modèle de langage sémantique pour la reconnaissance automatique de parole dans un contexte de traduction.
Proceedings of the Actes de la 11ème conférence sur le Traitement Automatique des Langues Naturelles. Posters, 2004

Traduction de dialogue: résultats du projet NESPOLE! et pistes pour le domaine.
Proceedings of the Actes de la 11ème conférence sur le Traitement Automatique des Langues Naturelles. Posters, 2004

Interchange Format-based Language Model for Automatic Speech Recognition in Speech-to-Speech Translation.
Proceedings of the Actes de la Deuxieme Conference Internationale Associant Chercheurs Vietnamiens et Francophones en Informatique, 2004

ELISA nist RT03 broadcast news speaker diarization experiments.
Proceedings of the Odyssey 2004: The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Using a priori information for speaker diarization.
Proceedings of the Odyssey 2004: The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Video story segmentation with multi-modal features: experiments on TRECvid 2003.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

Spoken and Written Language Resources for Vietnamese.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Spoken dialogue translation systems evaluation: results, new trends, problems and proposals.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

The ELISA consortium approaches in broadcast news speaker segmentation during the NIST 2003 rich transcription evaluation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Benefits of prior acoustic segmentation for automatic speaker segmentation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
CLIPS at TRECVID : Shot Boundary Detection and Feature Detection.
Proceedings of the 2003 TREC Video Retrieval Evaluation, 2003

Life Sounds Extraction and Classification in Noisy Environment.
Proceedings of the Signal and Image Processing (SIP 2003), 2003

Nonlinear acoustical preprocessing for multiple sampling rates ASR and ASR in noisy condition.
Proceedings of the ITRW on Non-Linear Speech Processing, 2003

The NESPOLE! voIP multilingual corpora in tourism and medical domains.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Using the web for fast language model construction in minority languages.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

The ELISA consortium approaches in speaker segmentation during the NIST 2002 speaker recognition evaluation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
CLIPS at TREC 11: Experiments in Video Retrieval.
Proceedings of The Eleventh Text REtrieval Conference, 2002

Speech-to-speech translation system evaluation: results for French for the NESPOLE! project first showcase.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Recovering of packet loss for Distributed Speech Recognition.
Proceedings of the 11th European Signal Processing Conference, 2002

2001
The effect of speech and audio compression on speech recognition performance.
Proceedings of the Fourth IEEE Workshop on Multimedia Signal Processing, 2001

The nespole! voIP dialogue database.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speech translation for French in the NESPOLE! European project.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Localization and selection of speaker-specific information with statistical modeling.
Speech Commun., 2000

Subband architecture for automatic speaker recognition.
Signal Process., 2000

A New Methodology for Speech Corpora Definition from Internet Documents.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

GSM speech coding and speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

Influence of GSM speech coding on the performance of text-independent speaker recognition.
Proceedings of the 10th European Signal Processing Conference, 2000

Automatic sound detection and recognition for noisy environment.
Proceedings of the 10th European Signal Processing Conference, 2000

1999
Experimental evaluation of text-independent speaker verification on laboratory and field test databases in the M2VTS project.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999


1998
Time and frequency pruning for speaker identification.
Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998

Frame pruning for speaker recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Frame pruning for automatic speaker identification.
Proceedings of the 9th European Signal Processing Conference, 1998

1997
Subband Approach for Automatic Speaker Recognition: Optimal Division of the Frequency Domain.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 1997


  Loading...