Marcely Zanon Boito

Orcid: 0000-0003-0134-6719

According to our database, Marcely Zanon Boito authored at least 23 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts.
CoRR, 2023

LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech.
CoRR, 2023

NAVER LABS Europe's Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

2022
Speech Resources in the Tamasheq Language.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022


A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems.
Proceedings of the Interspeech 2022, 2022

2021
Models and resources for attention-based unsupervised word segmentation: an application to computational language documentation. (Modèles et ressources pour la segmentation non supervisée des mots basée sur l'attention).
PhD thesis, 2021

Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings.
CoRR, 2021

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
CoRR, 2021

Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Investigating alignment interpretability for low-resource NMT.
Mach. Transl., 2020

Investigating Language Impact in Bilingual Approaches for Computational Language Documentation.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task.
CoRR, 2019

How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages.
CoRR, 2019

Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-Resource Settings.
Proceedings of the Interspeech 2019, 2019

2018
A Small Griko-Italian Speech Translation Corpus.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Unsupervised Word Segmentation from Speech with Attention.
Proceedings of the Interspeech 2018, 2018

2017
Unwritten languages demand attention too! Word discovery with encoder-decoder models.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2014
Size Does Not Matter. Frequency Does. A Study of Features for Measuring Lexical Complexity.
Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2014, 2014

