David Dale

According to our database, David Dale authored at least 18 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Towards Red Teaming in Multimodal and Multilingual Translation.
CoRR, 2024

MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector.
CoRR, 2024

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation.
CoRR, 2023

SpeechAlign: a Framework for Speech Translation Alignment Evaluation.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

Don't Lose the Message While Paraphrasing: A Study on Content Preserving Style Transfer.
Proceedings of the Natural Language Processing and Information Systems, 2023

Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
The first neural machine translation system for the Erzya language.
CoRR, 2022

Studying the Role of Named Entities for Content Preservation in Text Style Transfer.
Proceedings of the Natural Language Processing and Information Systems, 2022

ParaDetox: Detoxification with Parallel Data.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

A large-scale computational study of content preservation measures for text style transfer and paraphrase generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022

2021
Methods for Detoxification of Texts for the Russian Language.
Multimodal Technol. Interact., 2021

SkoltechNLP at SemEval-2021 Task 5: Leveraging Sentence-level Pre-training for Toxic Span Detection.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

Text Detoxification using Large Pre-trained Neural Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification.
Proceedings of the 2nd Crowd Science Workshop: Trust, 2021

