Saumitra Yadav

ORCID: 0009-0002-1859-3353

According to our database, Saumitra Yadav authored at least 8 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2026
Get away with less: Need of source side data curation to build parallel corpus for low resource Machine Translation.
CoRR, January, 2026

2025
Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance.
CoRR, November, 2025

A Preliminary Exploration of Phrase-Based SMT and Multi-BPE Segmentations through Concatenated Tokenised Corpora for Low-Resource Indian Languages.
Proceedings of the Tenth Conference on Machine Translation, 2025

Why should only High-Resource-Languages have all the fun? Pivot Based Evaluation in Low Resource Setting.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
A3-108 Controlling Token Generation in Low Resource Machine Translation Systems.
Proceedings of the Ninth Conference on Machine Translation, 2024

CoST of breaking the LLMs.
Proceedings of the Ninth Conference on Machine Translation, 2024

2021
A3-108 Machine Translation System for Similar Language Translation Shared Task 2021.
Proceedings of the Sixth Conference on Machine Translation, 2021

2020
A3-108 Machine Translation System for Similar Language Translation Shared Task 2020.
Proceedings of the Fifth Conference on Machine Translation, 2020
