Aleksei Dorkin

Orcid: 0000-0002-5427-5232

According to our database, Aleksei Dorkin authored at least 10 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2026
EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post-Training.
CoRR, March, 2026

2025
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation.
CoRR, November, 2025

Prune or Retrain: Optimizing the Vocabulary of Multilingual Models for Estonian.
CoRR, January, 2025

TartuNLP at SemEval-2025 Task 5: Subject Tagging as Two-Stage Information Retrieval.
Proceedings of the 19th International Workshop on Semantic Evaluation, 2025

GliLem: Leveraging GliNER for Contextualized Lemmatization in Estonian.
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies, 2025

2024
TartuNLP at EvaLatin 2024: Emotion Polarity Detection.
CoRR, 2024

Sõnajaht: Definition Embeddings and Semantic Search for Reverse Dictionary Creation.
Proceedings of the 13th Joint Conference on Lexical and Computational Semantics, 2024

TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages.
Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, 2024

TartuNLP @ AXOLOTL-24: Leveraging Classifier Output for New Sense Detection in Lexical Semantics.
Proceedings of the 5th Workshop on Computational Approaches to Historical Language Change, 2024

2023
Comparison of Current Approaches to Lemmatization: A Case Study in Estonian.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023
