Jannis Vamvas

Orcid: 0009-0002-1821-1837

According to our database1, Jannis Vamvas authored at least 30 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
RUMLEM: A Dictionary-Based Lemmatizer for Romansh.
CoRR, April, 2026

Translation Asymmetry in LLMs as a Data Augmentation Factor: A Case Study for 6 Romansh Language Varieties.
CoRR, March, 2026

Robust Language Identification for Romansh Varieties.
CoRR, March, 2026

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning.
CoRR, March, 2026

The Mediomatix Corpus: Parallel Data for Romansh Language Varieties via Comparable Schoolbooks.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025
SwissGov-RSD: A Human-annotated, Cross-lingual Benchmark for Token-level Recognition of Semantic Differences Between Related Documents.
CoRR, December, 2025

QueST: Incentivizing LLMs to Generate Difficult Problems.
CoRR, October, 2025

The Mediomatix Corpus: Parallel Data for Romansh Idioms via Comparable Schoolbooks.
CoRR, August, 2025

20min-XD: A Comparable Corpus of Swiss News Articles.
CoRR, April, 2025

Expanding the WMT24++ Benchmark with Rumantsch Grischun, Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader.
Proceedings of the Tenth Conference on Machine Translation, 2025

UZH at SemEval-2025 Task 3: Token-Level Self-Consistency for Hallucination Detection.
Proceedings of the 19th International Workshop on Semantic Evaluation, 2025

Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Machine Translation Models are Zero-Shot Detectors of Translation Direction.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Leveraging In-Context Learning for Political Bias Testing of LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents.
CoRR, 2024

Linear-time Minimum Bayes Risk Decoding with Reference Aggregation.
CoRR, 2024

Modular Adaptation of Multilingual Encoders to Written Swiss German Dialect.
CoRR, 2024

Thesis: Model-based Evaluation of Multilinguality.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models.
CoRR, 2023

Towards Unsupervised Recognition of Semantic Differences in Related Documents.
CoRR, 2023

SwissBERT: The Multilingual Language Model for Switzerland.
CoRR, 2023

Trained MT Metrics Learn to Cope with Machine-translated References.
Proceedings of the Eighth Conference on Machine Translation, 2023

Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
NMTScore: A Multilingual Analysis of Translation-based Text Similarity Measures.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

2021
Contrastive Conditioning for Assessing Disambiguation in MT: A Case Study of Distilled Bias.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

On the Limits of Minimal Pairs in Contrastive Evaluation.
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

Proceedings of the ACL-IJCNLP 2021 Student Research Workshop.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

2020
X -stance: A Multilingual Multi-Target Dataset for Stance Detection.
Proceedings of the 5th Swiss Text Analytics Conference and the 16th Conference on Natural Language Processing, 2020


  Loading...