Simon Hengchen

Orcid: 0000-0002-8453-7221

According to our database, Simon Hengchen authored at least 23 papers between 2016 and 2026.

Collaborative distances:

Dijkstra number of five.
Erdős number of four.

Bibliography

2026

Aladdin-FTI @ AMIYA Three Wishes for Arabic NLP: Fidelity, Diglossia, and Multidialectal Generation.

Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects, 2026

2025

Arabizi vs LLMs: Can the Genie Understand the Language of Aladdin?

Perla Al Almaoui

Pierrette Bouillon

Simon Hengchen

Proceedings of Machine Translation Summit XX: MTSummit 2025 - Volume 2, 2025

2024

Detection of Non-recorded Word Senses in English and Swedish.

Jonathan Lautenschlager

Emma Sköldberg

Simon Hengchen

Dominik Schlechtweg

CoRR, 2024

2023

Superlim: A Swedish Language Understanding Evaluation Benchmark.

Aleksandrs Berdicevskis

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2021

A data-driven approach to studying changing vocabularies in historical newspaper collections.

Digit. Scholarsh. Humanit., 2021

Lexical semantic change for Ancient Greek and Latin.

CoRR, 2021

Challenges for Computational Lexical Semantic Change.

CoRR, 2021

SuperSim: a test set for word similarity and relatedness in Swedish.

Simon Hengchen

Nina Tahmasebi

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish.

Quan Duong

Mika Hämäläinen

Simon Hengchen

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages.

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Data for "Dataset for Temporal Analysis of English-French Cognates".

Dataset, March, 2020

SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection.

Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Dataset for Temporal Analysis of English-French Cognates.

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Topic Modelling Discourse Dynamics in Historical Newspapers.

Proceedings of the Post-Proceedings of the 5th Conference Digital Humanities in the Nordic Countries (DHN 2020), 2020

2019

A computational approach to lexical polysemy in Ancient Greek.

Digit. Scholarsh. Humanit., 2019

Quantifying the impact of dirty OCR on historical text analysis: Eighteenth Century Collections Online as a case study.

Mark J. Hill

Simon Hengchen

Digit. Scholarsh. Humanit., 2019

From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction.

Mika Hämäläinen

Simon Hengchen

Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Time-Out: Temporal Referencing for Robust Modeling of Lexical Semantic Change.

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

GASC: Genre-Aware Semantic Change for Ancient Greek.

Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change, 2019

2017

Semantic Enrichment of a Multilingual Archive with Linked Open Data.

Max De Wilde

Simon Hengchen

Digit. Humanit. Q., 2017

Text Mining for User Query Analysis - A 5-Step Method for Cultural Heritage Institutions.

Anne Chardonnens

Simon Hengchen

Proceedings of the Everything Changes, 2017

2016

How hot is .brussels? Impact of the uptake of the .brussels top-level domain name extension.

CoRR, 2016

Exploring archives with probabilistic models: Topic modelling for the valorisation of digitised archives of the European Commission.

Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016
