Simon Hengchen

Orcid: 0000-0002-8453-7221

According to our database1, Simon Hengchen authored at least 20 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Detection of Non-recorded Word Senses in English and Swedish.
CoRR, 2024

2023
Superlim: A Swedish Language Understanding Evaluation Benchmark.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2021
A data-driven approach to studying changing vocabularies in historical newspaper collections.
Digit. Scholarsh. Humanit., 2021

Lexical semantic change for Ancient Greek and Latin.
CoRR, 2021

Challenges for Computational Lexical Semantic Change.
CoRR, 2021

SuperSim: a test set for word similarity and relatedness in Swedish.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Dataset for Temporal Analysis of English-French Cognates.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Topic Modelling Discourse Dynamics in Historical Newspapers.
Proceedings of the Post-Proceedings of the 5th Conference Digital Humanities in the Nordic Countries (DHN 2020), 2020

2019
A computational approach to lexical polysemy in Ancient Greek.
Digit. Scholarsh. Humanit., 2019

Quantifying the impact of dirty OCR on historical text analysis: Eighteenth Century Collections Online as a case study.
Digit. Scholarsh. Humanit., 2019

From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Time-Out: Temporal Referencing for Robust Modeling of Lexical Semantic Change.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

GASC: Genre-Aware Semantic Change for Ancient Greek.
Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change, 2019

2017
Semantic Enrichment of a Multilingual Archive with Linked Open Data.
Digit. Humanit. Q., 2017

Text Mining for User Query Analysis - A 5-Step Method for Cultural Heritage Institutions.
Proceedings of the Everything Changes, 2017

2016
How hot is .brussels? Impact of the uptake of the .brussels top-level domain name extension.
CoRR, 2016

Exploring archives with probabilistic models: Topic modelling for the valorisation of digitised archives of the European Commission.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016


  Loading...