Kenneth C. Enevoldsen

Orcid: 0000-0001-8733-0966

According to our database1, Kenneth C. Enevoldsen authored at least 27 papers between 2021 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
HUME: Measuring the Human-Model Performance Gap in Text Embedding Tasks.
CoRR, October, 2025

Continuous sentiment scores for literary and multilingual contexts.
CoRR, August, 2025

Dynaword: From One-shot to Continuously Developed Datasets.
CoRR, August, 2025

Turftopic: Topic Modelling with Contextual Representations from Sentence Transformers.
J. Open Source Softw., July, 2025

Maintaining MTEB: Towards Long Term Usability and Reproducibility of Embedding Benchmarks.
CoRR, June, 2025

MIEB: Massive Image Embedding Benchmark.
CoRR, April, 2025

MMTEB: Massive Multilingual Text Embedding Benchmark.
CoRR, February, 2025

Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks.
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies, 2025

topicwizard - a Modern, Model-agnostic Framework for Topic Model Visualization and Interpretation.
Proceedings of the 8th International Conference on Natural Language and Speech Processing, 2025


S³ - Semantic Signal Separation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Epistemic consequences of unfair tools.
Digit. Scholarsh. Humanit., 2024

Exposing Assumptions in AI Benchmarks through Cognitive Modelling.
CoRR, 2024

S<sup>3</sup> - Semantic Signal Separation.
CoRR, 2024

DANSK and DaCy 2.6.0: Domain Generalization of Danish Named Entity Recognition.
CoRR, 2024

The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
TextDescriptives: A Python package for calculating a large variety of metrics from text.
J. Open Source Softw., June, 2023

timeseriesflattener: A Python package for summarizing features from (medical) time series.
J. Open Source Softw., March, 2023

Augmenty: A Python Library for Structured Text Augmentation.
CoRR, 2023

Danish Foundation Models.
CoRR, 2023

Embed-Search-Align: DNA Sequence Alignment using Transformer Models.
CoRR, 2023

TextDescriptives: A Python package for calculating a large variety of statistics from text.
CoRR, 2023

DanSumT5: Automatic Abstractive Summarization for Danish.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

2021
From close listening to distant listening: Developing tools for Speech-Music discrimination of Danish music radio.
Digit. Humanit. Q., 2021

When no news is bad news - Detection of negative events from news media content.
CoRR, 2021

News Information Decoupling: An Information Signature of Catastrophes in Legacy News Media.
CoRR, 2021

DaCy: A Unified Framework for Danish NLP.
Proceedings of the Conference on Computational Humanities Research, 2021


  Loading...