Dawn Knight

Orcid: 0000-0002-4745-6502

According to our database1, Dawn Knight authored at least 17 papers between 2008 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
GaelEval: Benchmarking LLM Performance for Scottish Gaelic.
CoRR, April, 2026

FreeTxt-Vi: A Benchmarked Vietnamese-English Toolkit for Segmentation, Sentiment, and Summarisation.
CoRR, March, 2026

Creating a Hybrid Rule and Neural Network Based Semantic Tagger using Silver Standard Data: the PyMUSAS framework for Multilingual Semantic Annotation.
CoRR, January, 2026

2025
SENTimental - a Simple Multilingual Sentiment Annotation Tool.
Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing, 2025

FreeTxt: Analyse and Visualise Multilingual Qualitative Survey Data for Cultural Heritage Sites.
Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing, 2025

UniversalCEFR: Enabling Open Multilingual Research on Language Proficiency Assessment.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2023
Open-Source Thesaurus Development for Under-Resourced Languages: a Welsh Case Study.
Proceedings of the 4th Conference on Language, Data and Knowledge, 2023

2022
Introducing the Welsh Text Summarisation Dataset and Baseline Systems.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
A systematic review of unsupervised approaches to grammar induction.
Nat. Lang. Eng., 2021

Developing computational infrastructure for the CorCenCC corpus: The National Corpus of Contemporary Welsh.
Lang. Resour. Evaluation, 2021

2020
The National Corpus of Contemporary Welsh: Project Report | Y Corpws Cenedlaethol Cymraeg Cyfoes: Adroddiad y Prosiect.
CoRR, 2020

A Cognitive Approach to Parsing with Neural Networks.
Proceedings of the Statistical Language and Speech Processing, 2020

2019
Leveraging Pre-Trained Embeddings for Welsh Taggers.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

2018
Towards a Welsh Semantic Annotation System.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Leveraging Lexical Resources and Constraint Grammar for Rule-Based Part-of-Speech Tagging in Welsh.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2016
Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2008
Introducing DRS (The Digital Replay System): a Tool for the Future of Corpus Linguistic Research and Analysis.
Proceedings of the International Conference on Language Resources and Evaluation, 2008


  Loading...