Petr Knoth

ORCID: 0000-0003-1161-7359

Affiliations:
  • Open University, Knowledge Media Institute, Milton Keynes, UK (PhD 2014)


According to our database, Petr Knoth authored at least 78 papers between 2010 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Compare: A Framework for Scientific Comparisons.
CoRR, September, 2025

USRN Discovery Pilot: Increasing the Discoverability of Open Access Content Through a National Network.
CoRR, August, 2025

Interoperable verification and dissemination of software assets in repositories using COAR Notify.
CoRR, August, 2025

Making Software FAIR: A machine-assisted workflow for the research software lifecycle.
CoRR, January, 2025

LongEval at CLEF 2025: Longitudinal Evaluation of IR Model Performance.
Proceedings of the Advances in Information Retrieval, 2025

LongEval at CLEF 2025: Longitudinal Evaluation of IR Systems on Web and Scientific Data.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2025

2023
Effective matching of patients to clinical trials using entity extraction and neural re-ranking.
J. Biomed. Informatics, August, 2023

An analysis of work saved over sampling in the evaluation of automated citation screening in systematic literature reviews.
Intell. Syst. Appl., May, 2023

Explainable online health information truthfulness in Consumer Health Search.
Frontiers Artif. Intell., February, 2023

Predicting article quality scores with machine learning: The U.K. Research Excellence Framework.
Quant. Sci. Stud., 2023

VoMBaT: A Tool for Visualising Evaluation Measure Behaviour in High-Recall Search Tasks.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Outcome-based Evaluation of Systematic Review Automation.
Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval, 2023

CORE-GPT: Combining Open Access Research and Large Language Models for Credible, Trustworthy Question Answering.
Proceedings of the Linking Theory and Practice of Digital Libraries: 27th International Conference on Theory and Practice of Digital Libraries, 2023

Ranking for Learning: Studying Users' Perceptions of Relevance, Understandability, and Engagement.
Proceedings of the Linking Theory and Practice of Digital Libraries: 27th International Conference on Theory and Practice of Digital Libraries, 2023

Readability Measures as Predictors of Understandability and Engagement in Searching to Learn.
Proceedings of the Linking Theory and Practice of Digital Libraries: 27th International Conference on Theory and Practice of Digital Libraries, 2023

CRUISE-Screening: Living Literature Reviews Toolbox.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Prompting Strategies for Citation Classification.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
Formal Analysis and Estimation of Chance in Datasets Based on Their Properties.
IEEE Trans. Knowl. Data Eng., 2022

Indicators of research quality, quantity, openness, and responsibility in institutional review, promotion, and tenure policies across seven countries.
Quant. Sci. Stud., 2022

Predicting article quality scores with machine learning: The UK Research Excellence Framework.
CoRR, 2022

Confidence estimation of classification based on the distribution of the neural network output layer.
CoRR, 2022

A Systematic Review of Data Management Platforms.
Proceedings of the Information Systems and Technologies, 2022

Enhancing discovery and enriching the scholarly graph with the Research Outputs Metadata Schema (Rioxx).
Proceedings of the Workshop on Open Citations and Open Scholarly Metadata 2022, 2022

ACT2: A multi-disciplinary semi-structured dataset for importance and purpose classification of citations.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Dynamic Context Extraction for Citation Classification.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Cui Bono? Cumulative Advantage in Open Access Publishing.
Proceedings of the Linking Theory and Practice of Digital Libraries, 2022

Automation of Citation Screening for Systematic Literature Reviews Using Neural Networks: A Replicability Study.
Proceedings of the Advances in Information Retrieval, 2022

Benchmark for Research Theme Classification of Scholarly Documents.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

Overview of the Third Workshop on Scholarly Document Processing.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

2021
A meta-analysis of semantic classification of citations.
Quant. Sci. Stud., 2021

Information Retrieval Evaluation in Knowledge Acquisition Tasks.
Proceedings of the Joint Proceedings of the Second Workshop on Bridging the Gap between Information Science, 2021

2020
Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word Embeddings.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

An Authoritative Approach to Citation Classification.
JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

8th International Workshop on Mining Scientific Publications (WOSP 2020).
JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

Open Access 2007 - 2017: Country and University Level Perspective.
JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

2019
ACT: An Annotation Platform for Citation Typing at Scale.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019

Do Authors Deposit on Time? Tracking Open Access Policy Compliance.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019

Online Evaluations for Everyone: Mr. DLib's Living Lab for Scholarly Recommendations.
Proceedings of the Advances in Information Retrieval, 2019

2018
Do citations and readership identify seminal publications?
Scientometrics, 2018

Mr. DLib's Living Lab for Scholarly Recommendations.
CoRR, 2018

Using citation-context to reduce topic drifting on pure citation-based recommendation.
Proceedings of the 12th ACM Conference on Recommender Systems, 2018

Analyzing Citation-Distance Networks for Evaluating Publication Impact.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Peer Review and Citation Data in Predicting University Rankings, a Large-Scale Analysis.
Proceedings of the Digital Libraries for Open Knowledge, 2018

Research Collaboration Analysis Using Text and Graph Features.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2018

2017
Towards effective research recommender systems for repositories.
CoRR, 2017

Workshop on Scholarly Web Mining (SWM 2017).
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Building recommender systems for scholarly information.
Proceedings of the 1st Workshop on Scholarly Web Mining, 2017

Citations and Readership are Poor Indicators of Research Excellence: Introducing TrueImpactDataset, a New Dataset for Validating Research Evaluation Metrics.
Proceedings of the 1st Workshop on Scholarly Web Mining, 2017

Can we do better than Co-Citations? - Bringing Citation Proximity Analysis from idea to practice in research article recommendation.
Proceedings of the 2nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL 2017) co-located with the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017), 2017

Building Scalable Digital Library Ingestion Pipelines Using Microservices.
Proceedings of the Metadata and Semantic Research - 11th International Conference, 2017

Incidental or influential? - A decade of using text-mining for citation function classification.
Proceedings of the 16th International Conference on Scientometrics and Informetrics, 2017

Incidental or Influential? - Challenges in Automatically Detecting Citation Importance Using Publication Full Texts.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2017

What Others Say About This Work? Scalable Extraction of Citation Contexts from Research Papers.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2017

Classifying Document Types to Enhance Search and Recommendations in Digital Libraries.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2017

2016
Stepping up Open Science Training for European Research.
Publ., 2016

An Analysis of the Microsoft Academic Graph.
D Lib Mag., 2016

Current Research on Mining Scientific Publications.
D Lib Mag., 2016

Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking.
CoRR, 2016

5th International Workshop on Mining Scientific Publications (WOSP 2016).
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016

Semantometrics: Towards Fulltext-based Research Evaluation.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016

2015
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysing Patterns of Research Collaboration.
D Lib Mag., 2015

WOSP2015: 4th International Workshop on Mining Scientific Publications.
Proceedings of the 15th ACM/IEEE-CS Joint Conference on Digital Libraries, 2015

Semantometrics: Fulltext-Based Measures for Analysing Research Collaboration.
Proceedings of the 15th International Conference on Scientometrics and Informetrics, Istanbul, Turkey, June 29, 2015

Fostering open science to research using a taxonomy and an eLearning portal.
Proceedings of the 15th International Conference on Knowledge Technologies and Data-driven Business, 2015

2014
Towards Semantometrics: A New Semantic Similarity Based Measure for Assessing a Research Publication's Contribution.
D Lib Mag., 2014

Design of Europeana Cloud technical infrastructure.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

2013
Simple Yet Effective Methods for Cross-Lingual Link Discovery (CLLD) - KMI @ NTCIR-10 CrossLink-2.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

CORE: aggregation use cases for open access.
Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, 2013

Sharing and Reusing Multimedia Multilingual Educational Resources in Medicine.
Proceedings of the Data and Knowledge for Medical Decision Support, 2013

2012
Special Issue on Mining Scientific Publications.
D Lib Mag., 2012

CORE: Three Access Levels to Underpin Open Access.
D Lib Mag., 2012

Visual Search for Supporting Content Exploration in Large Document Collections.
D Lib Mag., 2012

2011
Cross-Lingual Web API Classification and Annotation.
Proceedings of the 2nd International Workshop on the Multilingual Semantic Web, 2011

KMI, The Open University at NTCIR-9 CrossLink: Cross-Lingual Link Discovery in Wikipedia Using Explicit Semantic Analysis.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Connecting Repositories in the Open Access Domain Using Text Mining and Semantic Data.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2011

2010
EUROGENE: Multilingual Retrieval and Machine Translation Applied to Human Genetics.
Proceedings of the Advances in Information Retrieval, 2010

Automatic generation of inter-passage links based on semantic similarity.
Proceedings of the COLING 2010, 2010

