Jonas Golde

Orcid: 0000-0002-8160-3000

According to our database1, Jonas Golde authored at least 19 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Hierarchical Text Classification with LLM-Refined Taxonomies.
CoRR, January, 2026

What Matters When Building Universal Multilingual Named Entity Recognition Models?
CoRR, January, 2026

Hierarchical Text Classification with LLM-Refined Taxonomies.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025
Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements.
CoRR, November, 2025

PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models.
CoRR, October, 2025

Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models.
CoRR, April, 2025

MastermindEval: A Simple But Scalable Reasoning Benchmark.
CoRR, March, 2025

Familarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Question Decomposition for Retrieval-Augmented Generation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2025

2024
Non-rigid point cloud registration for middle ear diagnostics with endoscopic optical coherence tomography.
Int. J. Comput. Assist. Radiol. Surg., January, 2024

BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models.
CoRR, 2024

Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data.
CoRR, 2024

Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

PECC: Problem Extraction and Coding Challenges.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
A Handheld Fiber-Optic Probe to Enable Optical Coherence Tomography of Oral Soft Tissue.
IEEE Trans. Biomed. Eng., 2022

BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing.
CoRR, 2022



  Loading...