Filip Gralinski

Proceedings of the 20th Conference on Computer Science and Intelligence Systems, 2025

Oddballness: universal anomaly detection with language models.

[BibT_eX]

[DOI]

Ryszard Staruch

Krzysztof Jurkiewicz

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Adapting LLMs for Minimal-edit Grammatical Error Correction.

[BibT_eX]

[DOI]

Ryszard Staruch

Daniel Dzienisiewicz

Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications, 2025

2024

Tackling prediction tasks in relational databases with LLMs.

[BibT_eX]

[DOI]

Marek Wydmuch

CoRR, 2024

POLygraph: Polish Fake News Dataset.

[BibT_eX]

[DOI]

Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, 2024

Two Approaches to Diachronic Normalization of Polish Texts.

[BibT_eX]

[DOI]

Proceedings of the 8th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, 2024

2023

CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data.

[BibT_eX]

[DOI]

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Arxiv Tables: Document Understanding Challenge Linking Texts and Tables.

[BibT_eX]

[DOI]

Karolina Konopka

Michal Turski

Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

Modeling Spaced Repetition with LSTMs.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Computer Supported Education, 2023

2022

Challenging America: Modeling language in longer time scales.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Temporal Language Modeling for Short Text Document Classification with Transformers.

[BibT_eX]

[DOI]

Jakub Pokrywka

Proceedings of the 17th Conference on Computer Science and Intelligence Systems, 2022

Using Transformer models for gender attribution in Polish.

[BibT_eX]

[DOI]

Karol Kaczmarek

Jakub Pokrywka

Proceedings of the 17th Conference on Computer Science and Intelligence Systems, 2022

2021

Dynamic Boundary Time Warping for sub-sequence matching with few examples.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2021

DUE: End-to-End Document Understanding Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

LAMBERT: Layout-Aware Language Modeling for Information Extraction.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Successive Halving Top-k Operator.

[BibT_eX]

[DOI]

Michal Pietruszka

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Sparsifying Transformer Models with Differentiable Representation Pooling.

[BibT_eX]

[DOI]

Michal Pietruszka

CoRR, 2020

On the Multi-Property Extraction and Beyond.

[BibT_eX]

[DOI]

CoRR, 2020

Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout.

[BibT_eX]

[DOI]

CoRR, 2020

LAMBERT: Layout-Aware language Modeling using BERT for information extraction.

[BibT_eX]

[DOI]

CoRR, 2020

ApplicaAI at SemEval-2020 Task 11: On RoBERTa-CRF, Span CLS and Whether Self-Training Helps Them.

[BibT_eX]

[DOI]

Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Contract Discovery: Dataset and a Few-shot Semantic Retrieval Challenge with Competitive Baselines.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

From Dataset Recycling to Multi-Property Extraction and Beyond.

[BibT_eX]

[DOI]

Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

2019

Searching for Legal Clauses by Analogy. Few-shot Semantic Retrieval Shared Task.

[BibT_eX]

[DOI]

CoRR, 2019

GEval: Tool for Debugging NLP Datasets and Models.

[BibT_eX]

[DOI]

Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019

2017

Automated Normalization and Analysis of Historical Texts.

[BibT_eX]

[DOI]

Pawel Skórzewski

Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2017

The RetroC challenge: how to guess the publication year of a text?

[BibT_eX]

[DOI]

Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, 2017

2016

Vive la Petite Différence! - Exploiting Small Differences for Gender Attribution of Short Texts.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

"He Said She Said" ― a Male/Female Corpus of Polish.

[BibT_eX]

[DOI]

Piotr Wierzchon

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015

RetroC - A Corpus for Evaluating Temporal Classifiers.

[BibT_eX]

[DOI]

Piotr Wierzchon

Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2015

2013

PSI-Toolkit: A Natural Language Processing Pipeline.

[BibT_eX]

[DOI]

Marcin Junczys-Dowmunt

Proceedings of the Computational Linguistics - Applications, 2013

2012

Mining the Web for Idiomatic Expressions Using Metalinguistic Markers.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

2010

Computational Lexicography of Multi-Word Units. How Efficient Can It Be?

[BibT_eX]

[DOI]

Proceedings of the 2010 Workshop on Multiword Expressions: from Theory to Applications, 2010

Matura Evaluation Experiment Based on Human Evaluation of Machine Translation.

[BibT_eX]

[DOI]

Aleksandra Wojak

Proceedings of the International Multiconference on Computer Science and Information Technology, 2010

Mining Parenthetical Translations for Polish-English Lexica.

[BibT_eX]

[DOI]

Proceedings of the Computational Linguistics and Intelligent Text Processing, 2010

2009

Acquiring Bilingual Lexica from Keyword Listings.

[BibT_eX]

[DOI]

Roman Kurc

Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2009

Looking for new words out there.

[BibT_eX]

[DOI]

Marcin Walas

Proceedings of the International Multiconference on Computer Science and Information Technology, 2009

An Environment for Named Entity Recognition and Translation.

[BibT_eX]

[DOI]

Michal Marcinczuk

Proceedings of the 13th Annual conference of the European Association for Machine Translation, 2009

2006

Some Methods of Describing Discontinuity in Polish and Their Cost-Effectiveness.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

2003

Applying Transition Networks in Translating Polish E-Mails.

[BibT_eX]