Filip Gralinski

Orcid: 0000-0001-8066-4533

According to our database1, Filip Gralinski authored at least 38 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Oddballness: universal anomaly detection with language models.
CoRR, 2024

Two Approaches to Diachronic Normalization of Polish Texts.
CoRR, 2024

POLygraph: Polish Fake News Dataset.
Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, 2024

2023
CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Arxiv Tables: Document Understanding Challenge Linking Texts and Tables.
Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

Modeling Spaced Repetition with LSTMs.
Proceedings of the 15th International Conference on Computer Supported Education, 2023

2022
Challenging America: Modeling language in longer time scales.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Temporal Language Modeling for Short Text Document Classification with Transformers.
Proceedings of the 17th Conference on Computer Science and Intelligence Systems, 2022

Using Transformer models for gender attribution in Polish.
Proceedings of the 17th Conference on Computer Science and Intelligence Systems, 2022

2021
Dynamic Boundary Time Warping for sub-sequence matching with few examples.
Expert Syst. Appl., 2021

DUE: End-to-End Document Understanding Benchmark.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

LAMBERT: Layout-Aware Language Modeling for Information Extraction.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Successive Halving Top-k Operator.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Sparsifying Transformer Models with Differentiable Representation Pooling.
CoRR, 2020

On the Multi-Property Extraction and Beyond.
CoRR, 2020

Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout.
CoRR, 2020

LAMBERT: Layout-Aware language Modeling using BERT for information extraction.
CoRR, 2020

ApplicaAI at SemEval-2020 Task 11: On RoBERTa-CRF, Span CLS and Whether Self-Training Helps Them.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Contract Discovery: Dataset and a Few-shot Semantic Retrieval Challenge with Competitive Baselines.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

From Dataset Recycling to Multi-Property Extraction and Beyond.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

2019
Searching for Legal Clauses by Analogy. Few-shot Semantic Retrieval Shared Task.
CoRR, 2019

GEval: Tool for Debugging NLP Datasets and Models.
Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019

2017
Automated Normalization and Analysis of Historical Texts.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2017

The RetroC challenge: how to guess the publication year of a text?
Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, 2017

2016
Vive la Petite Différence! - Exploiting Small Differences for Gender Attribution of Short Texts.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

"He Said She Said" ― a Male/Female Corpus of Polish.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
RetroC - A Corpus for Evaluating Temporal Classifiers.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2015

2013
PSI-Toolkit: A Natural Language Processing Pipeline.
Proceedings of the Computational Linguistics - Applications, 2013

2012
Mining the Web for Idiomatic Expressions Using Metalinguistic Markers.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

2010
Computational Lexicography of Multi-Word Units. How Efficient Can It Be?
Proceedings of the 2010 Workshop on Multiword Expressions: from Theory to Applications, 2010

Matura Evaluation Experiment Based on Human Evaluation of Machine Translation.
Proceedings of the International Multiconference on Computer Science and Information Technology, 2010

Mining Parenthetical Translations for Polish-English Lexica.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2010

2009
Acquiring Bilingual Lexica from Keyword Listings.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2009

Looking for new words out there.
Proceedings of the International Multiconference on Computer Science and Information Technology, 2009

An Environment for Named Entity Recognition and Translation.
Proceedings of the 13th Annual conference of the European Association for Machine Translation, 2009

2006
Some Methods of Describing Discontinuity in Polish and Their Cost-Effectiveness.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

2003
Applying Transition Networks in Translating Polish E-Mails.
Proceedings of the Intelligent Information Processing and Web Mining, 2003


  Loading...