Jan Snajder

Orcid: 0000-0001-8942-5301

According to our database1, Jan Snajder authored at least 114 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
From Robustness to Improved Generalization and Calibration in Pre-trained Language Models.
CoRR, 2024

LLMs for Targeted Sentiment in News Headlines: Exploring Different Levels of Prompt Prescriptiveness.
CoRR, 2024

Are ELECTRA's Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity.
CoRR, 2024

Do Not (Always) Look Right: Investigating the Capabilities of Decoder-Based Large Language Models for Sequence Labeling.
CoRR, 2024

Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
Out-of-Distribution Detection by Leveraging Between-Layer Transformation Smoothness.
CoRR, 2023

Leveraging Open Information Extraction for Improving Few-Shot Trigger Detection Domain Transfer.
CoRR, 2023

Paragraph-level Citation Recommendation based on Topic Sentences as Queries.
CoRR, 2023

Data Augmentation for Neural NLP.
CoRR, 2023

Parameter-Efficient Language Model Tuning with Active Learning in Low-Resource Settings.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ALANNO: An Active Learning Annotation System for Mortals.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

Easy to Decide, Hard to Agree: Reducing Disagreements Between Saliency Methods.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

On Dataset Transferability in Active Learning for Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
SIMPA: Statement-to-Item Matching Personality Assessment from text.
Future Gener. Comput. Syst., 2022

An empirical study of the design choices for local citation recommendation systems.
Expert Syst. Appl., 2022

Smooth Sailing: Improving Active Learning for Pre-trained Language Models with Representation Smoothness Analysis.
CoRR, 2022

Toward Practical Usage of the Attention Mechanism as a Tool for Interpretability.
IEEE Access, 2022

NLPOP: a Dataset for Popularity Prediction of Promoted NLP Research on Twitter.
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, 2022

You Are What You Talk About: Inducing Evaluative Topics for Personality Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Large-scale Evaluation of Transformer-based Article Encoders on the Task of Citation Recommendation.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

2021
Word sense induction using leader-follower clustering of automatically generated lexical substitutes.
Expert Syst. Appl., 2021

A Topic Coverage Approach to Evaluation of Topic Models.
IEEE Access, 2021

PANDORA Talks: Personality and Demographics on Reddit.
Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media, 2021

2020
A Survey of Citation Recommendation Tasks and Methods.
J. Comput. Inf. Technol., 2020

Staying True to Your Word: (How) Can Attention Become Explanation?
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Improved Local Citation Recommendation Based on Context Enhanced with Global Information.
Proceedings of the First Workshop on Scholarly Document Processing, 2020

2019
TakeLab at SemEval-2019 Task 4: Hyperpartisan News Detection.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Evaluating Automatic Term Extraction Methods on Individual Documents.
Proceedings of the Joint Workshop on Multiword Expressions and WordNet, 2019

Analysing Rhetorical Structure as a Key Feature of Summary Coherence.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

2018
Document-based topic coherence measures for news media text.
Expert Syst. Appl., 2018

Paraphrase-focused learning to rank for domain-specific frequently asked questions retrieval.
Expert Syst. Appl., 2018

Not Just Depressed: Bipolar Disorder Prediction on Reddit.
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

TakeLab at SemEval-2018 Task 7: Combining Sparse and Dense Features for Relation Classification in Scientific Texts.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

TakeLab at SemEval-2018 Task12: Argument Reasoning Comprehension with Skip-Thought Vectors.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Lexical Substitution for Evaluating Compositional Distributional Models.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Iterative Recursive Attention Model for Interpretable Sequence Classification.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Combining Shallow and Deep Learning for Aggressive Text Detection.
Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, 2018

Reddit: A Gold Mine for Personality Prediction.
Proceedings of the Second Workshop on Computational Modeling of People's Opinions, 2018

Cross-Domain Detection of Abusive Language Online.
Proceedings of the 2nd Workshop on Abusive Language Online, 2018

Leveraging Lexical Substitutes for Unsupervised Word Sense Induction.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Unsupervised Acquisition of Comprehensive Multiword Lexicons using Competition in an n-gram Lattice.
Trans. Assoc. Comput. Linguistics, 2017

Social Media Argumentation Mining: The Quest for Deliberateness in Raucousness.
CoRR, 2017

Toward Stance Classification Based on Claim Microstructures.
Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, 2017

Does Free Word Order Hurt? Assessing the Practical Lexical Function Model for Croatian.
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017

TakeLab-QA at SemEval-2017 Task 3: Classification Experiments for Answer Retrieval in Community QA.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

TakeLab at SemEval-2017 Task 5: Linear aggregation of word embeddings for fine-grained sentiment analysis of financial news.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

TakeLab at SemEval-2017 Task 4: Recent Deaths and the Power of Nostalgia in Sentiment Analysis in Twitter.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

TakeLab at SemEval-2017 Task 6: #RankingHumorIn4Pages.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Detecting Non-covered Questions in Frequently Asked Questions Collections.
Proceedings of the Natural Language Processing and Information Systems, 2017

Combining Linguistic Features for the Detection of Croatian Multiword Expressions.
Proceedings of the 13th Workshop on Multiword Expressions, 2017

Using Analytic Scoring Rubrics in the Automatic Assessment of College-Level Summary Writing Tasks in L2.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Predicting News Values from Headline Text and Emotions.
Proceedings of the 2017 Workshop: Natural Language Processing meets Journalism, 2017

Linguistic Features and Newsworthiness: an Analysis of News style.
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017), 2017

Two Layers of Annotation for Representing Event Mentions in News Stories.
Proceedings of the 11th Linguistic Annotation Workshop, 2017

Comparison of Short-Text Sentiment Analysis Methods for Croatian.
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, 2017

The First Cross-Lingual Challenge on Recognition, Normalization, and Matching of Named Entities in Slavic Languages.
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, 2017

Debunking Sentiment Lexicons: A Case of Domain-Specific Sentiment Classification for Croatian.
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, 2017

A Preliminary Study of Croatian Lexical Substitution.
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, 2017

2016
FAQIR - A Frequently Asked Questions Retrieval Test Collection.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

TakeLab at SemEval-2016 Task 6: Stance Classification in Tweets Using a Genetic Algorithm Based Ensemble.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Graph-Based Induction of Word Senses in Croatian.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Analysis of Policy Agendas: Lessons Learned from Automatic Topic Classification of Croatian Political Texts.
Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, 2016

Smoothing Syntax-Based Semantic Spaces: Let The Winner Take It All.
Proceedings of the 13th Conference on Natural Language Processing, 2016

A Pilot Study in Using Argumentation Frameworks for Online Debates.
Proceedings of the First International Workshop on Systems and Algorithms for Formal Argumentation (SAFA) co-located with the 6th International Conference on Computational Models of Argument (COMMA 2016), 2016

Predictability of Distributional Semantics in Derivational Word Formation.
Proceedings of the COLING 2016, 2016

Detecting and Ranking Conceptual Links between Texts Using a Knowledge Base.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Fill the Gap! Analyzing Implicit Premises between Claims from Online Debates.
Proceedings of the Third Workshop on Argument Mining, 2016

2015
Construction and evaluation of event graphs.
Nat. Lang. Eng., 2015

Modeling Semantic Compositionality of Croatian Multiword Expressions.
Informatica (Slovenia), 2015

TKLBLIIR: Detecting Twitter Paraphrases with TweetingJay.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Morphological priming in German: the word is not enough (or is it?).
Proceedings of the NetWordS Final Conference on Word Knowledge and Word Usage: Representations and Processes in the Mental Lexicon, Pisa, Italy, March 30, 2015

Identifying Prominent Arguments in Online Debates Using Semantic Textual Similarity.
Proceedings of the 2nd Workshop on Argumentation Mining, 2015

Obtaining a Better Understanding of Distributional Models of German Derivational Morphology.
Proceedings of the 11th International Conference on Computational Semantics, 2015

Evaluation of Manual Query Expansion Rules on a Domain Specific FAQ Collection.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2015

Getting the Agenda Right: Measuring Media Agenda using Topic Models.
Proceedings of the 2015 Workshop on Topic Models: Post-Processing and Applications, 2015

Resolving Entity Coreference in Croatian with a Constrained Mention-Pair Model.
Proceedings of the 5th Workshop on Balto-Slavic Natural Language Processing, 2015

Experiments on Active Learning for Croatian Word Sense Disambiguation.
Proceedings of the 5th Workshop on Balto-Slavic Natural Language Processing, 2015

2014
Event graphs for information retrieval and multi-document summarization.
Expert Syst. Appl., 2014

Constructing Coherent Event Hierarchies from News Stories.
Proceedings of TextGraphs@EMNLP 2014: the 9th Workshop on Graph-based Methods for Natural Language Processing, 2014

DerivBase.hr: A High-Coverage Derivational Morphology Resource for Croatian.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

HiEve: A Corpus for Extracting Event Hierarchies from News Stories.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Towards Semantic Validation of a Derivational Lexicon.
Proceedings of the COLING 2014, 2014

Back up your Stance: Recognizing Arguments in Online Discussions.
Proceedings of the First Workshop on Argument Mining, 2014

2013
CroNER: Recognizing Named Entities in Croatian Using Conditional Random Fields.
Informatica (Slovenia), 2013

Event-Centered Information Retrieval Using Kernels on Event Graphs.
Proceedings of TextGraphs@EMNLP 2013: the 8th Workshop on Graph-based Methods for Natural Language Processing, 2013

Exploring Coreference Uncertainty of Generically Extracted Event Mentions.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

DErivBase: Inducing and Evaluating a Derivational Morphology Resource for German.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Building and Evaluating a Distributional Memory for Croatian.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Derivational Smoothing for Syntactic Distributional Semantics.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Recognizing Identical Events with Graph Kernels.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Frequently Asked Questions Retrieval for Croatian Based on Semantic Textual Similarity.
Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 2013

Aspect-Oriented Opinion Mining from User Reviews in Croatian.
Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 2013

GPKEX: Genetically Programmed Keyphrase Extraction from Croatian Texts.
Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 2013

2012
Optimizing Sentence Boundary Detection for Croatian.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Towards a Constraint Grammar Based Morphological Tagger for Croatian.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Semi-supervised Acquisition of Croatian Sentiment Lexicon.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

TakeLab: Systems for Measuring Semantic Text Similarity.
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

From Requirements to Code: Syntax-Based Requirements Analysis for Data-Driven Application Development.
Proceedings of the Natural Language Processing and Information Systems, 2012

Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
Unsupervised Topic-Oriented Keyphrase Extraction and Its Application to Croatian.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Question Classification for a Croatian QA System.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Random Indexing Distributional Semantic Models for Croatian Language.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

2010
Extending lexical association measures for collocation extraction.
Comput. Speech Lang., 2010

Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

2009
String Distance-Based Stemming of the Highly Inflected Croatian Language.
Proceedings of the Recent Advances in Natural Language Processing, 2009

TermeX: A Tool for Collocation Extraction.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2009

2008
Automatic acquisition of inflectional lexica for morphological normalisation.
Inf. Process. Manag., 2008

Language morphology offset: Text classification on a Croatian-English parallel corpus.
Inf. Process. Manag., 2008

Building a search engine model with morphological normalization support.
Proceedings of the ITI 2008 30th International Conference on Information Technology Interfaces, 2008

Evolving New Lexical Association Measures Using Genetic Programming.
Proceedings of the ACL 2008, 2008

2006
Comparison of Collocation Extraction Measures for Document Indexing.
J. Comput. Inf. Technol., 2006

2005
Computer-Aided Document Indexing System.
J. Comput. Inf. Technol., 2005


  Loading...