Martin Potthast

Proceedings of the Second International Workshop on Scholarly Information Access (SCOLIA 2026) co-located with the 48th European Conference on Information Retrieval (ECIR 2026), 2026

Creating Specialized RAG-Based Search Engines Using the Open Web Index.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

An Open SERP Mining Infrastructure for the Archive Query Log.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

Overview of Touché 2026: Argumentation Systems - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

Evaluating the Efficiency and Effectiveness of Learned Sparse Retrieval with the lsr_benchmark.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

The Third International Workshop on Open Web Search (WOWS).

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

Overview of PAN 2026: Voight-Kampff Generative AI Detection, Text Watermarking, Multi-author Writing Style Analysis, Generative Plagiarism Detection, and Reasoning Trajectory Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

2025

About Authenticity in Information Retrieval: A Keynote at ICTIR 2025.

[BibT_eX]

[DOI]

SIGIR Forum, December, 2025

Webis Generated Native Ads 2025.

[BibT_eX]

[DOI]

Dataset, December, 2025

Touche 2026 Causality Mock Dataset.

[BibT_eX]

[DOI]

Tim Hagen

Dataset, December, 2025

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Investigating Counterclaims in Causality Extraction from Text.

[BibT_eX]

[DOI]

CoRR, October, 2025

Topic-Specific Classifiers are Better Relevance Judges than Prompted LLMs.

[BibT_eX]

[DOI]

CoRR, October, 2025

Retrieval-Augmented Generation - The Future of Search? (Dagstuhl Seminar 25391).

[BibT_eX]

[DOI]

Dagstuhl Reports, September, 2025

Webis Generated Native Ads 2025.

[BibT_eX]

[DOI]

Dataset, September, 2025

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, August, 2025

Webis Generated Native Ads 2025.

[BibT_eX]

[DOI]

Dataset, August, 2025

Touché-25-Advertisement-in-Retrieval-Augmented-Generation.

[BibT_eX]

[DOI]

Dataset, May, 2025

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, April, 2025

Touché25-Image-Retrieval-and-Generation-for-Arguments.

[BibT_eX]

[DOI]

Dataset, April, 2025

Touché-25-Advertisement-in-Retrieval-Augmented-Generation.

[BibT_eX]

[DOI]

Dataset, April, 2025

Touché-25-Advertisement-in-Retrieval-Augmented-Generation.

[BibT_eX]

[DOI]

Dataset, April, 2025

Touché-25-Advertisement-in-Retrieval-Augmented-Generation.

[BibT_eX]

[DOI]

Dataset, April, 2025

Webis Generated Native Ads 2024.

[BibT_eX]

[DOI]

Dataset, April, 2025

PAN25 Multi-Author Writing Style Analysis.

[BibT_eX]

[DOI]

Dataset, March, 2025

PAN'25 Generative AI Detection (Task 1): Voight-Kampff AI Detection Sensitivity.

[BibT_eX]

[DOI]

Dataset, March, 2025

PAN25 Multi-Author Writing Style Analysis.

[BibT_eX]

[DOI]

Dataset, February, 2025

PAN25 Multi-Author Writing Style Analysis.

[BibT_eX]

[DOI]

Dataset, February, 2025

Touché-25-Advertisement-in-Retrieval-Augmented-Generation.

[BibT_eX]

[DOI]

Dataset, January, 2025

Touché-25-Advertisement-in-Retrieval-Augmented-Generation.

[BibT_eX]

[DOI]

Dataset, January, 2025

Touché-25-Advertisement-in-Retrieval-Augmented-Generation.

[BibT_eX]

[DOI]

Dataset, January, 2025

Webis Generated Native Ads 2024.

[BibT_eX]

[DOI]

Dataset, January, 2025

Webis Crowd RAG Corpus 2025.

[BibT_eX]

[DOI]

Dataset, January, 2025

TITE: Token-Independent Text Encoder for Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

AiReview: An Open Platform for Accelerating Systematic Reviews with LLMs.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

TIREx Tracker: The Information Retrieval Experiment Tracker.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

The Viability of Crowdsourcing for RAG Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Large Language Model Relevance Assessors Agree With One Another More Than With Human Assessors.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

ReNeuIR at SIGIR 2025: The Fourth Workshop on Reaching Efficiency in Neural Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Ask, Retrieve, Summarize: A Modular Pipeline for Scientific Literature Summarization.

[BibT_eX]

[DOI]

Pierre Achkar

Proceedings of the First International Workshop on Scholarly Information Access co-located with 47th European Conference on Information Retrieval (ECIR 2025), 2025

Axioms for Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval, 2025

Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins.

[BibT_eX]

[DOI]

Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval, 2025

Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-ranking.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

Set-Encoder: Permutation-Invariant Inter-passage Attention for Listwise Passage Re-ranking with Cross-Encoders.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

Web-Scale Retrieval Experimentation with chatnoir-pyterrier.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

A Test Collection for Dataset Retrieval.

[BibT_eX]

[DOI]

Nikolay Kolyada

Alba García Seco de Herrera

Proceedings of the Advances in Information Retrieval, 2025

Overview of Touché 2025: Argumentation Systems - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

Counterfactual Query Rewriting to Use Historical Relevance Feedback.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

ImageCLEF 2025: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications.

[BibT_eX]

[DOI]

Christoph M. Friedrich

Cynthia Sabrina Schmidt

Diandra Fabre

Didier Schwab

Dimitar Dimitrov

Emmanuelle Esperança-Rodier

Mihai Gabriel Constantin

Steven Alexander Hicks

Sushant Gautam

Tabea Margareta Grace Pakull

Proceedings of the Advances in Information Retrieval, 2025

Ranking Generated Answers - On the Agreement of Retrieval Models with Humans on Consumer Health Questions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

Call for Research on the Impact of Information Retrieval on Social Norms.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

Corpus Subsampling: Estimating the Effectiveness of Neural Retrieval Models on Large Corpora.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

The Second International Workshop on Open Web Search (WOWS).

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

Overview of PAN 2025: Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection - Extended Abstract.

[BibT_eX]

[DOI]

Alexandra-Georgiana Andrei

Proceedings of the Advances in Information Retrieval, 2025

Overview of the Multi-Author Writing Style Analysis Task at PAN 2025.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025

Overview of Touché 2025: Argumentation Systems.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2025

Simplified Longitudinal Retrieval Experiments: A Case Study on Query Expansion and Document Boosting.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2025

Overview of ImageCLEF 2025: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications.

[BibT_eX]

[DOI]

Bogdan Ionescu

Henning Müller

Dan-Cristian Stanciu

Ahmedkhan Radzhabov

Yuri Prokopchuk

Liviu-Daniel Stefan

Mihai Gabriel Constantin

Alba Garcia Seco de Herrera

Christoph M. Friedrich

Cynthia Sabrina Schmidt

Tabea Margareta Grace Pakull

Steven Alexander Hicks

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2025

Overview of the Plagiarism Detection Task at PAN 2025.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025

Overview of the "Voight-Kampff" Generative AI Authorship Verification Task at PAN and ELOQUENT 2025.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025

Overview of PAN 2025: Voight-Kampff Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2025

Team OpenWebSearch at LongEval: Using Historical Data for Scientific Search.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025

The Two Paradigms of LLM Detection: Authorship Attribution vs Authorship Verification.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, November, 2024

Touché25-Image-Retrieval-and-Generation-for-Arguments.

[BibT_eX]

[DOI]

Dataset, November, 2024

Touché-25-Advertisement-in-Retrieval-Augmented-Generation.

[BibT_eX]

[DOI]

Dataset, October, 2024

Touché21-Argument-Retrieval-for-Controversial-Questions.

[BibT_eX]

[DOI]

Dataset, September, 2024

TIRA Integrated Research Architecture.

[BibT_eX]

[DOI]

Dataset, September, 2024

ChatNoir Resiliparse.

[BibT_eX]

[DOI]

Dataset, September, 2024

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, August, 2024

Report on the 1st International Workshop on Open Web Search (WOWS 2024) at ECIR 2024.

[BibT_eX]

[DOI]

SIGIR Forum, June, 2024

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, June, 2024

Impact and development of an Open Web Index for open web search.

[BibT_eX]

[DOI]

J. Assoc. Inf. Sci. Technol., May, 2024

Task-Oriented Paraphrase Analytics.

[BibT_eX]

[DOI]

Dataset, May, 2024

Supplementary run files for the paper "Learning Effective Representations for Retrieval using Self-Distillation with Adaptive Relevance Margins".

[BibT_eX]

[DOI]

Dataset, May, 2024

Manipulating Embeddings of Stable Diffusion Prompts.

[BibT_eX]

[DOI]

Julia Peters

Dataset, May, 2024

Who Determines What Is Relevant? Humans or AI? Why Not Both?

[BibT_eX]

[DOI]

Commun. ACM, April, 2024

Touché24-Image-Retrieval-and-Generation-for-Arguments.

[BibT_eX]

[DOI]

Dataset, April, 2024

Webis Product SERP Corpus 2024.

[BibT_eX]

[DOI]

Dataset, March, 2024

webis-de/WWW-24: Release 0.1.0.

[BibT_eX]

[DOI]

Dataset, March, 2024

Webis Generated Native Ads 2024.

[BibT_eX]

[DOI]

Dataset, March, 2024

PAN24 Voight-Kampff Generative AI Authorship Verification.

[BibT_eX]

[DOI]

Dataset, February, 2024

PAN24 Multi-Author Writing Style Analysis.

[BibT_eX]

[DOI]

Dataset, February, 2024

Touché24-Image-Retrieval-and-Generation-for-Arguments.

[BibT_eX]

[DOI]

Dataset, February, 2024

Wikipedia CRISPR Innovation Tracing Data 2023.

[BibT_eX]

[DOI]

Dataset, January, 2024

A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking.

[BibT_eX]

[DOI]

CoRR, 2024

If there's a Trigger Warning, then where's the Trigger? Investigating Trigger Warnings at the Passage Level.

[BibT_eX]

[DOI]

CoRR, 2024

Detecting Generated Native Ads in Conversational Search.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

A Mastodon Corpus to Evaluate Federated Microblog Search.

[BibT_eX]

[DOI]

Proceedings of the first International Workshop on Open Web Search co-located with the 46th European Conference on Information Retrieval ECIR 2024, 2024

Webis at TREC 2024: Biomedical Generative Retrieval, Retrieval-Augmented Generation, and Tip-of-the-Tongue Tracks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third Text REtrieval Conference, 2024

Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Evaluating Generative Ad Hoc Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Resources for Combining Teaching and Research in Information Retrieval Coursework.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

ReNeuIR at SIGIR 2024: The Third Workshop on Reaching Efficiency in Neural Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Objective Argument Summarization in Search.

[BibT_eX]

[DOI]

Proceedings of the Robust Argumentation Machines - First International Conference, 2024

Classification of Shared Tasks Used in Teaching.

[BibT_eX]

[DOI]

Proceedings of the 2024 on Innovation and Technology in Computer Science Education V. 1, 2024

The Information Retrieval Experiment Platform (Extended Abstract).

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Manipulating Embeddings of Stable Diffusion Prompts.

[BibT_eX]

[DOI]

Julia Peters

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Revisiting Query Variation Robustness of Transformer Models.

[BibT_eX]

[DOI]

Tim Hagen

Harrisen Scells

Theresa Reitis-Münstermann

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Zero-Shot Generative Large Language Models for Systematic Review Screening Automation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2024

Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2024

Overview of Touché 2024: Argumentation Systems.

[BibT_eX]

[DOI]

Bertrand De Longueville

Proceedings of the Advances in Information Retrieval, 2024

Advancing Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications with ImageCLEF 2024.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2024

The Open Web Index - Crawling and Indexing the Web for Public Use.

[BibT_eX]

[DOI]

Gijs Hendriksen

Michael Dinzinger

Proceedings of the Advances in Information Retrieval, 2024

The First International Workshop on Open Web Search (WOWS).

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2024

Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search Engines.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2024

Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2024

TL;DR Progress: Multi-faceted Literature Exploration in Text Summarization.

[BibT_eX]

[DOI]

Shahbaz Syed

Khalid Al Khatib

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Task-Oriented Paraphrase Analytics.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Overview of the Multi-Author Writing Style Analysis Task at PAN 2024.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

De-noising Document Classification Benchmarks via Prompt-Based Rank Pruning: A Case Study.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2024

Overview of the ImageCLEF 2024: Multimedia Retrieval in Medical Applications.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2024

Team OpenWebSearch at CLEF 2024: QuantumCLEF.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

Overview of the "Voight-Kampff" Generative AI Authorship Verification Task at PAN and ELOQUENT 2024.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification Condensed Lab Overview.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2024

Team OpenWebSearch at CLEF 2024: LongEval.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

A User Study on the Acceptance of Native Advertising in Generative IR.

[BibT_eX]

[DOI]

Ines Zelch

Proceedings of the 2024 ACM SIGIR Conference on Human Information Interaction and Retrieval, 2024

Product Spam on YouTube: A Case Study.

[BibT_eX]

[DOI]

Proceedings of the 2024 ACM SIGIR Conference on Human Information Interaction and Retrieval, 2024

2023

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, December, 2023

EMNLP-23-Bootstrapping-a-Violence-Detector-for-Fan-Fiction.

[BibT_eX]

[DOI]

Dataset, October, 2023

Webis-Context-SciSumm-2023.

[BibT_eX]

[DOI]

Dataset, October, 2023

Task-Oriented Paraphrase Analytics.

[BibT_eX]

[DOI]

Dataset, October, 2023

Touché23-Image-Retrieval-for-Arguments.

[BibT_eX]

[DOI]

Dataset, September, 2023

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, August, 2023

TIRA Integrated Research Architecture.

[BibT_eX]

[DOI]

Dataset, August, 2023

Manipulating Embeddings of Stable Diffusion Prompts.

[BibT_eX]

[DOI]

Julia Peters

Giambattista Parascandolo

Dataset, August, 2023

ChatNoir Resiliparse.

[BibT_eX]

[DOI]

Dataset, August, 2023

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, July, 2023

Report on the Dagstuhl Seminar on Frontiers of Information Access Experimentation for Research and Education.

[BibT_eX]

[DOI]

SIGIR Forum, June, 2023

A diachronic perspective on citation latency in Wikipedia articles on CRISPR/Cas-9: an exploratory case study.

[BibT_eX]

[DOI]

Scientometrics, June, 2023

Webis Wikipedia Innovation History 2023.

[BibT_eX]

[DOI]

Dataset, June, 2023

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, February, 2023

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, February, 2023

Touché23-Image-Retrieval-for-Arguments.

[BibT_eX]

[DOI]

Dataset, February, 2023

Webis Wikipedia-IPC.

[BibT_eX]

[DOI]

Dataset, February, 2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

Commercialized Generative AI: A Critical Study of the Feasibility and Ethics of Generating Native Advertising Using Large Language Models in Conversational Web Search.

[BibT_eX]

[DOI]

Ines Zelch

CoRR, 2023

Using Language Models on Low-end Hardware.

[BibT_eX]

[DOI]

CoRR, 2023

Smooth Operators for Effective Systematic Review Queries.

[BibT_eX]

[DOI]

Harrisen Scells

Ferdinand Schlatt

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

pybool_ir: A Toolkit for Domain-Specific Search Experiments.

[BibT_eX]

[DOI]

Harrisen Scells

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

The Information Retrieval Experiment Platform.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

On Stance Detection in Image Retrieval for Argumentation.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Generating Natural Language Queries for More Effective Systematic Review Screening Prioritisation.

[BibT_eX]

[DOI]

Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023

Frame-oriented Summarization of Argumentative Discussions.

[BibT_eX]

[DOI]

Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

A New Dataset for Causality Identification in Argumentative Texts.

[BibT_eX]

[DOI]

Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

OpinionConv: Conversational Product Search with Grounded Opinions.

[BibT_eX]

[DOI]

Vahid Sadiri Javadi

Lucie Flek

Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

SemEval-2023 Task 5: Clickbait Spoiling.

[BibT_eX]

[DOI]

Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

The Information Retrieval Experiment Platform.

[BibT_eX]

[DOI]

Proceedings of the Lernen, 2023

Mining the History Sections of Wikipedia Articles on Science and Technology.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2023

SMAuC - The Scientific Multi-Authorship Corpus.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2023

Perspectives on Large Language Models for Relevance Judgment.

[BibT_eX]

[DOI]

Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval, 2023

Trigger Warnings: Bootstrapping a Violence Detector for Fan Fiction.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Indicative Summarization of Long Discussions.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Citance-Contextualized Summarization of Scientific Papers.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Dynamic Exploratory Search for the Information Retrieval Anthology.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2023

Continuous Integration for Reproducible Shared Tasks with TIRA.io.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2023

Bootstrapped nDCG Estimation in the Presence of Unjudged Documents.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2023

Overview of Touché 2023: Argument and Causal Retrieval - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2023

Overview of PAN 2023: Authorship Verification, Multi-author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2023

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

Paraphrase Acquisition from Image Captions.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Topic Ontologies for Arguments.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Overview of the Multi-Author Writing Style Analysis Task at PAN 2023.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

Overview of the Trigger Detection Task at PAN 2023.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

Overview of the Authorship Verification Task at PAN 2023.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

Open Web Search at LongEval 2023: Reciprocal Rank Fusion on Automatically Generated Query Variants.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

Overview of PAN 2023: Authorship Verification, Multi-Author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection - Condensed Lab Overview.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2023

Overview of Touché 2023: Argument and Causal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

The Infinite Index: Information Retrieval on Generative Text-To-Image Models.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Human Information Interaction and Retrieval, 2023

Exploring Hyperparameter Usage and Tuning in Machine Learning Research.

[BibT_eX]

[DOI]

Proceedings of the 2nd IEEE/ACM International Conference on AI Engineering, 2023

Modeling Appropriate Language in Argumentation.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Trigger Warning Assignment as a Multi-Label Document Classification Problem.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2023

Shared Tasks as Tutorials: A Methodical Approach.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Report on the 13th Conference and Labs of the Evaluation Forum (CLEF 2022): Experimental IR Meets Multilinguality, Multimodality, and Interaction.

[BibT_eX]

[DOI]

Giovanni Da San Martino

SIGIR Forum, December, 2022

Touché23-Image-Retrieval-for-Arguments.

[BibT_eX]

[DOI]

Dataset, November, 2022

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, October, 2022

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, October, 2022

Webis Causal Question Answering 2022.

[BibT_eX]

[DOI]

Dataset, October, 2022

Webis Causal Question Answering 2022.

[BibT_eX]

[DOI]

Dataset, October, 2022

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, September, 2022

Webis Health CauseNet 2022.

[BibT_eX]

[DOI]

Dataset, September, 2022

Webis-Web-Archive-Quality-22.

[BibT_eX]

[DOI]

Dataset, July, 2022

Small-Text: Active Learning for Text Classification in Python.

[BibT_eX]

[DOI]

Dataset, June, 2022

Touché22-Image-Retrieval-for-Arguments.

[BibT_eX]

[DOI]

Dataset, June, 2022

Touché22-Image-Retrieval-for-Arguments.

[BibT_eX]

[DOI]

Dataset, June, 2022

Touché22-Image-Retrieval-for-Arguments.

[BibT_eX]

[DOI]

Dataset, June, 2022

PAN22 Authorship Analysis: Authorship Verification.

[BibT_eX]

[DOI]

Dataset, March, 2022

PAN22 Authorship Analysis: Authorship Verification.

[BibT_eX]

[DOI]

Dataset, March, 2022

PAN22 Authorship Analysis: Style Change Detection.

[BibT_eX]

[DOI]

Dataset, March, 2022

Webis Clickbait Spoiling Corpus 2022.

[BibT_eX]

[DOI]

Dataset, March, 2022

Webis Clickbait Spoiling Corpus 2022.

[BibT_eX]

[DOI]

Dataset, March, 2022

Webis-MS-MARCO-Anchor-Texts-22.

[BibT_eX]

[DOI]

Dataset, January, 2022

WARC-DL: Scalable Web Archive Processing for Deep Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Trigger Warnings: Bootstrapping a Violence Detector for FanFiction.

[BibT_eX]

[DOI]

CoRR, 2022

Tracking Discourse Influence in Darknet Forums.

[BibT_eX]

[DOI]

Christopher Akiki

Lukas Gienapp

CoRR, 2022

Webis at TREC 2022: Deep Learning and Health Misinformation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First Text REtrieval Conference, 2022

How Train-Test Leakage Affects Zero-Shot Retrieval.

[BibT_eX]

[DOI]

Proceedings of the String Processing and Information Retrieval, 2022

Differential Bias: On the Perceptibility of Stance Imbalance in Argumentation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

Sparse Pairwise Re-ranking with Pre-trained Transformers.

[BibT_eX]

[DOI]

Proceedings of the ICTIR '22: The 2022 ACM SIGIR International Conference on the Theory of Information Retrieval, Madrid, Spain, July 11, 2022

Visual Web Archive Quality Assessment.

[BibT_eX]

[DOI]

Proceedings of the Linking Theory and Practice of Digital Libraries, 2022

SUMMARY WORKBENCH: Unifying Application and Evaluation of Text Summarization Models.

[BibT_eX]

[DOI]

Shahbaz Syed

Dominik Schwabe

Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

The Power of Anchor Text in the Neural Retrieval Era.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2022

Overview of Touché 2022: Argument Retrieval - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2022

Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2022

Mining Health-related Cause-Effect Statements with High Precision at Large Scale.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

CausalQA: A Benchmark for Causal Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Overview of the Style Change Detection Task at PAN 2022.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Overview of the Authorship Verification Task at PAN 2022.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Noise-Reduction for Automatically Transferred Relevance Judgments.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2022

Overview of Touché 2022: Argument Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, and Style Change Detection.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2022

Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers.

[BibT_eX]

[DOI]

Christopher Schröder

Andreas Niekler

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Clickbait Spoiling via Question Answering and Passage Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Webis-STEREO-21 (Full Version).

[BibT_eX]

[DOI]

Dataset, December, 2021

Data for PAN at SemEval 2019 Task 4: Hyperpartisan News Detection.

[BibT_eX]

[DOI]

Dataset, December, 2021

Touché22-Argument-Retrieval-for-Controversial-Questions.

[BibT_eX]

[DOI]

Dataset, November, 2021

Touché22-Argument-Retrieval-for-Controversial-Questions.

[BibT_eX]

[DOI]

Dataset, November, 2021

Touché21-Argument-Retrieval-for-Controversial-Questions.

[BibT_eX]

[DOI]

Dataset, November, 2021

Touché21-Argument-Retrieval-for-Controversial-Questions.

[BibT_eX]

[DOI]

Dataset, November, 2021

Webis-STEREO-21.

[BibT_eX]

[DOI]

Dataset, October, 2021

Same Side Stance Classification Resampled Datasets.

[BibT_eX]

[DOI]

Dataset, September, 2021

Same Sentiment Classification Train/Dev/Test Pair IDs.

[BibT_eX]

[DOI]

Dataset, September, 2021

Same Side Stance Classification Adversarial Test Cases.

[BibT_eX]

[DOI]

Dataset, September, 2021

Webis-ArgImages-21.

[BibT_eX]

[DOI]

Dataset, August, 2021

Webis-ArgImages-21.

[BibT_eX]

[DOI]

Dataset, August, 2021

Webis-ConcluGen-2021.

[BibT_eX]

[DOI]

Dataset, May, 2021

PAN21 Authorship Analysis: Style Change Detection.

[BibT_eX]

[DOI]

Dataset, March, 2021

PAN21 Authorship Analysis: Style Change Detection.

[BibT_eX]

[DOI]

Dataset, March, 2021

Webis-Dataset-Reviews-21.

[BibT_eX]

[DOI]

Nikolay Kolyada

Dataset, February, 2021

Webis-WebSeg-20-Algorithm-Segmentations.

[BibT_eX]

[DOI]

Dataset, January, 2021

Meta-Information in Conversational Search.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., 2021

The information retrieval anthology 2021: inaugural status report and challenges ahead.

[BibT_eX]

[DOI]

SIGIR Forum, 2021

Predicting essay quality from search and writing behavior.

[BibT_eX]

[DOI]

J. Assoc. Inf. Sci. Technol., 2021

STEREO: Scientific Text Reuse in Open Access Publications.

[BibT_eX]

[DOI]

CoRR, 2021

FastWARC: Optimizing Large-Scale Web Archive Analytics.

[BibT_eX]

[DOI]

Janek Bevendorff

CoRR, 2021

The Impact of Main Content Extraction on Near-Duplicate Detection.

[BibT_eX]

[DOI]

CoRR, 2021

BERTian Poetics: Constrained Composition with Masked LMs.

[BibT_eX]

[DOI]

Christopher Akiki

CoRR, 2021

Modeling Proficiency with Implicit User Representations.

[BibT_eX]

[DOI]

CoRR, 2021

Uncertainty-based Query Strategies for Active Learning with Transformers.

[BibT_eX]

[DOI]

Christopher Schröder

Andreas Niekler

Gretel Liz De la Peña Sarracén

CoRR, 2021

Argument Undermining: Counter-Argument Generation by Attacking Weak Premises.

[BibT_eX]

[DOI]

CoRR, 2021

Webis at TREC 2021: Deep Learning, Health Misinformation, and Podcasts Tracks.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth Text REtrieval Conference, 2021

The Information Retrieval Anthology.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

CopyCat: Near-Duplicates Within and Between the ClueWeb and the Common Crawl.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Identifying Queries in Instant Search Logs.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Summary Explorer: Visualizing the State of the Art in Text Summarization.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

On Classifying whether Two Texts are on the Same Side of an Argument.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Casting the Same Sentiment Classification Problem.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

An Empirical Comparison of Web Page Segmentation Algorithms.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2021

Overview of Touché 2021: Argument Retrieval - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2021

Overview of PAN 2021: Authorship Verification, Profiling Hate Speech Spreaders on Twitter, and Style Change Detection - Extended Abstract.

[BibT_eX]

[DOI]

Janek Bevendorff

Berta Chulvi

Francisco Manuel Rangel Pardo

Proceedings of the Advances in Information Retrieval, 2021

Overview of the Style Change Detection Task at PAN 2021.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

Overview of the Cross-Domain Authorship Verification Task at PAN 2021.

[BibT_eX]

[DOI]

Gretel Liz De la Peña Sarracén

Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

Overview of Touché 2021: Argument Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2021

Overview of PAN 2021: Authorship Verification, Profiling Hate Speech Spreaders on Twitter, and Style Change Detection.

[BibT_eX]

[DOI]

Janek Bevendorff

Berta Chulvi

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2021

Learning to Rank Arguments with Feature Selection.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

Image Retrieval for Arguments Using Stance-Aware Query Expansion.

[BibT_eX]

[DOI]

Proceedings of the 8th Workshop on Argument Mining, 2021

Key Point Analysis via Contrastive Learning and Extractive Argument Summarization.

[BibT_eX]

[DOI]

Maximilian Spliethöver

Philipp Cimiano

Henning Wachsmuth

Proceedings of the 8th Workshop on Argument Mining, 2021

Generating Informative Conclusions for Argumentative Texts.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Beyond Metadata: What Paper Authors Say About Corpora They Use.

[BibT_eX]

[DOI]

Nikolay Kolyada

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Counter-Argument Generation by Attacking Weak Premises.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Webis SCSmeta 2021.

[BibT_eX]

[DOI]

Dataset, October, 2020

Webis-WebSeg-20-Algorithm-Segmentations.

[BibT_eX]

[DOI]

Dataset, October, 2020

CauseNet: Towards a Causality Graph Extracted from the Web.

[BibT_eX]

[DOI]

Stefan Heindorf

Yan Scholten

Henning Wachsmuth

Dataset, October, 2020

args.me corpus.

[BibT_eX]

[DOI]

Dataset, October, 2020

Touché20-Argument-Retrieval-for-Controversial-Questions.

[BibT_eX]

[DOI]

Dataset, September, 2020

Touché20-Argument-Retrieval-for-Controversial-Questions.

[BibT_eX]

[DOI]

Dataset, September, 2020

Touché20-Argument-Retrieval-for-Controversial-Questions.

[BibT_eX]

[DOI]

Dataset, September, 2020

Webis Gmane Email Corpus 2019.

[BibT_eX]

[DOI]

Dataset, June, 2020

Webis-WebSeg-20.

[BibT_eX]

[DOI]

Dataset, June, 2020

Webis-Web-Segments-20.

[BibT_eX]

[DOI]

Dataset, June, 2020

Webis Argument Quality Corpus 2020 (Webis-ArgQuality-20).

[BibT_eX]

[DOI]

Dataset, May, 2020

args.me corpus.

[BibT_eX]

[DOI]

Dataset, April, 2020

Disaster Tweet Corpus 2020.

[BibT_eX]

[DOI]

Dataset, March, 2020

PAN20 Authorship Analysis: Style Change Detection.

[BibT_eX]

[DOI]

Dataset, February, 2020

PAN20 Authorship Analysis: Style Change Detection.

[BibT_eX]

[DOI]

Dataset, February, 2020

PAN20 Authorship Analysis: Celebrity Profiling.

[BibT_eX]

[DOI]

Dataset, February, 2020

PAN20 Authorship Analysis: Celebrity Profiling.

[BibT_eX]

[DOI]

Dataset, February, 2020

Webis Abstractive Snippet Corpus 2020.

[BibT_eX]

[DOI]

Dataset, February, 2020

The dilemma of the direct answer.

[BibT_eX]

[DOI]

SIGIR Forum, 2020

On divergence-based author obfuscation: An attack on the state of the art in statistical authorship verification.

[BibT_eX]

[DOI]

it Inf. Technol., 2020

The Importance of Suppressing Domain Style in Authorship Analysis.

[BibT_eX]

[DOI]

CoRR, 2020

Common Conversational Community Prototype: Scholarly Conversational Assistant.

[BibT_eX]

[DOI]

CoRR, 2020

Abstractive Snippet Generation.

[BibT_eX]

[DOI]

Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Sampling Bias Due to Near-Duplicates in Learning to Rank.

[BibT_eX]

[DOI]

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Towards Predicting the Subscription Status of Twitch.tv Users - ECML-PKDD ChAT Discovery Challenge 2020.

[BibT_eX]

[DOI]

Proceedings of ECML-PKDD 2020 ChAT Discovery Challenge on Chat Analytics for Twitch co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases 2020 (ECML-PKDD 2020), 2020

Analysis of Detection Models for Disaster-Related Tweets.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Information Systems for Crisis Response and Management, 2020

Task Proposal: Abstractive Snippet Generation for Web Pages.

[BibT_eX]

[DOI]

Proceedings of the 13th International Conference on Natural Language Generation, 2020

Web Archive Analytics.

[BibT_eX]

[DOI]

Proceedings of the 50. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2020 - Back to the Future, Karlsruhe, Germany, 28. September, 2020

A Search Engine for Police Press Releases to Double-Check the News.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2020

The Effect of Content-Equivalent Near-Duplicates on the Evaluation of Search Engines.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2020

Touché: First Shared Task on Argument Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2020

Shared Tasks on Authorship Analysis at PAN 2020.

[BibT_eX]

[DOI]

Günther Specht

Eva Zangerle

Proceedings of the Advances in Information Retrieval, 2020

News Editorials: Towards Summarizing Long Argumentative Texts.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Overview of the Style Change Detection Task at PAN 2020.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2020, 2020

Overview of the Celebrity Profiling Task at PAN 2020.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2020, 2020

Overview of the Cross-Domain Authorship Verification Task at PAN 2020.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2020, 2020

Overview of Touché 2020: Argument Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2020, 2020

Overview of Touché 2020: Argument Retrieval - Extended Abstract.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2020

Overview of PAN 2020: Authorship Verification, Celebrity Profiling, Profiling Fake News Spreaders on Twitter, and Style Change Detection.

[BibT_eX]

[DOI]

Günther Specht

Eva Zangerle

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2020

Exploring Argument Retrieval with Transformers.

[BibT_eX]

[DOI]

Christopher Akiki

Proceedings of the Working Notes of CLEF 2020, 2020

Web Page Segmentation Revisited: Evaluation Framework and Dataset.

[BibT_eX]

[DOI]

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

CauseNet: Towards a Causality Graph Extracted from the Web.

[BibT_eX]

[DOI]

Stefan Heindorf

Yan Scholten

Henning Wachsmuth

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

The Impact of Negative Relevance Judgments on NDCG.

[BibT_eX]

[DOI]

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Estimating Topic Difficulty Using Normalized Discounted Cumulated Gain.

[BibT_eX]

[DOI]

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Efficient Pairwise Annotation of Argument Quality.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Crawling and Preprocessing Mailing Lists At Scale for Dialog Analysis.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Target Inference in Argument Conclusion Generation.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

PAN19 Authorship Analysis: Cross-Domain Authorship Attribution.

[BibT_eX]

[DOI]

Dataset, November, 2019

args.me corpus.

[BibT_eX]

[DOI]

Dataset, July, 2019

Webis-Web-Errors-19.

[BibT_eX]

[DOI]

Dataset, April, 2019

Webis-Web-Archive-17 Content Error Annotations.

[BibT_eX]

[DOI]

Dataset, March, 2019

PAN19 Authorship Analysis: Style Change Detection.

[BibT_eX]

[DOI]

Dataset, January, 2019

PAN19 Authorship Analysis: Style Change Detection.

[BibT_eX]

[DOI]

Dataset, January, 2019

PAN19 Authorship Analysis: Style Change Detection.

[BibT_eX]

[DOI]

Dataset, January, 2019

PAN19 Authorship Analysis: Celebrity Profiling.

[BibT_eX]

[DOI]

Dataset, January, 2019

PAN19 Authorship Analysis: Celebrity Profiling.

[BibT_eX]

[DOI]

Dataset, January, 2019

Webis-Web-Archive-17 Content Error Annotations.

[BibT_eX]

[DOI]

Dataset, January, 2019

Modeling the usefulness of search results as measured by information use.

[BibT_eX]

[DOI]

Inf. Process. Manag., 2019

Argument Search: Assessing Argument Relevance.

[BibT_eX]

[DOI]

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

SemEval-2019 Task 4: Hyperpartisan News Detection.

[BibT_eX]

[DOI]

Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Generalizing Unmasking for Short Texts.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Summarizing E-sports matches and tournaments: the example of counter-strike: global offensive.

[BibT_eX]

[DOI]

Mathias Lux

Pål Halvorsen

Duc-Tien Dang-Nguyen

Håkon Kvale Stensland

Manoj Kesavulu

Michael Riegler

Proceedings of the 11th ACM Workshop on Immersive Mixed and Virtual Environment Systems, 2019

GameStory Task at MediaEval 2019.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019

Data Acquisition for Argument Search: The args.me Corpus.

[BibT_eX]

[DOI]

Proceedings of the KI 2019: Advances in Artificial Intelligence, 2019

A Dataset for Content Error Detection in Web Archives.

[BibT_eX]

[DOI]

Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019

Towards Summarization for Social Media - Results of the TL;DR Challenge.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on Natural Language Generation, 2019

Debiasing Vandalism Detection Models at Wikidata.

[BibT_eX]

[DOI]

Proceedings of the 49. Jahrestagung der Gesellschaft für Informatik, 50 Jahre Gesellschaft für Informatik, 2019

A Decade of Shared Tasks in Digital Text Forensics at PAN.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2019

Wikipedia Text Reuse: Within and Without.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2019

Overview of the Style Change Detection Task at PAN 2019.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2019, 2019

Overview of the Celebrity Profiling Task at PAN 2019.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2019, 2019

Overview of the Cross-domain Authorship Attribution Task at PAN 2019.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2019, 2019

Overview of PAN 2019: Bots and Gender Profiling, Celebrity Profiling, Cross-Domain Authorship Attribution and Style Change Detection.

[BibT_eX]

[DOI]

Günther Specht

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2019

Same Side Stance Classification Using Contextualized Sentence Embeddings.

[BibT_eX]

[DOI]

Erik Körner

Gerhard Heyer

Proceedings of the Same Side Stance Classification Shared Task organized as a part of the 6th Workshop on Argument Mining (ArgMining 2019) and co-located with the the 57th Annual Meeting of the Association for Computational Linguistics (ACL19), 2019

Celebrity Profiling.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Heuristic Authorship Obfuscation.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Bias Analysis and Mitigation in the Evaluation of Authorship Verification.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Evolution of the PAN Lab on Digital Text Forensics.

[BibT_eX]

[DOI]

Walter Daelemans

Proceedings of the Information Retrieval Evaluation in a Changing World, 2019

TIRA Integrated Research Architecture.

[BibT_eX]

[DOI]

Proceedings of the Information Retrieval Evaluation in a Changing World, 2019

2018

Data for PAN at SemEval 2019 Task 4: Hyperpartisan News Detection.

[BibT_eX]

[DOI]

Dataset, November, 2018

PAN18 Author Profiling.

[BibT_eX]

[DOI]

Francisco Rangel

Manuel Montes-y-Gómez

Dataset, September, 2018

PAN18 Author Identification: Attribution.

[BibT_eX]

[DOI]

Dataset, September, 2018

PAN18 Multi-Author Analysis: Style-Change-Detection.

[BibT_eX]

[DOI]

Dataset, September, 2018

PAN18 Author Identification: Attribution.

[BibT_eX]

[DOI]

Erika Patricia Garces Fernandez

Dataset, September, 2018

Webis YouTube 8M Augmented 2018.

[BibT_eX]

[DOI]

Dataset, July, 2018

Webis Wikipedia Text Reuse Corpus 2018 (Webis-Wikipedia-Text-Reuse-18).

[BibT_eX]

[DOI]

Dataset, July, 2018

Webis Wikipedia Text Reuse Corpus 2018 (Webis-Wikipedia-Text-Reuse-18).

[BibT_eX]

[DOI]

Dataset, July, 2018

Webis Clickbait Corpus 2017 (Webis-Clickbait-17).

[BibT_eX]

[DOI]

Dataset, June, 2018

Webis Clickbait Corpus 2017 (Webis-Clickbait-17).

[BibT_eX]

[DOI]

Erika Patricia Garces Fernandez

Dataset, June, 2018

BuzzFeed-Webis Fake News Corpus 2016.

[BibT_eX]

[DOI]

Dataset, February, 2018

Reproducible Web Corpora: Interactive Archiving with Automatic Quality Assessment.

[BibT_eX]

[DOI]

ACM J. Data Inf. Qual., 2018

Evaluation-as-a-Service for the Computational Sciences: Overview and Outlook.

[BibT_eX]

[DOI]

Jayashree Kalpathy-Cramer

ACM J. Data Inf. Qual., 2018

The Clickbait Challenge 2017: Towards a Regression Model for Clickbait Strength.

[BibT_eX]

[DOI]

CoRR, 2018

Heuristic Feature Selection for Clickbait Detection.

[BibT_eX]

[DOI]

CoRR, 2018

A User Study on Snippet Generation: Text Reuse vs. Paraphrases.

[BibT_eX]

[DOI]

Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Team ORG @ GameStory Task 2018.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2018 Workshop, 2018

GameStory Task at MediaEval 2018.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2018 Workshop, 2018

Task Proposal: The TL;DR Challenge.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on Natural Language Generation, 2018

Towards Crowdsourcing Clickbait Labels for YouTube Videos.

[BibT_eX]

[DOI]

Proceedings of the HCOMP 2018 Works in Progress and Demonstration Papers Track of the sixth AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2018), 2018

Predicting Retrieval Success Based on Information Use for Writing Tasks.

[BibT_eX]

[DOI]

Proceedings of the Digital Libraries for Open Knowledge, 2018

A Plan for Ancillary Copyright: Original Snippets.

[BibT_eX]

[DOI]

Proceedings of the Second International Workshop on Recent Trends in News Information Retrieval co-located with 40th European Conference on Information Retrieval (ECIR 2018), 2018

Shaping the Information Nutrition Label.

[BibT_eX]

[DOI]

Erika Patricia Garces Fernandez

Proceedings of the Second International Workshop on Recent Trends in News Information Retrieval co-located with 40th European Conference on Information Retrieval (ECIR 2018), 2018

Elastic ChatNoir: Search Engine for the ClueWeb and the Common Crawl.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2018

WASP: Web Archiving and Search Personalized.

[BibT_eX]

[DOI]

Proceedings of the First Biennial Conference on Design of Experimental Search & Information Retrieval Systems, 2018

CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

[BibT_eX]

[DOI]

Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 31, 2018

Crowdsourcing a Large Corpus of Clickbait on Twitter.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

Overview of PAN 2018 - Author Identification, Author Profiling, and Author Obfuscation.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2018

Overview of the Author Obfuscation Task at PAN 2018: A New Approach to Measuring Safety.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2018, 2018

Overview of the 6th Author Profiling Task at PAN 2018: Multimodal Gender Identification in Twitter.

[BibT_eX]

[DOI]

Manuel Montes-y-Gómez

Proceedings of the Working Notes of CLEF 2018, 2018

Overview of the Author Identification Task at PAN-2018: Cross-domain Authorship Attribution and Style Change Detection.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2018, 2018

A Stylometric Inquiry into Hyperpartisan and Fake News.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Webis-Web-Archive-17.

[BibT_eX]

[DOI]

Dataset, October, 2017

Webis-Web-Archive-17.

[BibT_eX]

[DOI]

Dataset, October, 2017

Webis-Web-Archive-17.

[BibT_eX]

[DOI]

Dataset, October, 2017

PAN17 Multi-Author Analysis: Style-Change-Detection.

[BibT_eX]

[DOI]

Dataset, September, 2017

PAN17 Author Profiling.

[BibT_eX]

[DOI]

Dataset, September, 2017

PAN17 Author Identification: Clustering.

[BibT_eX]

[DOI]

Francisco Rangel

Dataset, September, 2017

Webis Query Spelling Corpus 2017 (Webis-QSpell-17).

[BibT_eX]

[DOI]

Dataset, August, 2017

Webis Query Spelling Corpus 2017 (Webis-QSpell-17).

[BibT_eX]

[DOI]

Dataset, August, 2017

Proceedings of the WSDM Cup 2017: Vandalism Detection and Triple Scoring.

[BibT_eX]

[DOI]

Marie-Catherine de Marneffe

Stefan Heindorf

Hannah Bast

CoRR, 2017

Overview of the Wikidata Vandalism Detection Task at WSDM Cup 2017.

[BibT_eX]

[DOI]

CoRR, 2017

WSDM Cup 2017: Vandalism Detection and Triple Scoring.

[BibT_eX]

[DOI]

Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

A Large-Scale Query Spelling Correction Corpus.

[BibT_eX]

[DOI]

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Spatio-Temporal Analysis of Reverted Wikipedia Edits.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Web and Social Media, 2017

TL;DR: Mining Reddit to Learn Automatic Summarization.

[BibT_eX]

[DOI]

Proceedings of the Workshop on New Frontiers in Summarization, 2017

CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

[BibT_eX]

[DOI]

Christopher D. Manning

Héctor Martínez Alonso

Hector Fernandez Alcalde

Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2017

Overview of the Author Identification Task at PAN-2017: Style Breach Detection and Author Clustering.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2017, 2017

Overview of PAN'17 - Author Identification, Author Profiling, and Author Obfuscation.

[BibT_eX]

[DOI]

Francisco Manuel Rangel Pardo

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2017

Overview of the 5th Author Profiling Task at PAN 2017: Gender and Language Variety Identification in Twitter.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2017, 2017

Overview of the Author Obfuscation Task at PAN 2017: Safety Evaluation Revisited.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2017, 2017

Source Retrieval for Web-Scale Text Reuse Detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Building an Argument Search Engine for the Web.

[BibT_eX]

[DOI]

Proceedings of the 4th Workshop on Argument Mining, 2017

2016

PAN16 Author Profiling.

[BibT_eX]

[DOI]

Dataset, September, 2016

PAN16 Author Obfuscation: Author-Masking.

[BibT_eX]

[DOI]

Dataset, September, 2016

Wikidata Vandalism Corpus 2016 (WDVC-16).

[BibT_eX]

[DOI]

Dataset, September, 2016

PAN16 Author Identification: Clustering.

[BibT_eX]

[DOI]

Dataset, May, 2016

Webis Clickbait Corpus 2016 (Webis-Clickbait-16).

[BibT_eX]

[DOI]

Dataset, March, 2016

On Textual Analysis and Machine Learning for Cyberstalking Detection.

[BibT_eX]

[DOI]

Datenbank-Spektrum, 2016

Visualizing Article Similarities in Wikipedia.

[BibT_eX]

[DOI]

Proceedings of the 18th Eurographics Conference on Visualization, 2016

Passphone: Outsourcing Phone-Based Web Authentication While Protecting User Privacy.

[BibT_eX]

[DOI]

Proceedings of the Secure IT Systems - 21st Nordic Conference, NordSec 2016, Oulu, Finland, 2016

Algorithms and Corpora for Persian Plagiarism Detection: Overview of PAN at FIRE 2016.

[BibT_eX]

[DOI]

Proceedings of the Working notes of FIRE 2016, 2016

Clickbait Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2016

Who Wrote the Web? Revisiting Influential Author Identification Research Applicable to Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2016

Clustering by Authorship Within and Across Documents.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2016, 2016

Overview of PAN'16 - New Challenges for Authorship Analysis: Cross-Genre Profiling, Clustering, Diarization, and Obfuscation.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2016

Author Obfuscation: Attacking the State of the Art in Authorship Verification.

[BibT_eX]

[DOI]

Francisco Manuel Rangel Pardo

Proceedings of the Working Notes of CLEF 2016, 2016

Overview of the 4th Author Profiling Task at PAN 2016: Cross-Genre Evaluations.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2016, 2016

Vandalism Detection in Wikidata.

[BibT_eX]

[DOI]

Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

How Writers Search: Analyzing the Search and Writing Logs of Non-fictional Essays.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Human Information Interaction and Retrieval, 2016

2015

PAN15 Author Profiling.

[BibT_eX]

[DOI]

Dataset, September, 2015

Palkovskii15 Originality: Text Alignment.

[BibT_eX]

[DOI]

Dataset, September, 2015

PAN15 Author Identification: Verification.

[BibT_eX]

[DOI]

Jayashree Kalpathy-Cramer

Dataset, September, 2015

Wikidata Vandalism Corpus 2015 (WDVC-15).

[BibT_eX]

[DOI]

Dataset, August, 2015

Report on the Evaluation-as-a-Service (EaaS) Expert Workshop.

[BibT_eX]

[DOI]

SIGIR Forum, 2015

Evaluation-as-a-Service: Overview and Outlook.

[BibT_eX]

[DOI]

Jayashree Kalpathy-Cramer

CoRR, 2015

Visual Assessment of Alleged Plagiarism Cases.

[BibT_eX]

[DOI]

Comput. Graph. Forum, 2015

Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis.

[BibT_eX]

[DOI]

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Webis: An Ensemble for Twitter Sentiment Detection.

[BibT_eX]

[DOI]

Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Twitter Sentiment Detection via Ensemble Classification Using Averaged Confidence Scores.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2015

Overview of the PAN/CLEF 2015 Evaluation Lab.

[BibT_eX]

[DOI]

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2015

Overview of the Author Identification Task at PAN 2015.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2015, 2015

Towards Data Submissions for Shared Tasks: First Experiences for the Task of Text Alignment.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2015, 2015

Overview of the 3rd Author Profiling Task at PAN 2015.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2015, 2015

Source Retrieval for Plagiarism Detection from Large Web Corpora: Recent Approaches.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2015, 2015

2014

PAN14 Author Profiling.

[BibT_eX]

[DOI]

Dataset, September, 2014

PAN14 Originality: Source Retrieval.

[BibT_eX]

[DOI]

Dataset, September, 2014

PAN14 Author Identification: Verification.

[BibT_eX]

[DOI]

Miguel A. Sanchez-Perez

Dataset, September, 2014

PAN14 Originality: Text Alignment.

[BibT_eX]

[DOI]

Dataset, September, 2014

Improving Cloze Test Performance of Language Learners Using Web N-Grams.

[BibT_eX]

[DOI]

Proceedings of the COLING 2014, 2014

Overview of the Author Identification Task at PAN 2014.

[BibT_eX]

[DOI]

Miguel A. Sánchez-Pérez

Proceedings of the Working Notes for CLEF 2014 Conference, 2014

Overview of the 6th International Competition on Plagiarism Detection.

[BibT_eX]

[DOI]

Proceedings of the Working Notes for CLEF 2014 Conference, 2014

Improving the Reproducibility of PAN's Shared Tasks: - Plagiarism Detection, Author Identification, and Author Profiling.

[BibT_eX]

[DOI]

Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Interaction, 2014

Overview of the Author Profiling Task at PAN 2014.

[BibT_eX]

[DOI]

Proceedings of the Working Notes for CLEF 2014 Conference, 2014

2013

PAN13 Originality: Source Retrieval.

[BibT_eX]

[DOI]

Dataset, September, 2013

PAN13 Originality: Text Alignment.

[BibT_eX]

[DOI]

Dataset, September, 2013

Webis Crowd Paraphrase Corpus 2011 (Webis-CPC-11).

[BibT_eX]

[DOI]

Dataset, June, 2013

Paraphrase acquisition via crowdsourcing and machine learning.

[BibT_eX]

[DOI]

Steven Burrows

ACM Trans. Intell. Syst. Technol., 2013

Exploratory Search Missions for TREC Topics.

[BibT_eX]

[DOI]

Proceedings of the 3rd European Workshop on Human-Computer Interaction and Information Retrieval, 2013

Towards Optimum Query Segmentation: In Doubt Without.

[BibT_eX]

[DOI]

Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval, 2013

Overview of the 5th International Competition on Plagiarism Detection.

[BibT_eX]

[DOI]

Proceedings of the Working Notes for CLEF 2013 Conference , 2013

Recent Trends in Digital Text Forensics and Its Evaluation - Plagiarism Detection, Author Identification, and Author Profiling.

[BibT_eX]

[DOI]

Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Visualization, 2013

Crowdsourcing Interaction Logs to Understand Text Reuse from the Web.

[BibT_eX]

[DOI]

Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012

PAN12 Originality: Text-Alignment.

[BibT_eX]

[DOI]

Parth Gupta

Dataset, September, 2012

PAN12 Originality: Source Retrieval.

[BibT_eX]

[DOI]

Parth Gupta

Dataset, September, 2012

Webis Text Reuse Corpus 2012.

[BibT_eX]

[DOI]

Dataset, September, 2012

Technologies for Reusing Text from the Web

[BibT_eX]

[DOI]

PhD thesis, 2012

WORDGRAPH: Keyword-in-Context Visualization for NETSPEAK's Wildcard Search.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., 2012

Information Retrieval in the Commentsphere.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2012

Webis at the TREC 2012 Session Track.

[BibT_eX]

[DOI]

Proceedings of The Twenty-First Text REtrieval Conference, 2012

ChatNoir: a search engine for the ClueWeb09 corpus.

[BibT_eX]

[DOI]

Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Overview of the 4th International Competition on Plagiarism Detection.

[BibT_eX]

[DOI]

Parth Gupta

Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

2011

PAN Wikipedia Vandalism Corpus 2011 (PAN-WVC-11).

[BibT_eX]

[DOI]

Dataset, July, 2011

PAN Plagiarism Corpus 2011 (PAN-PC-11).

[BibT_eX]

[DOI]

Dataset, June, 2011

Fourth international workshop on uncovering plagiarism, authorship, and social software misuse.

[BibT_eX]

[DOI]

Moshe Koppel

SIGIR Forum, 2011

Cross-language plagiarism detection.

[BibT_eX]

[DOI]

Lang. Resour. Evaluation, 2011

Query segmentation revisited.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on World Wide Web, 2011

Technologien zur Wiederverwendung von Texten aus dem Web.

[BibT_eX]

[DOI]

Proceedings of the Ausgezeichnete Informatikdissertationen 2011, 2011

Overview of the 2nd International Competition on Wikipedia Vandalism Detection.

[BibT_eX]

[DOI]

Proceedings of the CLEF 2011 Labs and Workshop, 2011

Overview of the 3rd International Competition on Plagiarism Detection.

[BibT_eX]

[DOI]

Proceedings of the CLEF 2011 Labs and Workshop, 2011

The NETSPEAK WORDGRAPH: Visualizing keywords in context.

[BibT_eX]

[DOI]

Proceedings of the IEEE Pacific Visualization Symposium, 2011

2010

PAN Wikipedia Vandalism Corpus 2010 (PAN-WVC-10).

[BibT_eX]

[DOI]

Dataset, July, 2010

Webis Query Segmentation Corpus 2010 (Webis-QSeC-10).

[BibT_eX]

[DOI]

Dataset, July, 2010

PAN Plagiarism Corpus 2010 (PAN-PC-10).

[BibT_eX]

[DOI]

Dataset, May, 2010

Towards comment-based cross-media retrieval.

[BibT_eX]

[DOI]

Steffen Becker

Proceedings of the 19th International Conference on World Wide Web, 2010

Crowdsourcing a wikipedia vandalism corpus.

[BibT_eX]

[DOI]

Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

The power of naive query segmentation.

[BibT_eX]

[DOI]

Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Evaluating Humour Features on Web Comments.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Corpus and Evaluation Measures for Automatic Plagiarism Detection.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Retrieving Customary Web Language to Assist Writers.

[BibT_eX]

[DOI]

Martin Trenkmann

Proceedings of the Advances in Information Retrieval, 2010

Netspeak - Assisting Writers in Choosing Words.

[BibT_eX]

[DOI]

Martin Trenkmann

Proceedings of the Advances in Information Retrieval, 2010

Opinion Summarization of Web Comments.

[BibT_eX]

[DOI]

Steffen Becker

Proceedings of the Advances in Information Retrieval, 2010

Cross-Language High Similarity Search: Why No Sub-linear Time Bound Can Be Expected.

[BibT_eX]

[DOI]

Maik Anderka

Proceedings of the Advances in Information Retrieval, 2010

An Evaluation Framework for Plagiarism Detection.

[BibT_eX]

[DOI]

Proceedings of the COLING 2010, 2010

Overview of the 1st International Competition on Wikipedia Vandalism Detection.

[BibT_eX]

[DOI]

Proceedings of the CLEF 2010 LABs and Workshops, 2010

Overview of the 2nd International Competition on Plagiarism Detection.

[BibT_eX]

[DOI]

Proceedings of the CLEF 2010 LABs and Workshops, 2010

2009

PAN Plagiarism Corpus 2009 (PAN-PC-09).

[BibT_eX]

[DOI]

Dataset, September, 2009

Measuring the descriptiveness of web comments.

[BibT_eX]

[DOI]

Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

2008

Retrieval-Technologien für die Plagiaterkennung in Programmen.

[BibT_eX]

Proceedings of the LWA 2008, 2008

Automatic Vandalism Detection in Wikipedia.

[BibT_eX]

[DOI]

Robert Gerling

Proceedings of the Advances in Information Retrieval , 2008

A Wikipedia-Based Multilingual Retrieval Model.

[BibT_eX]

[DOI]

Maik Anderka

Proceedings of the Advances in Information Retrieval , 2008

2007

Webis Wikipedia Vandalism Corpus (Webis-WVC-07).

[BibT_eX]

[DOI]

Robert Gerling

Dataset, January, 2007

Strategies for retrieving plagiarized documents.

[BibT_eX]

[DOI]

Sven Meyer zu Eissen

Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Wikipedia in the pocket: indexing technology for near-duplicate detection and high similarity search.

[BibT_eX]

[DOI]

Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

New Issues in Near-duplicate Detection.

[BibT_eX]

[DOI]

Proceedings of the Data Analysis, Machine Learning and Applications, 2007

2006

Hashing-basierte Indizierung: Anwendungsszenarien, Theorie und Methoden.

[BibT_eX]

[DOI]

Proceedings of the LWA 2006: Lernen - Wissensentdeckung - Adaptivität, Hildesheim, Deutschland, October 9th-11th 2006, joint workshop event of several interest groups of the German Society for Informatics (GI) - 14th Workshop on Adaptivity and User Modeling in Interactive Systems (ABIS 2006) - Workshop Information Retrieval 2006 of the Special Interest Group Information Retrieval (FGIR 2006) - Workshop on Knowledge and Experience Management (FGWM 2006), 2006

Putting Successor Variety Stemming to Work.

[BibT_eX]

[DOI]