Georg Rehm

Orcid: 0000-0002-7800-1893

Affiliations:
  • DFKI GmbH, Speech and Language Technology Lab, Berlin, Germany


According to our database1, Georg Rehm authored at least 107 papers between 2000 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Strategic Research, Innovation and Implementation Agenda for Digital Language Equality in Europe by 2030.
Proceedings of the European Language Equality, 2023

European Language Equality: Introduction.
Proceedings of the European Language Equality, 2023

Consulting the Community: How to Reach Digital Language Equality in Europe by 2030?
Proceedings of the European Language Equality, 2023

Results of the Forward-looking Community-wide Consultation.
Proceedings of the European Language Equality, 2023

Strategic Plans and Projects in Language Technology and Artificial Intelligence.
Proceedings of the European Language Equality, 2023

Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning.
CoRR, 2023

Towards FAIR Semantic Publishing of Research Dataset Metadata in the Open Research Knowledge Graph.
Proceedings of the Joint Proceedings of the Onto4FAIR 2023 Workshops, 2023


Research Knowledge Graphs in NFDI4DS.
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023

NFDI4DS Shared Tasks.
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023

NFDI4DS Transfer and Application.
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023

Integration of a Semantic Storytelling Recommender System in Speech Assistants.
Proceedings of Text2Story, 2023

Open Science Best Practices in Data Science and Artificial Intelligence.
Proceedings of the 1st Conference on Research Data Infrastructure - Connecting Communities, 2023

2022
Language Report German.
Proceedings of the European Language Equality, 2022

European Language Technology in 2022/2023.
Proceedings of the European Language Equality, 2022

Digital Language Equality: Definition, Metric, Dashboard.
Proceedings of the European Language Equality, 2022

Lynx: A knowledge-based AI service platform for content processing, enrichment and analysis for the legal domain.
Inf. Syst., 2022

Identification of Relations between Text Segments for Semantic Storytelling.
Proceedings of the Third Conference on Digital Curation Technologies (Qurator 2022), 2022

Automatic Assessment of Online Content Credibility by Measuring the Adherence to Journalistic Standards.
Proceedings of the Third Conference on Digital Curation Technologies (Qurator 2022), 2022

Making a Semantic Event-type Ontology Multilingual.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Semantic Relations between Text Segments for Semantic Storytelling: Annotation Tool - Dataset - Evaluation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

A Dataset of Offensive German Language Tweets Annotated for Speech Acts.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Claim Extraction and Law Matching for COVID-19-related Legislation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Generating Extended and Multilingual Summaries with Pre-trained Transformers.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Modelling Cultural and Socio-Economic Dimensions of Political Bias in German Tweets.
Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022), 2022

Specialized document embeddings for aspect-based similarity of research papers.
Proceedings of the JCDL '22: The ACM/IEEE Joint Conference on Digital Libraries in 2022, Cologne, Germany, June 20, 2022

Learning Ontology Classes from Text by Clustering Lexical Substitutes Derived from Language Models.
Proceedings of the Towards a Knowledge-Aware AI - SEMANTiCS 2022, 2022

Evaluating Web Content Using the W3C Credibility Signals.
Proceedings of the Towards a Knowledge-Aware AI - SEMANTiCS 2022, 2022

Plow: A Novel Approach to Interlinking Modular Ontologies Based on Software Package Management.
Proceedings of the Towards a Knowledge-Aware AI - SEMANTiCS 2022, 2022

User Experience Design for Automatic Credibility Assessment of News Content About COVID-19.
Proceedings of the HCI International 2022 - Late Breaking Papers. Interaction in New Media, Learning and Games, 2022

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Suspicious Sentence Detection and Claim Verification in the COVID-19 Domain.
Proceedings of the 2nd Workshop Reducing Online Misinformation through Credible Information Retrieval 2022 co-located with The 44th European Conference on Information Retrieval (ECIR 2022), 2022

Overview of the ELE Project.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Was sehe ich? Visualisierungsstrategien für Datentransparenz in der Historischen Netzwerkanalyse.
Proceedings of the 8. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2022

HiStruct+: Improving Extractive Text Summarization with Hierarchical Structure Information.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Combining Knowledge about Text Types and Document Structures for Enhanced Content Curation.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Berlin, Germany, February 8th - to, 2021

Parsing Discourse Structures for Semantic Storytelling: Evaluating an efficient RST Parser.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Berlin, Germany, February 8th - to, 2021

SynSemClass for German: Extending a Multilingual Verb Lexicon.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Berlin, Germany, February 8th - to, 2021

Annotation of Fine-Grained Geographical Entities in German Texts.
Proceedings of the 3rd Conference on Language, Data and Knowledge, 2021

DFKI SLT at GermEval 2021: Multilingual Pre-training and Data Augmentation for the Classification of Toxicity in Social Media Comments.
Proceedings of the GermEval 2021 Shared Task on the Identification of Toxic, 2021

Evaluating document representations for content-based legal literature recommendations.
Proceedings of the ICAIL '21: Eighteenth International Conference for Artificial Intelligence and Law, São Paulo Brazil, June 21, 2021


Ordering sentences and paragraphs with pre-trained encoder-decoder transformers and pointer ensembles.
Proceedings of the DocEng '21: ACM Symposium on Document Engineering 2021, 2021

2020
A Workflow Manager for Complex NLP and Content Curation Pipelines.
CoRR, 2020

Towards Discourse Parsing-inspired Semantic Storytelling.
CoRR, 2020

Observations on Annotations.
CoRR, 2020

Automatic Induction of Named Entity Classes from Legal Text Corpora.
Proceedings of the Joint Proceedings of Workshops AI4LEGAL2020, 2020

Towards Discourse Parsing-inspired Semantic Storytelling.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2020), Berlin, Germany, January 20th, 2020


Named Entities in Medical Case Reports: Corpus and Experiments.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Orchestrating NLP Services for the Legal Domain.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Workflow Manager for Complex NLP and Content Curation Workflows.
Proceedings of the 1st International Workshop on Language Technology Platforms, 2020




A Dataset of German Legal Documents for Named Entity Recognition.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Making Metadata Fit for Next Generation Language Technology Platforms: The Metadata Schema of the European Language Grid.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Abstractive Text Summarization based on Language Model Conditioning and Locality Modeling.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

SoNAR (IDH): Datenschnittstellen für historische Netzwerkanalyse.
Proceedings of the 7. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2020

Aspect-based Document Similarity for Research Papers.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Enriching BERT with Knowledge Graph Embeddings for Document Classification.
Proceedings of the 15th Conference on Natural Language Processing, 2019

Fine-Grained Named Entity Recognition in Legal Documents.
Proceedings of the Semantic Systems. The Power of AI and Knowledge Graphs, 2019

Semantic Storytelling: Towards Identifying Storylines in Large Amounts of Text Content.
Proceedings of Text2Story, 2019

Curation Technologies for Cultural Heritage Archives: Analysing and transforming a heterogeneous data set into an interactive curation workbench.
Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage, 2019

2018
Automatic and Manual Web Annotations in an Infrastructure to handle Fake News and other Online Media Phenomena.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Language Technology for Multilingual Europe: An Analysis of a Large-Scale Survey regarding Challenges, Demands, Gaps and Needs.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2017
Improving Machine Translation through Linked Data.
Prague Bull. Math. Linguistics, 2017

DFKI-DKT at SemEval-2017 Task 8: Rumour Detection and Classification using Cascading Heuristics.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Towards User Interfaces for Semantic Storytelling.
Proceedings of the Human Interface and the Management of Information: Supporting Learning, Decision-Making and Collaboration, 2017

Designing User Interfaces for Curation Technologies.
Proceedings of the Human Interface and the Management of Information: Information, Knowledge and Interaction Design, 2017

Different German and English Coreference Resolution Models for Multi-domain Content Curation Scenarios.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017

Different Types of Automated and Semi-automated Semantic Storytelling: Curation Technologies for Different Sectors.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017

An Infrastructure for Empowering Internet Users to Handle Fake News and Other Online Media Phenomena.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017

Automatic Classification of Abusive Language and Personal Attacks in Various Forms of Online Communication.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017

Semantic Storytelling, Cross-lingual Event Detection and other Semantic Services for a Newsroom Content Curation Dashboard.
Proceedings of the 2017 Workshop: Natural Language Processing meets Journalism, 2017

From Clickbait to Fake News Detection: An Approach based on Detecting the Stance of Headlines to Articles.
Proceedings of the 2017 Workshop: Natural Language Processing meets Journalism, 2017


Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters.
Proceedings of the Events and Stories in the News Workshop@ACL 2017, 2017

2016
The strategic impact of META-NET on the regional, national and international level.
Lang. Resour. Evaluation, 2016

Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language Resources.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer.
Proceedings of the Semantic Web - ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29, 2016

Digital curation technologies (DKT).
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016

CRACKER - cracking the language barrier. Selected results 2015/2016.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016

Processing Document Collections to Automatically Extract Linked Data: Semantic Storytelling Technologies for Smart Curation Workflows.
Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web, 2016

2015
Digitale Kuratierungstechnologien: Verfahren für die effiziente Verarbeitung, Erstellung und Verteilung qualitativ hochwertiger Medieninhalte.
Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology, 2015

CRACKER: Cracking the Language Barrier.
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

2014
META-SHARE: One year after.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2013
The State of Computational Morphology for Europe's Languages and the META-NET Strategic Research Agenda.
Proceedings of the Systems and Frameworks for Computational Morphology, 2013

META - Multilingual Europe Technology Alliance.
Proceedings of Machine Translation Summit XIV: European projects, 2013

MATECAT: Machine Translation Enhanced Computer Assisted Translation META - Multilingual Europe Technology Alliance.
Proceedings of Machine Translation Summit XIV: European projects, 2013

Strategic Research Agenda for Multilingual Europe 2020 - presented by the META Technology Council.
White Paper Series, Springer, ISBN: 978-3-642-36349-8, 2013

2012
The German Language in the Digital Age
White Paper Series, Springer, ISBN: 978-3-642-27166-3, 2012

2009
SusTEInability of linguistic resources through feature structures.
Lit. Linguistic Comput., 2009

Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources.
Lit. Linguistic Comput., 2009

2008
A Web-Platform for Preserving, Exploring, Visualising, and Querying Linguistic Corpora and other Resources.
Proces. del Leng. Natural, 2008

Digital Text Collections, Linguistic Research Data, and Mashups: Notes on the Legal Situation.
Libr. Trends, 2008

The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Ontology-Based XQuery'ing of XML-Encoded Language Resources on Multiple Annotation Layers.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

2006
Hypertextsorten: Definition - Struktur - Klassifikation.
PhD thesis, 2006

2005
Language-Independent Text Parsing of Arbitrary HTML-Documents. Towards A Foundation For Web Genre Identification.
LDV Forum, 2005

2002
Towards Automatic Web Genre Identification.
Proceedings of the 35th Hawaii International Conference on System Sciences (HICSS-35 2002), 2002

2001
Die Chronik der Chronik - Erfahrungen über die Konvertierung und Weiterverarbeitung proprietär annotierter Daten.
Proceedings of the Proceedings der GLDV-Frühjahrstagung 2001, 2001

korpus.html - Zur Sammlung, Datenbankbasierten Erfassung, Annotation und Auswertung von HTML-Dokumenten.
Proceedings of the Proceedings der GLDV-Frühjahrstagung 2001, 2001

2000
From Open Source to Open Information: Collaborative Methods in Creating XML-Based Markup Languages.
Proceedings of the Electronic Publishing 2000, Electronic Publishing in the Third Millenium: 4th ICCC/IFIP conference held at Kaliningrad/Svetlogorsk, 2000


  Loading...