Leo Wanner

Orcid: 0000-0002-9446-3748

Affiliations:
  • Pompeu Fabra University


According to our database1, Leo Wanner authored at least 159 papers between 1989 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?
CoRR, 2024

2023
V4Design: Intelligent Analysis and Integration of Multimedia Content for Creative Industries.
IEEE Syst. J., June, 2023

Towards Weakly-Supervised Hate Speech Classification Across Datasets.
CoRR, 2023

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP.
CoRR, 2023

2022
Intelligent Interaction with Autonomous Assistants in the Wild (NII Shonan Meeting 188).
NII Shonan Meet. Rep., 2022

Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers.
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, 2022

Automatic Multilingual Incident Report Generation for Crisis Management.
Proceedings of the 19th International Conference on Information Systems for Crisis Response and Management, 2022

Social Media and Web Sensing on Interior and Urban Design.
Proceedings of the IEEE Symposium on Computers and Communications, 2022

Imageability-Based Multi-modal Analysis of Urban Environments for Architects and Artists.
Proceedings of the Image Analysis and Processing. ICIAP 2022 Workshops, 2022

Directions for NLP Practices Applied to Online Hate Speech Detection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
How well do hate speech, toxicity, abusive and offensive language classification models generalize across datasets?
Inf. Process. Manag., 2021

Towards a Versatile Intelligent Conversational Agent as Personal Assistant for Migrants.
Proceedings of the Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. The PAAMS Collection, 2021

ThemePro 2.0: Showcasing the Role of Thematic Progression in Engaging Human-Computer Interaction.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

How much pretraining data do language models need to learn syntax?
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

On the evolution of syntactic information encoded by BERT's contextualized representations.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Evaluating language models for the retrieval and categorization of lexical collocations.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Assessing the Syntactic Capabilities of Transformer-based Multilingual Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Concept Extraction Using Pointer-Generator Networks.
CoRR, 2020

User Identity Linkage in Social Media Using Linguistic and Social Interaction Features.
Proceedings of the WebSci '20: 12th ACM Conference on Web Science, 2020

CollFrEn: Rich Bilingual English-French Collocation Resource.
Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, 2020

Toxic, Hateful, Offensive or Abusive? What Are We Really Classifying? An Empirical Analysis of Hate Speech Datasets.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

ThemePro: A Toolkit for the Analysis of Thematic Progression.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Concept Extraction Using Pointer-Generator Networks and Distant Supervision for Data Augmentation.
Proceedings of the Knowledge Engineering and Knowledge Management, 2020

2019
A portable grammar-based NLG system for verbalization of structured data.
Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, 2019

Automatic Classification and Linguistic Analysis of Extremist Online Material.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Teaching FORGe to Verbalize DBpedia Properties in Spanish.
Proceedings of the 12th International Conference on Natural Language Generation, 2019

The Second Multilingual Surface Realisation Shared Task (SR'19): Overview and Evaluation Results.
Proceedings of the 2nd Workshop on Multilingual Surface Realisation, 2019

Collocation Classification with Unsupervised Relation Vectors.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
On the role of syntactic dependencies and discourse relations for author and gender identification.
Pattern Recognit. Lett., 2018

A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content.
Frontiers Robotics AI, 2018

Improving the Quality of Video-to-Language Models by Optimizing Annotation of the Training Material.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Generation of a Spanish Artificial Collocation Error Corpus.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Compilation of Corpora for the Study of the Information Structure-Prosody Interface.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018


1st International Workshop on Multimedia Analysis for Architecture, Design and Virtual Reality Games (MADVR 2018).
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2018

Sentence Packaging in Text Generation from Semantic Graphs as a Community Detection Problem.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

Underspecified Universal Dependency Structures as Inputs for Multilingual Surface Realisation.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

Towards expressive prosody generation in TTS for reading aloud applications.
Proceedings of the Fourth International Conference, 2018

Automatic Identification of Texts Written by Authors with Alzheimer's Disease.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

How Can Intelligent Conversational Agents Help? The Needs of Geriatric Patients and Their Caregivers.
Proceedings of the AAMAS Workshop on Intelligent Conversation Agents in Home and Geriatric Care Applications co-located with the Federated AI Meeting (FAIM 2018), 2018

On the Role of Communicative Structure in Read Aloud Applications for the Elderly.
Proceedings of the AAMAS Workshop on Intelligent Conversation Agents in Home and Geriatric Care Applications co-located with the Federated AI Meeting (FAIM 2018), 2018

2017
Using genre-specific features for patent summaries.
Inf. Process. Manag., 2017

Attentional Parallel RNNs for Generating Punctuation in Transcribed Speech.
Proceedings of the Statistical Language and Speech Processing, 2017

FORGe at SemEval-2017 Task 9: Deep sentence generation based on a sequence of graph transducers.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

KRISTINA: A Knowledge-Based Virtual Conversation Agent.
Proceedings of the Advances in Practical Applications of Cyber-Physical Multi-Agent Systems: The PAAMS Collection, 2017

Towards Reasoned Modality Selection in an Embodied Conversation Agent.
Proceedings of the Intelligent Virtual Agents - 17th International Conference, 2017

Prosograph: A Tool for Prosody Visualisation of Large Speech Corpora.
Proceedings of the Interspeech 2017, 2017

Using Prosody to Classify Discourse Relations.
Proceedings of the Interspeech 2017, 2017

A Thematicity-Based Prosody Enrichment Tool for CTS.
Proceedings of the Interspeech 2017, 2017

A demo of FORGe: the Pompeu Fabra Open Rule-based Generator.
Proceedings of the 10th International Conference on Natural Language Generation, 2017

Shared Task Proposal: Multilingual Surface Realization Using Universal Dependency Trees.
Proceedings of the 10th International Conference on Natural Language Generation, 2017

On the Relevance of Syntactic and Discourse Features for Author Profiling and Identification.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Revising the METU-Sabancı Turkish Treebank: An Exercise in Surface-Syntactic Annotation of Agglutinative Languages.
Proceedings of the Fourth International Conference on Dependency Linguistics, 2017

Automatic Extraction of Parallel Speech Corpora from Dubbed Movies.
Proceedings of the 10th Workshop on Building and Using Comparable Corpora, 2017

2016
Semantics-Driven Collocation Discovery.
Proces. del Leng. Natural, 2016

Data-driven deep-syntactic dependency parsing.
Nat. Lang. Eng., 2016

Environmental data extraction from heatmaps using the AirMerge system.
Multim. Tools Appl., 2016

Towards a Multimedia Knowledge-Based Agent with Social Competence and Human Interaction Capabilities.
Proceedings of the 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction, 2016

Towards an Ontology-Driven Adaptive Dialogue Framework.
Proceedings of the 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction, 2016

A Semi-Supervised Approach for Gender Identification.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Example-based Acquisition of Fine-grained Collocation Resources.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Towards Multiple Antecedent Coreference Resolution in Specialized Discourse.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Question Answering over Pattern-Based User Models.
Proceedings of the 12th International Conference on Semantic Systems, 2016

A Neural Network Architecture for Multilingual Punctuation Generation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Multilingual Natural Language Generation within Abstractive Summarization.
Proceedings of the 1st International Workshop on Multimodal Media Data Analytics co-located with the 22nd European Conference on Artificial Intelligence, 2016

The MULTISENSOR Project - Development of Multimedia Content Integration Technologies for Journalism, Media Monitoring and International Exporting Decision Support.
Proceedings of the 1st International Workshop on Multimodal Media Data Analytics co-located with the 22nd European Conference on Artificial Intelligence, 2016

Combining Dictionary- and Corpus-Based Concept Extraction.
Proceedings of the 1st International Workshop on Multimodal Media Data Analytics co-located with the 22nd European Conference on Artificial Intelligence, 2016

Praat on the Web: An Upgrade of Praat for Semi-Automatic Speech Annotation.
Proceedings of the COLING 2016, 2016

An Automatic Prosody Tagger for Spontaneous Speech.
Proceedings of the COLING 2016, 2016

Extending WordNet with Fine-Grained Collocational Information via Supervised Distributional Learning.
Proceedings of the COLING 2016, 2016

Towards Multilingual Natural Language Generation Within Abstractive Summarization.
Proceedings of the Artificial Intelligence Research and Development, 2016

Authorship Attribution Using Syntactic Dependencies.
Proceedings of the Artificial Intelligence Research and Development, 2016

Semantics-Driven Recognition of Collocations Using Word Embeddings.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Classification of Grammatical Collocation Errors in the Writings of Learners of Spanish.
Proces. del Leng. Natural, 2015

Ontology-centered environmental information delivery for personalized decision support.
Expert Syst. Appl., 2015

Getting the environmental information across: from the Web to the user.
Expert Syst. J. Knowl. Eng., 2015

Fusion of meteorological and air quality data extracted from the web for personalized environmental information services.
Environ. Model. Softw., 2015

Representing and Visualizing Text as Ontologies: A Case from the Patent Domain.
Proceedings of the International Workshop on Visualizations and User Interfaces for Ontologies and Linked Data co-located with 14th International Semantic Web Conference (ISWC 2015), 2015

Classification of Lexical Collocation Errors in the Writings of Learners of Spanish.
Proceedings of the Recent Advances in Natural Language Processing, 2015

Visualizing Deep-Syntactic Parser Output.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Data-driven sentence generation with non-isomorphic trees.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

MULTISENSOR: Development of multimedia content integration technologies for journalism, media monitoring and international exporting decision support.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Towards a multi-layered dependency annotation of Finnish.
Proceedings of the Third International Conference on Dependency Linguistics, 2015

Multiple Language Gender Identification for Blog Posts.
Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

2014
Natural Language Generation in the context of the Semantic Web.
Semantic Web, 2014

Towards advanced collocation error correction in Spanish learner corpora.
Lang. Resour. Evaluation, 2014

The extraction and fusion of meteorological and air quality information for orchestrated services.
Proceedings of the 1st International Workshop on Environnmental Multimedia Retrieval co-located with ACM International Conference on Multimedia Retrieval, 2014

How to Use less Features and Reach Better Performance in Author Gender Identification.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

An Exercise in Reuse of Resources: Adapting General Discourse Coreference Resolution for Detecting Lexical Chains in Patent Documentation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Classifiers for data-driven deep sentence generation.
Proceedings of the INLG 2014, 2014

Deep-Syntactic Parsing.
Proceedings of the COLING 2014, 2014

2013
Towards the Annotation of Penn TreeBank with Information Structure.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

The fusion of meteorological- and air quality information for orchestrated services using environmental profiling.
Proceedings of the 16th International Conference on Information Fusion, 2013

Involving the Expert in the Delivery of Environmental Information from the Web.
Proceedings of the 27th International Conference on Environmental Informatics for Environmental Protection, 2013

Overview of the First Content Selection Challenge from Open Semantic Web Data.
Proceedings of the ENLG 2013, 2013

AnCora-UPF: A Multi-Level Annotation of Spanish.
Proceedings of the Second International Conference on Dependency Linguistics, 2013

2012
Perspective-oriented generation of football match summaries: Old tasks, new challenges.
ACM Trans. Speech Lang. Process., 2012

Co-occurrence Graphs Applied to Taxonomy Extraction in Scientific and Technical Corpora.
Proces. del Leng. Natural, 2012

Labeling Semantically Motivated Clusters of Verbal Relations.
Proces. del Leng. Natural, 2012

Generation of patent abstracts: a challenge for automatic text summarization.
Proceedings of the 2nd International Workshop on Exploiting Large Knowledge Repositories, 2012

From Ontology to NL: Generation of Multilingual User-Oriented Environmental Reports.
Proceedings of the Natural Language Processing and Information Systems, 2012

Towards a Surface Realization-Oriented Corpus Annotation.
Proceedings of the INLG 2012 - Proceedings of the Seventh International Natural Language Generation Conference, 30 May 2012, 2012

Content Selection From Semantic Web Data.
Proceedings of the INLG 2012 - Proceedings of the Seventh International Natural Language Generation Conference, 30 May 2012, 2012

The Surface Realisation Task: Recent Developments and Future Plans.
Proceedings of the INLG 2012 - Proceedings of the Seventh International Natural Language Generation Conference, 30 May 2012, 2012



Generation of Multilingual Personalized Environmental Bulletins from an OWL-based Ontology.
Proceedings of the Light up the Ideas of Environmental Informatics: Proceedings of the 26th International Conference on Informatics for Environmental Protection, 2012

How Does the Granularity of an Annotation Scheme Influence Dependency Parsing Performance?
Proceedings of the COLING 2012, 2012

2011
Towards the derivation of verbal content relations from patent claims using deep syntactic structures.
Knowl. Based Syst., 2011


FootbOWL: Using a Generic Ontology of Football Competition for Planning Match Summaries.
Proceedings of the Semantic Web: Research and Applications, 2011

Content selection from an ontology-based knowledge base for the generation of football summaries.
Proceedings of the ENLG 2011, 2011

StuMaBa : From Deep Representation to Surface.
Proceedings of the ENLG 2011, 2011

Looking Behind the Scenes of Syntactic Dependency Corpus Annotation: Towards a Motivated Annotation Schema of Surface-Syntax in Spanish.
Proceedings of the Computational Dependency Theory [papers from the International Conference on Dependency Linguistics, 2011

One Step further towards Stochastic Semantic Sentence Generation.
Proceedings of the Computational Dependency Theory [papers from the International Conference on Dependency Linguistics, 2011

2010
Report Generation.
Proceedings of the Handbook of Natural Language Processing, Second Edition., 2010

Marquis: Generation of User-Tailored Multilingual Air Quality Bulletins.
Appl. Artif. Intell., 2010

Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Open Soucre Graph Transducer Interpreter and Grammar Development Environment.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Broad Coverage Multilingual Deep Sentence Generation with a Stochastic Multi-Level Realizer.
Proceedings of the COLING 2010, 2010

2009
Hacia una anotación de dependencias enriquecida de corpus españoles.
Proces. del Leng. Natural, 2009

Improving the comprehension of legal documentation: the case of patent claims.
Proceedings of the 12th International Conference on Artificial Intelligence and Law, 2009

Simplification of Patent Claim Sentences for their Paraphrasing and Summarization.
Proceedings of the Twenty-Second International Florida Artificial Intelligence Research Society Conference, 2009

2008
Morphological mismatches in machine translation.
Mach. Transl., 2008

Using Semantically Annotated Corpora to Build Collocation Resources.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Making Text Resources Accessible to the Reader: the Case of Patent Claims.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Two-step flow in bilingual lexicon extraction from unrelated corpora.
Proceedings of the 12th Annual conference of the European Association for Machine Translation, 2008

Multilingual summarization in practice: the case of patent claims.
Proceedings of the 12th Annual conference of the European Association for Machine Translation, 2008

2007
Towards Quantitative Concept Analysis.
Proces. del Leng. Natural, 2007

A Modular Framework for Ontology-based Representation of Patent Information.
Proceedings of the Legal Knowledge and Information Systems, 2007

Towards New Generation Environmental Information Services.
Proceedings of the Environmental Informatics and Systems Research: Proceedings of the 21st International Conference for Environmental Protection, 2007

Addressee-Tailored Interpretation of Air Quality Data.
Proceedings of the Environmental Informatics and Systems Research: Proceedings of the 21st International Conference for Environmental Protection, 2007

Pollen: A Challenge for Environmental Information Services.
Proceedings of the Environmental Informatics and Systems Research: Proceedings of the 21st International Conference for Environmental Protection, 2007

Text Planning of Air Quality Information.
Proceedings of the Environmental Informatics and Systems Research: Proceedings of the 21st International Conference for Environmental Protection, 2007

Automatic Production of Multilingual Environmental Information.
Proceedings of the Environmental Informatics and Systems Research: Proceedings of the 21st International Conference for Environmental Protection, 2007

2006
Discourse Structuring of Dynamic Content.
Proces. del Leng. Natural, 2006

Syntactic mismatches in machine translation.
Mach. Transl., 2006

Making sense of collocations.
Comput. Speech Lang., 2006

PATExpert: Semantic Processing of Patent Documentation.
Proceedings of the Poster and Demo Proceedings of the 1st International Conference on Semantic and Digital Media Technologies, 2006

Local Document Relevance Clustering in IR Using Collocation Information.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

2004
Towards automatic fine-grained semantic classification of verb-noun collocations.
Nat. Lang. Eng., 2004

Enriching the Spanish EuroWordNet by Collocations.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

2003
Información colocacional y recuperación de la información.
Proces. del Leng. Natural, 2003

Deriving the Communicative Structure in Applied NLG.
Proceedings of the 9th European Workshop on Natural Language Generation, 2003

2001
Towards a Lexicographic Approach to Lexical Transfer in Machine Translation (Illustrated by the German-Russian Language Pair).
Mach. Transl., 2001

On Using a Parallel Graph Rewriting Formalism in Generation.
Proceedings of the ACL 2001 Eighth European Workshop on Natural Language Generation, 2001

2000
A development Environment for an MTT-Based Sentence Generator.
Proceedings of the INLG 2000, 2000

1998
De-Constraining Text Generation.
Proceedings of the Ninth International Workshop on Natural Language Generation, 1998

1997
Exploring lexical resources for text generation in a systemic functional language model.
PhD thesis, 1997

1996
Lexical choice in text generation and machine translation.
Mach. Transl., 1996

Editor's note.
Mach. Transl., 1996

The HealthDoc Sentence Planner.
Proceedings of the Eighth International Natural Language Generation Workshop, 1996

1994
Building Another Bridge over the Generation Gap.
Proceedings of the Seventh International Workshop on Natural Language Generation, 1994

On Lexically Biased Discourse Organization In Text Generation.
Proceedings of the 15th International Conference on Computational Linguistics, 1994

1992
Lexical Choice and the Organization of Lexical Resources in Text Generation.
Proceedings of the 10th European Conference on Artificial Intelligence, 1992

1990
Towards a Lexicon for German Organized by Communicative Function: an Application of 'Lexical Functions'.
Proceedings of the GWAI-90, 1990

A collocational based approach to salience-sensitive lexical selection.
Proceedings of the Fifth International Workshop on Natural Language Generation, 1990

1989
Multimediale Wissensverarbeitung in integrierten Publikations- und Informationssystemen.
Künstliche Intell., 1989


  Loading...