Fabrizio Sebastiani

Orcid: 0000-0003-4221-6427

Affiliations:
  • Qatar Computing Research Institute, Doha, Qatar
  • CNR, Pisa, Italy (former)


According to our database1, Fabrizio Sebastiani authored at least 165 papers between 1990 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multi-Label Quantification.
ACM Trans. Knowl. Discov. Data, January, 2024

Same or Different? Diff-Vectors for Authorship Analysis.
ACM Trans. Knowl. Discov. Data, January, 2024

2023
Unravelling interlanguage facts via explainable machine learning.
Digit. Scholarsh. Humanit., August, 2023

Improved risk minimization algorithms for technology-assisted review.
Intell. Syst. Appl., May, 2023

Generalized Funnelling: Ensemble Learning and Heterogeneous Document Embeddings for Cross-Lingual Text Classification.
ACM Trans. Inf. Syst., April, 2023

Syllabic quantity patterns as rhythmic features for Latin authorship attribution.
J. Assoc. Inf. Sci. Technol., January, 2023

Learning to Quantify
The Information Retrieval Series 47, Springer, ISBN: 978-3-031-20466-1, 2023

Measuring Fairness Under Unawareness of Sensitive Attributes: A Quantification-Based Approach.
J. Artif. Intell. Res., 2023

Explainable Authorship Identification in Cultural Heritage Applications: Analysis of a New Perspective.
CoRR, 2023

Regularization-Based Methods for Ordinal Quantification.
CoRR, 2023

Binary Quantification and Dataset Shift: An Experimental Investigation.
CoRR, 2023

2022
Report on the 13th Conference and Labs of the Evaluation Forum (CLEF 2022): Experimental IR Meets Multilinguality, Multimodality, and Interaction.
SIGIR Forum, December, 2022

Lost in Transduction: Transductive Transfer Learning in Text Classification.
ACM Trans. Knowl. Discov. Data, 2022

Report on the 1st International Workshop on Learning to Quantify (LQ 2021).
SIGKDD Explor., 2022

MedLatinEpi and MedLatinLit: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts.
ACM Journal on Computing and Cultural Heritage, 2022

Ordinal Quantification Through Regularization.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

LeQua@CLEF2022: Learning to Quantify.
Proceedings of the Advances in Information Retrieval, 2022

A Concise Overview of LeQua@CLEF 2022: Learning to Quantify.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2022

A Detailed Overview of LeQua@CLEF 2022: Learning to Quantify.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Active Learning and the Saerens-Latinne-Decaestecker Algorithm: An Evaluation.
Proceedings of the 2nd Joint Conference of the Information Retrieval Communities in Europe (CIRCLE 2022), 2022

2021
A Critical Reassessment of the Saerens-Latinne-Decaestecker Algorithm for Posterior Probability Adjustment.
ACM Trans. Inf. Syst., 2021

Report on the 43rd european conference on information retrieval (ECIR 2021).
SIGIR Forum, 2021

Word-class embeddings for multiclass text classification.
Data Min. Knowl. Discov., 2021

Measuring Fairness under Unawareness via Quantification.
CoRR, 2021

Heterogeneous document embeddings for cross-lingual text classification.
Proceedings of the SAC '21: The 36th ACM/SIGAPP Symposium on Applied Computing, 2021

Garbled-Word Embeddings for Jumbled Text.
Proceedings of the 11th Italian Information Retrieval Workshop 2021, 2021

Re-assessing the "Classify and Count" Quantification Method.
Proceedings of the Advances in Information Retrieval, 2021

QuaPy: A Publicly Available Python-Based Software Library for Quantification.
Proceedings of the CIKM 2021 Workshops co-located with 30th ACM International Conference on Information and Knowledge Management (CIKM 2021), 2021

QuaPy: A Python-Based Framework for Quantification.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

LeQua @ CLEF 2022: A Shared Task for Evaluating Quantification Systems.
Proceedings of the CIKM 2021 Workshops co-located with 30th ACM International Conference on Information and Knowledge Management (CIKM 2021), 2021

Learning to Quantify: Methods and Applications (LQ 2021).
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Learning to Weight for Text Classification.
IEEE Trans. Knowl. Data Eng., 2020

Transitioning the information retrieval literature to a fully open access model.
SIGIR Forum, 2020

Report on the 2nd ACM SIGIR/SIGKDD Africa school on machine learning for data mining and search.
SIGIR Forum, 2020

Evaluation measures for quantification: an axiomatic approach.
Inf. Retr. J., 2020

Cross-Lingual Sentiment Quantification.
IEEE Intell. Syst., 2020

Tweet Sentiment Quantification: An Experimental Re-Evaluation.
CoRR, 2020

MedLatin1 and MedLatin2: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts.
CoRR, 2020

2019
Jointly Minimizing the Expected Costs of Review for Responsiveness and Privilege in E-Discovery.
ACM Trans. Inf. Syst., 2019

Funnelling: A New Ensemble Method for Heterogeneous Transfer Learning and Its Application to Cross-Lingual Text Classification.
ACM Trans. Inf. Syst., 2019

Building Automated Survey Coders via Interactive Machine Learning.
CoRR, 2019

Funnelling: A New Ensemble Method for Heterogeneous Transfer Learning and its Application to Polylingual Text Classification.
CoRR, 2019

Learning to Quantify: Estimating Class Prevalence via Supervised Learning.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Evaluating Variable-Length Multiple-Option Lists in Chatbots and Mobile Search.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

The Epistle to Cangrande Through the Lens of Computational Authorship Verification.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2019, 2019

Tutorial: Supervised Learning for Prevalence Estimation.
Proceedings of the Flexible Query Answering Systems - 13th International Conference, 2019

2018
Sentiment Quantification of User-Generated Content.
Proceedings of the Encyclopedia of Social Network Analysis and Mining, 2nd Edition, 2018

Multimedia Information Retrieval Model.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Optimizing non-decomposable measures with deep networks.
Mach. Learn., 2018

The Impacts of Low-Quality Training Data on Information Extraction from Clinical Reports.
ERCIM News, 2018

Revisiting Distributional Correspondence Indexing: A Python Reimplementation and New Experiments.
CoRR, 2018

Distributional Correspondence Indexing for Cross-Lingual and Cross-Domain Sentiment Classification (Extended Abstract).
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Lightweight Random Indexing for Polylingual Text Classification (Extended Abstract).
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

A Recurrent Neural Network for Sentiment Quantification.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

How Data Mining and Machine Learning Evolved from Relational Data Base to Data Science.
Proceedings of the A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years., 2018

2017
On the Effects of Low-Quality Training Data on Information Extraction from Clinical Reports.
ACM J. Data Inf. Qual., 2017

Distributional Correspondence Indexing for Cross-Lingual and Cross-Domain Sentiment Classification.
ERCIM News, 2017

Lightweight Random Indexing for Polylingual Text Classification.
ERCIM News, 2017

QT2S: A System for Monitoring Road Traffic Via Fine Grounding of Tweets.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

2016
From classification to quantification in tweet sentiment analysis.
Soc. Netw. Anal. Min., 2016

Stochastic Optimization Techniques for Quantification Performance Measures.
CoRR, 2016

The Challenge of Sentiment Quantification.
Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, 2016

Distributional Random Oversampling for Imbalanced Text Classification.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Ordinal Text Quantification.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

SemEval-2016 Task 4: Sentiment Analysis in Twitter.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

QCRI at SemEval-2016 Task 4: Probabilistic Methods for Binary and Ordinal Quantification.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Online Optimization Methods for the Quantification Problem.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Transductive Distributional Correspondence Indexing for Cross-Domain Topic Classification.
Proceedings of the 7th Italian Information Retrieval Workshop, 2016

2015
Optimizing Text Quantifiers for Multivariate Loss Functions.
ACM Trans. Knowl. Discov. Data, 2015

Utility-Theoretic Ranking for Semiautomated Text Classification.
ACM Trans. Knowl. Discov. Data, 2015

Bridging social media via distant supervision.
Soc. Netw. Anal. Min., 2015

Classifying websites by industry sector: a study in feature design.
Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

Multi-store metadata-based supervised mobile app classification.
Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

Distant Supervision for Tweet Classification Using YouTube Labels.
Proceedings of the Ninth International Conference on Web and Social Media, 2015

An Axiomatically Derived Measure for the Evaluation of Classification Algorithms.
Proceedings of the 2015 International Conference on The Theory of Information Retrieval, 2015

Quantification in social networks.
Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, 2015

Semi-Automated Text Classification for Sensitivity Identification.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

Tweet Sentiment: From Classification to Quantification.
Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2015

2014
Feature Selection for Ordinal Text Classification.
Neural Comput., 2014

Text Quantification.
Proceedings of the Advances in Information Retrieval, 2014

Hierarchical Multi-label Conditional Random Fields for Aspect-Oriented Opinion Mining.
Proceedings of the Advances in Information Retrieval, 2014

Explicit Loss Minimization in Quantification Applications (Preliminary Draft).
Proceedings of the 8th International Workshop on Information Filtering and Retrieval co-located with XIII AI*IA Symposium on Artificial Intelligence (AI*IA 2014), 2014

2013
Improving Text Classification Accuracy by Training Label Cleaning.
ACM Trans. Inf. Syst., 2013

StarTrack: The Next Generation (of Product Review Management Tools).
New Gener. Comput., 2013

Editorial.
J. Discrete Algorithms, 2013

An enhanced CRFs-based system for information extraction from radiology reports.
J. Biomed. Informatics, 2013

Endorsements and rebuttals in blog distillation.
Inf. Sci., 2013

Using micro-documents for feature selection: The case of ordinal text classification.
Expert Syst. Appl., 2013

Variable-constraint classification and quantification of radiology reports under the ACR Index.
Expert Syst. Appl., 2013

Utility-Theoretic Ranking for Semi-Automated Text Classification.
ERCIM News, 2013

Quantification Trees.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

2012
A utility-theoretic ranking method for semi-automated text classification.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Blog Distillation via Sentiment-Sensitive Link Analysis.
Proceedings of the Natural Language Processing and Information Systems, 2012

Metadata Enrichment Services for the Europeana Digital Library.
Proceedings of the Theory and Practice of Digital Libraries, 2012

2011
Publishing survey articles on information retrieval topics.
SIGIR Forum, 2011

ISTI@TREC Microblog Track 2011: Exploring the Use of Hashtag Segmentation and Text Quality Ranking.
Proceedings of The Twentieth Text REtrieval Conference, 2011

2010
Guest Editors' introduction to the focussed issue on the 14th European Conference on Digital Libraries (ECDL 2010).
Int. J. Digit. Libr., 2010

Selecting negative examples for hierarchical text classification: An experimental comparison.
J. Assoc. Inf. Sci. Technol., 2010

Sentiment Quantification.
IEEE Intell. Syst., 2010

Extracting Information from Free-text Mammography Reports.
ERCIM News, 2010

ISTI@SemEval-2 Task 8: Boosting-Based Multiway Relation Classification.
Proceedings of the 5th International Workshop on Semantic Evaluation, 2010

Feature selection for ordinal regression.
Proceedings of the 2010 ACM Symposium on Applied Computing (SAC), 2010

SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Sentence-Based Active Learning Strategies for Information Extraction.
Proceedings of the IIR 2010, 2010

Selecting Features for Ordinal Text Classification.
Proceedings of the IIR 2010, 2010

Evaluating Information Extraction.
Proceedings of the Multilingual and Multimodal Information Access Evaluation, 2010

2009
Multimedia Information Retrieval Model.
Proceedings of the Encyclopedia of Database Systems, 2009

Adaptive Committees of Feature-Specific Classifiers for Image Classification.
ERCIM News, 2009

Multi-Faceted Rating of Product Reviews.
ERCIM News, 2009

Preferential Text Classification: Learning Algorithms and Evaluation Measures.
ERCIM News, 2009

Enhancing Opinion Extraction by Automatically Annotated Lexical Resources - (Extended Version).
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2009

Evaluation Measures for Ordinal Regression.
Proceedings of the Ninth International Conference on Intelligent Systems Design and Applications, 2009

Training Data Cleaning for Text Classification.
Proceedings of the Advances in Information Retrieval Theory, 2009

Encoding Ordinal Features into Binary Features for Text Classification.
Proceedings of the Advances in Information Retrieval, 2009

Active Learning Strategies for Multi-Label Text Classification.
Proceedings of the Advances in Information Retrieval, 2009

Multi-facet Rating of Product Reviews.
Proceedings of the Advances in Information Retrieval, 2009

2008
Boosting multi-label hierarchical text categorization.
Inf. Retr., 2008

Annotating Expressions of Opinion and Emotion in the Italian Content Annotation Bank.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007
Cluster Generation and Labeling for Web Snippets: A Fast, Accurate Hierarchical Solution.
Internet Math., 2007

Automatically Determining Attitude Type and Force for Sentiment Analysis.
Proceedings of the Human Language Technology. Challenges of the Information Society, 2007

Preference Learning for Category-Ranking based Interactive Text Categorization.
Proceedings of the International Joint Conference on Neural Networks, 2007

PageRanking WordNet Synsets: An Application to Opinion Mining.
Proceedings of the ACL 2007, 2007

2006
Automatic expansion of domain-specific lexicons by term categorization.
ACM Trans. Speech Lang. Process., 2006

Cluster Generation and Cluster Labelling for Web Snippets: A Fast and Accurate Hierarchical Solution.
Proceedings of the String Processing and Information Retrieval, 2006

TreeBoost.MH: A Boosting Algorithm for Multi-label Hierarchical Text Categorization.
Proceedings of the String Processing and Information Retrieval, 2006

MP-Boost: A Multiple-Pivot Boosting Algorithm and Its Application to Text Categorization.
Proceedings of the String Processing and Information Retrieval, 2006

A scalable algorithm for high-quality clustering of web snippets.
Proceedings of the 2006 ACM Symposium on Applied Computing (SAC), 2006

SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Determining Term Subjectivity and Term Orientation for Opinion Mining.
Proceedings of the EACL 2006, 2006

2005
An analysis of the relative hardness of Reuters-21578 subsets.
J. Assoc. Inf. Sci. Technol., 2005

Determining the semantic orientation of terms through gloss classification.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Text Categorization.
Proceedings of the Encyclopedia of Database Technologies and Applications, 2005

2004
Introduction: Special Issue on the 25th European Conference on Information Retrieval Research.
Inf. Retr., 2004

An Experimental Comparison of Term Representation for Term Management Applications.
Proceedings of the Twelfth Italian Symposium on Advanced Database Systems, 2004

An Analysis of the Relative Difficulty of Reuters-21578 Subsets.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Distributional term representations: an experimental comparison.
Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, 2004

2003
Report on the 25th European conference on information retrieval research (ECIR-03).
SIGIR Forum, 2003

Automating survey coding by multiclass text categorization techniques.
J. Assoc. Inf. Sci. Technol., 2003

Multiclass Text Categorization for Automated Survey Coding.
Proceedings of the 2003 ACM Symposium on Applied Computing (SAC), 2003

Supervised Term Weighting for Automated Text Categorization.
Proceedings of the 2003 ACM Symposium on Applied Computing (SAC), 2003

Expanding Domain-Specific Lexicons by Term Categorization.
Proceedings of the 2003 ACM Symposium on Applied Computing (SAC), 2003

Discretizing Continuous Attributes in AdaBoost for Text Categorization.
Proceedings of the Advances in Information Retrieval, 2003

2002
Report on the workshop on Operational Text Classification Systems (OTC-02).
SIGIR Forum, 2002

Guest Editors' Introduction to the Special Issue on Automated Text Categorization.
J. Intell. Inf. Syst., 2002

Machine learning in automated text categorization.
ACM Comput. Surv., 2002

Building thematic lexical resources by term categorization.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Mapping an Automated Survey Coding Task into a Probabilistic Text Categorization Framework.
Proceedings of the Advances in Natural Language Processing, 2002

2001
Report on the Workshop on Operational Text Classification systems (OTC-01).
SIGIR Forum, 2001

A model of multimedia information retrieval.
J. ACM, 2001

2000
Experiments on the Use of Feature Selection and Negative Evidence in Automated Text Categorization.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2000

An Improved Boosting Algorithm and its Application to Text Categorization.
Proceedings of the 2000 ACM CIKM International Conference on Information and Knowledge Management, 2000

1999
Towards a Logical Reconstruction of Information Retrieval Theory.
Cybern. Syst., 1999

Total Knowledge and Partial Knowledge in Logical Models of Information Retrieval.
Proceedings of the Foundations of Intelligent Systems, 11th International Symposium, 1999

A System for the Fast Prototyping of Multidimensional Image Retrieval.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

1998
Trends in ... a Critical Review: On the Role of Logic in Information Retrieval.
Inf. Process. Manag., 1998

Information Retrieval, Imaging and Probabilistic Logic.
Comput. Artif. Intell., 1998

1997
The Terminological Image Retrieval Model.
Proceedings of the Image Analysis and Processing, 9th International Conference, 1997

Modelling the Retrieval of Structured Documents Containing Texts and Images.
Proceedings of the Research and Advanced Technology for Digital Libraries. First European Conference, 1997

Conceptual Modeling in Multimedia Information Seeking.
Proceedings of the Conceptual Modeling, 1997

1995
Default Reasoning in a Terminological Logic.
Comput. Artif. Intell., 1995

A Note on Logic and Information Retrieval.
Proceedings of the Final WorkShop on Multimedia Information Retrieval (MIRO'95), 1995

1994
A Probabilistic Terminological Logic for Modelling Information Retrieval.
Proceedings of the 17th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Dublin, 1994

1993
A Model of Information Retrieval Based on a Terminological Logic.
Proceedings of the 16th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Pittsburgh, PA, USA, June 27, 1993

1991
A Computationally Tractable Terminological Logic.
Proceedings of the Third Scandinavian Conference on Artificial Intelligence, 1991

1990
A Proof-Theoretic Account of Model-Preference Default Reasoning.
Proceedings of the Artificial Intelligence IV: Methodology, Systems, Applications, 1990


  Loading...