Marc Boullé

According to our database1, Marc Boullé authored at least 114 papers between 2002 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Fast and fully-automated histograms for large-scale data sets.
Comput. Stat. Data Anal., April, 2023

An Efficient Shapley Value Computation for the Naive Bayes Classifier.
CoRR, 2023

Two-level histograms for dealing with outliers and heavy tail distributions.
CoRR, 2023

Une approche bayésienne non paramétrique de sélection de variables pour la modélisation de l'uplift.
Proceedings of the Extraction et Gestion des Connaissances, 2023

Comparaison des valeurs de Shapley et des valeurs du poids de l'évidence dans le cas du classifieur naïf de Bayes.
Proceedings of the Extraction et Gestion des Connaissances, 2023

2022
A Non-parametric Bayesian Approach for Uplift Discretization and Feature Selection.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

2021
Interpretable Feature Construction for Time Series Extrinsic Regression.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

2020
Sélections simultanées de variables et de représentations pour la classification de séries temporelles.
Proceedings of the Extraction et Gestion des Connaissances, 2020

Multivariate Time Series Classification: A Relational Way.
Proceedings of the Big Data Analytics and Knowledge Discovery, 2020

2019
A scalable robust and automatic propositionalization approach for Bayesian classification of large mixed numerical and categorical data.
Mach. Learn., 2019

FEARS: a Feature and Representation Selection approach for Time Series Classification.
Proceedings of The 11th Asian Conference on Machine Learning, 2019

Analysis of the AutoML Challenge Series 2015-2018.
Proceedings of the Automated Machine Learning - Methods, Systems, Challenges, 2019

2018
Hierarchical two-part MDL code for multinomial distributions.
Int. J. Approx. Reason., 2018

Discovering patterns in time-varying graphs: a triclustering approach.
Adv. Data Anal. Classif., 2018

Un modèle Bayésien de co-clustering de données mixtes.
Proceedings of the Extraction et Gestion des Connaissances, 2018

Co-clustering Based Exploratory Analysis of Mixed-Type Data Tables.
Proceedings of the Advances in Knowledge Discovery and Management, 2018

Model Based Co-clustering of Mixed Numerical and Binary Data.
Proceedings of the Advances in Knowledge Discovery and Management, 2018

A two level co-clustering algorithm for very large data sets.
Proceedings of the Extraction et Gestion des Connaissances, 2018

2017
A user parameter-free approach for mining robust sequential classification rules.
Knowl. Inf. Syst., 2017

Co-clustering de données mixtes à base des modèles de mélange.
Proceedings of the 17ème Journées Francophones Extraction et Gestion des Connaissances, 2017

Application du coclustering à l'analyse exploratoire d'une table de données.
Proceedings of the 17ème Journées Francophones Extraction et Gestion des Connaissances, 2017

Sélection et transformation de variables pour la classification Multi-Label par une approche MDL.
Proceedings of the 17ème Journées Francophones Extraction et Gestion des Connaissances, 2017

MiSeRe-Hadoop: A Large-Scale Robust Sequential Classification Rules Mining Framework.
Proceedings of the Big Data Analytics and Knowledge Discovery, 2017

2016
Revisiting enumerative two-part crude MDL for Bernoulli and multinomial distributions (Extended version).
CoRR, 2016

Predicting Dangerous Seismic Events in Coal Mines under Distribution Drift.
Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, 2016

Analyse exploratoire par k-Coclustering avec Khiops CoViz.
Proceedings of the 16ème Journées Francophones Extraction et Gestion des Connaissances, 2016

Khiops: outil d'apprentissage supervisé automatique pour la fouille de grandes bases de données multi-tables.
Proceedings of the 16ème Journées Francophones Extraction et Gestion des Connaissances, 2016

2015
Cats & Co: Categorical Time Series Coclustering.
CoRR, 2015

Universal Approximation of Edge Density in Large Graphs.
CoRR, 2015

Prediction of Methane Outbreak in Coal Mines from Historical Sensor Data under Distribution Drift.
Proceedings of the Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, 2015

Country-Scale Exploratory Analysis of Call Detail Records Through the Lens of Data Grid Models.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Symbolic Representation of Time Series: A Hierarchical Coclustering Formalization.
Proceedings of the 1st International Workshop on Advanced Analytics and Learning on Temporal Data, 2015

Concept drift detection using supervised bivariate grids.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

A Parameter-Free Approach for Mining Robust Sequential Classification Rules.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Tagging fireworkers activities from body sensors under distribution drift.
Proceedings of the 2015 Federated Conference on Computer Science and Information Systems, 2015

Online Learning of a Weighted Selective Naive Bayes Classifier with Non-convex Optimization.
Proceedings of the Advances in Knowledge Discovery and Management, 2015

TESS: Temporal event sequence summarization.
Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, 2015

2014
Khiops CoViz: A Tool for Visual Exploratory Analysis of k-Coclustering Results.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Towards Automatic Feature Construction for Supervised Classification.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Parsimonious Naive Bayes.
Proceedings of the 2014 Federated Conference on Computer Science and Information Systems, 2014

Apprentissage incrémental anytime d'un classifieur Bayésien naïf pondéré.
Proceedings of the 14èmes Journées Francophones Extraction et Gestion des Connaissances, 2014

Clustering de séquences d'évènements temporels.
Proceedings of the 14èmes Journées Francophones Extraction et Gestion des Connaissances, 2014

2013
Feature Extraction over Multiple Representations for Time Series Classification.
Proceedings of the New Frontiers in Mining Complex Patterns, 2013

SAXO: An optimized data-driven symbolic representation of time series.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Grille bivariée pour la détection de changement dans un flux étiqueté.
Proceedings of the Extraction et gestion des connaissances (EGC'2013), Actes, 29 janvier, 2013

Co-Clustering Network-Constrained Trajectory Data.
Proceedings of the Advances in Knowledge Discovery and Management, 2013

Classifications croisées de données de trajectoires contraintes par un réseau routier.
Proceedings of the Extraction et gestion des connaissances (EGC'2013), Actes, 29 janvier, 2013

Un Critère d'Évaluation pour la Construction de Variables à base d'Itemsets pour l'Apprentissage Supervisé Multi-Tables.
Proceedings of the Extraction et gestion des connaissances (EGC'2013), Actes, 29 janvier, 2013

A Study of the Spatio-Temporal Correlations in Mobile Calls Networks.
Proceedings of the Advances in Knowledge Discovery and Management, 2013

Étude des corrélations spatio-temporelles des appels mobiles en France.
Proceedings of the Extraction et gestion des connaissances (EGC'2013), Actes, 29 janvier, 2013

Construction de descripteurs à partir du coclustering pour la classification supervisée de séries temporelles.
Proceedings of the Extraction et gestion des connaissances (EGC'2013), Actes, 29 janvier, 2013

Vers une Automatisation de la Construction de Variables pour la Classification Supervisée.
Proceedings of the Extraction et gestion des connaissances (EGC'2013), Actes, 29 janvier, 2013

2012
Functional data clustering via piecewise constant nonparametric density estimation.
Pattern Recognit., 2012

A Bayesian Approach for Classification Rule Mining in Quantitative Databases.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Itemset-Based Variable Construction in Multi-relational Supervised Learning.
Proceedings of the Inductive Logic Programming - 22nd International Conference, 2012

A Triclustering Approach for Time Evolving Graphs.
Proceedings of the 12th IEEE International Conference on Data Mining Workshops, 2012

Supervised Pre-processing of Numerical Variables for Multi-Relational Data Mining.
Proceedings of the Advances in Knowledge Discovery and Management, 2012

Prétraitement Supervisé des Variables Numériques pour la Fouille de Données Multi-Tables.
Proceedings of the Extraction et gestion des connaissances (EGC'2012), Actes, janvier 31, 2012

Nonparametric Hierarchical Clustering of Functional Data.
Proceedings of the Advances in Knowledge Discovery and Management, 2012

Clustering hiérarchique non paramétrique de données fonctionnelles.
Proceedings of the Extraction et gestion des connaissances (EGC'2012), Actes, janvier 31, 2012

Sélection Bayésienne de Modèles avec Prior Dépendant des Données.
Proceedings of the Extraction et gestion des connaissances (EGC'2012), Actes, janvier 31, 2012

2011
A Bayesian Criterion for Evaluating the Robustness of Classification Rules in Binary Data Sets.
Proceedings of the Advances in Knowledge Discovery and Management, 2011

Informative Variables Selection for Multi-relational Supervised Learning.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2011

A supervised approach for change detection in data streams.
Proceedings of the 2011 International Joint Conference on Neural Networks, 2011

Sélection des variables informatives pour l'apprentissage supervisé multi-tables.
Proceedings of the Extraction et gestion des connaissances (EGC'2011), 2011

Optimisation directe des poids de modèles dans un prédicteur Bayésien naïf moyenné.
Proceedings of the Extraction et gestion des connaissances (EGC'2011), 2011

Un critère Bayésien pour évaluer la robustesse des règles de classification.
Proceedings of the Extraction et gestion des connaissances (EGC'2011), 2011

Estimation de la densité d'arcs dans les graphes de grande taille : une alternative à la détection de clusters.
Proceedings of the Extraction et gestion des connaissances (EGC'2011), 2011

Détection de changements de distribution dans un flux de données : une approche supervisée.
Proceedings of the Extraction et gestion des connaissances (EGC'2011), 2011

2010
Bayesian instance selection for the nearest neighbor rule.
Mach. Learn., 2010

A non-parametric semi-supervised discretization method.
Knowl. Inf. Syst., 2010

The Orange Customer Analysis Platform.
Proceedings of the Advances in Data Mining. Applications and Theoretical Aspects, 2010

A method to build a representation using a classifier and its use in a K Nearest Neighbors-based deployment.
Proceedings of the International Joint Conference on Neural Networks, 2010

Exploration vs. exploitation in active learning : A Bayesian approach.
Proceedings of the International Joint Conference on Neural Networks, 2010

Simultaneous Partitioning of Input and Class Variables for Supervised Classification Problems with Many Classes.
Proceedings of the Advances in Knowledge Discovery and Management, 2010

Classification supervisée pour de grands nombres de valeurs à prédire.
Proceedings of the Extraction et gestion des connaissances (EGC'2010), 2010

Une nouvelle stratégie d'apprentissage Bayésienne.
Proceedings of the Extraction et gestion des connaissances (EGC'2010), 2010

Modelling Complex Data by Learning Which Variable to Construct.
Proceedings of the Data Warehousing and Knowledge Discovery, 12th International Conference, 2010

2009
Approches Statistique et Linguistique Pour la Classification de Textes d'Opinion Portant sur les Films.
Proceedings of the Fouille de Données d'Opinions, 2009

Design and analysis of the KDD cup 2009: fast scoring on a large orange customer database.
SIGKDD Explor., 2009

Analysis of the KDD Cup 2009: Fast Scoring on a Large Orange Customer Database.
Proceedings of KDD-Cup 2009 competition, Paris, France, June 28, 2009, 2009

A Parameter-Free Classification Method for Large Scale Learning.
J. Mach. Learn. Res., 2009

Optimum simultaneous discretization with data grid models in supervised classification: a Bayesian model selection approach.
Adv. Data Anal. Classif., 2009

Un critère d'évaluation Bayésienne pour la construction d'arbre de décision.
Proceedings of the Extraction et gestion des connaissances (EGC'2009), 2009

Une méthode de classification supervisée sans paramètre pour l'apprentissage sur les grandes bases de données.
Proceedings of the Extraction et gestion des connaissances (EGC'2009), 2009

A Bayes Evaluation Criterion for Decision Trees.
Proceedings of the Advances in Knowledge Discovery and Management [Best of EGC 2009, 2009

2008
A Non-parametric Semi-supervised Discretization Method.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Analyse exploratoire d'opinions cinématographiques : co-clustering de corpus textuels communautaires.
Proceedings of the Extraction et gestion des connaissances (EGC'2008), 2008

Vers l'exploitation de grandes masses de données.
Proceedings of the Extraction et gestion des connaissances (EGC'2008), 2008

Khiops : outil de préparation et modélisation des données pour la fouille des grandes bases de données.
Proceedings of the Extraction et gestion des connaissances (EGC'2008), 2008

2007
A New Probabilistic Approach in Rank Regression with Optimal Bayesian Partitioning.
J. Mach. Learn. Res., 2007

Compression-Based Averaging of Selective Naive Bayes Classifiers.
J. Mach. Learn. Res., 2007

Comparing State-of-the-Art Collaborative Filtering Systems.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2007

Report on Preliminary Experiments with Data Grid Models in the Agnostic Learning vs. Prior Knowledge Challenge.
Proceedings of the International Joint Conference on Neural Networks, 2007

Une approche non paramétrique Bayesienne pour l'estimation de densité conditionnelle sur les rangs.
Proceedings of the Extraction et gestion des connaissances (EGC'2007), 2007

Evaluation supervisée de métrique : application à la préparation de données séquentielles.
Proceedings of the Extraction et gestion des connaissances (EGC'2007), 2007

Une méthode optimale d'évaluation bivariée pour la classification supervisée.
Proceedings of the Extraction et gestion des connaissances (EGC'2007), 2007

2006
MODL: A Bayes optimal discretization method for continuous attributes.
Mach. Learn., 2006

Supervised evaluation of Voronoi partitions.
Intell. Data Anal., 2006

Supervised Selection of Dynamic Features, with an Application to Telecommunication Data Preparation.
Proceedings of the Advances in Data Mining, 2006

Regularization and Averaging of the Selective Naive Bayes classifier.
Proceedings of the International Joint Conference on Neural Networks, 2006

Sélection supervisée d'instances : une approche descriptive.
Proceedings of the Extraction et gestion des connaissances (EGC'2006), 2006

Optimal Bayesian 2D-Discretization for Variable Ranking in Regression.
Proceedings of the Discovery Science, 9th International Conference, 2006

An Enhanced Selective Naïve Bayes Method with Optimal Discretization.
Proceedings of the Feature Extraction - Foundations and Applications, 2006

2005
A Bayes Optimal Approach for Partitioning the Values of Categorical Attributes.
J. Mach. Learn. Res., 2005

Optimal bin number for equal frequency discretizations in supervized learning.
Intell. Data Anal., 2005

Supervised Evaluation of Dataset Partitions: Advantages and Practice.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2005

Multivariate Discretization by Recursive Supervised Bipartition of Graph.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2005

A Grouping Method for Categorical Attributes Having Very Large Number of Values.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2005

2004
Khiops: A Statistical Discretization Method of Continuous Attributes.
Mach. Learn., 2004

Utilisation des graphes de proximité dans le cadre de l'apprentissage basé sur les voisins.
Proceedings of the Extraction et gestion des connaissances (EGC'2004), 2004

A robust method for partitioning the values of categorical attributes.
Proceedings of the Extraction et gestion des connaissances (EGC'2004), 2004

2003
Khiops: A Discretization Method of Continuous Attributes with Guaranteed Resistance to Noise.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2003

2002
Khiops: une méthode statistique de discrétisation.
Proceedings of the Extraction et gestion des connaissances (EGC'2002), 2002


  Loading...