Mickaël Coustaty

Orcid: 0000-0002-0123-439X

Affiliations:
  • University of La Rochelle, L3i Laboratory


According to our database1, Mickaël Coustaty authored at least 146 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
IDTrust: Deep Identity Document Quality Detection with Bandpass Filtering.
CoRR, 2024

Deep Discriminative Feature Learning for Document Image Manipulation Detection.
Proceedings of the 19th International Joint Conference on Computer Vision, 2024

2023
Automatic classification of company's document stream: Comparison of two solutions.
Pattern Recognit. Lett., August, 2023

VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification.
Pattern Recognit., July, 2023

In-depth analysis of the impact of OCR errors on named entity recognition and linking.
Nat. Lang. Eng., March, 2023

Lazy-k: Decoding for Constrained Token Classification.
CoRR, 2023

TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language.
CoRR, 2023

Estimating Post-OCR Denoising Complexity on Numerical Texts.
CoRR, 2023

CHIC: Corporate Document for Visual question Answering.
CoRR, 2023

An Enhanced Prototypical Network Architecture for Few-Shot Handwritten Urdu Character Recognition.
IEEE Access, 2023

Guilloche Detection for ID Authentication: A Dataset and Baselines.
Proceedings of the 25th IEEE International Workshop on Multimedia Signal Processing, 2023

Extracting Key-Value Pairs in Business Documents.
Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

DocILE Benchmark for Document Information Localization and Extraction.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Incremental Learning and Ambiguity Rejection for Document Classification.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

KAP: Pre-training Transformers for Corporate Documents Understanding.
Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

ICDAR 2023 Competition on Document UnderstanDing of Everything (DUDE).
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Subgraph-Induced Extraction Technique for Information (SETI) from Administrative Documents.
Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

Document Understanding Dataset and Evaluation (DUDE).
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

STRAS: A Semantic Textual-Cues Leveraged Rule-Based Approach for Article Separation in Historical Newspapers.
Proceedings of the Leveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration, 2023

Benchmarking NAS for Article Separation in Historical Newspapers.
Proceedings of the Leveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration, 2023

Text Line Detection in Historical Index Tables: Evaluations on a New French PArish REcord Survey Dataset (PARES).
Proceedings of the Leveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration, 2023

An Explorative Guide on How to Detect Forged Car Insurance Claims with Language Models.
Proceedings of the 15th International Joint Conference on Knowledge Discovery, 2023

Lazy-k Decoding: Constrained Decoding for Information Extraction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Extended Overview of DocILE 2023: Document Information Localization and Extraction.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

Overview of DocILE 2023: Document Information Localization and Extraction.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2023

2022
Correction to: MELHISSA: a multilingual entity linking architecture for historical press articles.
Int. J. Digit. Libr., 2022

MELHISSA: a multilingual entity linking architecture for historical press articles.
Int. J. Digit. Libr., 2022

Survey of Post-OCR Processing Approaches.
ACM Comput. Surv., 2022

Weighting Sliding Tiles For Writer Identification in Handwritten Musical Scores.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2022

Document Forgery Detection in the Context of Double JPEG Compression.
Proceedings of the Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges, 2022

Adapting Transformers for Detecting Emergency Events on Social Media.
Proceedings of the 14th International Joint Conference on Knowledge Discovery, 2022

ReadOCR: A Novel Dataset and Readability Assessment of OCRed Texts.
Proceedings of the Document Analysis Systems - 15th IAPR International Workshop, 2022

QAlayout: Question Answering Layout Based on Multimodal Attention for Visual Question Answering on Corporate Document.
Proceedings of the Document Analysis Systems - 15th IAPR International Workshop, 2022

2021
Deep multimodal learning for cross-modal retrieval: One model for all tasks.
Pattern Recognit. Lett., 2021

Improving seller-customer communication process using word embeddings.
J. Ambient Intell. Humaniz. Comput., 2021

EAML: ensemble self-attention-based mutual learning network for document image classification.
Int. J. Document Anal. Recognit., 2021

A systematic study on the role of SentiWordNet in opinion mining.
Frontiers Comput. Sci., 2021

Urdu Handwritten Characters Data Visualization and Recognition Using Distributed Stochastic Neighborhood Embedding and Deep Network.
Complex., 2021

Graph Neural Networks Using Local Descriptions in Attributed Graphs: An Application to Symbol Recognition and Hand Written Character Recognition.
IEEE Access, 2021

Transformer-based Methods with #Entities for Detecting Emergency Events on Social Media.
Proceedings of the Thirtieth Text REtrieval Conference, 2021

Toward an Incremental Classification Process of Document Stream Using a Cascade of Systems.
Proceedings of the Document Analysis and Recognition, 2021

Multimodal Attention-Based Learning for Imbalanced Corporate Documents Classification.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Information Extraction from Invoices.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Robust Hashing for Character Authentication and Retrieval Using Deep Features and Iterative Quantization.
Proceedings of the Document Analysis and Recognition, 2021

Applying Segmented Images by Louvain Method into Content-Based Image Retrieval.
Proceedings of the Context-Aware Systems and Applications, 2021

Apprentissage multimodal basé sur des modèles d'attention pour la classification de documents dans un contexte déséquilibré.
Proceedings of the Extraction et Gestion des Connaissances, 2021

2020
Correction: OpinionML - Opinion Markup Language for Sentiment Representation. Symmetry 2019, 11, 545.
Symmetry, 2020

Scientometric analysis of social science and science disciplines in a developing nation: a case study of Pakistan in the last decade.
Scientometrics, 2020

An adaptive document recognition system for lettrines.
Int. J. Document Anal. Recognit., 2020

Urdu handwritten text recognition: a survey.
IET Image Process., 2020

Performance Evaluation of Deep Generative Models for Generating Hand-Written Character Images.
CoRR, 2020

Additive Angular Margin Loss in Deep Graph Neural Network Classifier for Learning Graph Edit Distance.
IEEE Access, 2020

Dataset for Temporal Analysis of English-French Cognates.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Neural Machine Translation with BERT for Post-OCR Error Detection and Correction.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

An Extended Evaluation of the Impact of Different Modules in ST-VQA Systems.
Proceedings of the Pattern Recognition and Artificial Intelligence, 2020

Multi-Attribute Learning With Highly Imbalanced Data.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Local Geometry Analysis For Image Tampering Detection.
Proceedings of the IEEE International Conference on Image Processing, 2020

Cross-Modal Deep Networks For Document Image Classification.
Proceedings of the IEEE International Conference on Image Processing, 2020

Entity Linking for Historical Documents: Challenges and Solutions.
Proceedings of the Digital Libraries at Times of Massive Societal Transition, 2020

Assessing and Minimizing the Impact of OCR Quality on Named Entity Recognition.
Proceedings of the Digital Libraries for Open Knowledge, 2020

Evaluation of Neural Network Classification Systems on Document Stream.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

Classification of Phonetic Characters by Space-Filling Curves.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

Background Removal of French University Diplomas.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

Visual and Textual Deep Feature Fusion for Document Image Classification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
A Framework for Security and Privacy for the Internet of Things (SPIRIT).
Proceedings of the Security and Privacy in the Internet of Things: Challenges and Solutions, 2019

OpinionML - Opinion Markup Language for Sentiment Representation.
Symmetry, 2019

Visualization of High-Dimensional Data by Pairwise Fusion Matrices Using t-SNE.
Symmetry, 2019

Systematic review and usability evaluation of writing mobile apps for children.
New Rev. Hypermedia Multim., 2019

A comparison of local features for camera-based document image retrieval and spotting.
Int. J. Document Anal. Recognit., 2019

A Combination of Histogram of Oriented Gradients and Color Features to Cooperate with Louvain Method based Image Segmentation.
Proceedings of the 14th International Joint Conference on Computer Vision, 2019

Security and PrIvacy foR the Internet of Things: an overview of the project.
Proceedings of the 2019 IEEE International Conference on Systems, Man and Cybernetics, 2019

Deep Statistical Analysis of OCR Errors for Effective Post-OCR Processing.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019

An Analysis of the Performance of Named Entity Recognition over OCRed Documents.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019

ICDAR 2019 Competition on Post-OCR Text Correction.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Post-OCR Error Detection by Generating Plausible Candidates.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Learning Free Document Image Binarization Based on Fast Fuzzy C-Means Clustering.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

A Blind Document Image Watermarking Approach Based on Discrete Wavelet Transform and QR Code Embedding.
Proceedings of the Second International Workshop on Computational Document Forensics, 2019

Semantic Text Recognition via Visual Question Answering.
Proceedings of the Second International Workshop on Machine Learning, 2019

An Interactive Recommendation System for 2nd Language Vocabulary Learning - Vocabulometer 2.0.
Proceedings of the 2nd International Workshop on Human-Document Interaction, 2019

An Enhanced Louvain Based Image Segmentation Approach Using Color Properties and Histogram of Oriented Gradients.
Proceedings of the Computer Vision, Imaging and Computer Graphics Theory and Applications, 2019

Alphanumeric Glyphs Transformation Based on Shape Morphing: Context of Text.
Proceedings of the Eighth International Conference on Emerging Security Technologies, 2019

TLR at BSNLP2019: A Multilingual Named Entity Recognition System.
Proceedings of the 7th Workshop on Balto-Slavic Natural Language Processing, 2019

2018
New spatial-organization-based scale and rotation invariant features for heterogeneous-content camera-based document image retrieval.
Pattern Recognit. Lett., 2018

SentiML ++: an extension of the SentiML sentiment annotation scheme.
New Rev. Hypermedia Multim., 2018

Augmented Documents for Research Contact Management.
Proceedings of the 4th IEEE International Forum on Research and Technology for Society and Industry, 2018

A randomized hierarchical trees indexing approach for camera-based information spotting.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Categorization of Document Image Tampering Techniques and How to Identify Them.
Proceedings of the Pattern Recognition and Information Forensics, 2018

Constrained and Parametric Dynamic Programming for Word Image Retrieval.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

Adaptive Edit-Distance and Regression Approach for Post-OCR Text Correction.
Proceedings of the Maturity and Innovation in Digital Libraries, 2018

Feature Selection for Document Flow Segmentation.
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, 2018

A New Image Segmentation Approach Based on the Louvain Algorithm.
Proceedings of the 2018 International Conference on Content-Based Multimedia Indexing, 2018

SSKSRIF: Scale and Rotation Invariant Features Based on Spatial Space of Keypoints for Camera-Based Information Spotting.
Proceedings of the 2018 International Conference on Content-Based Multimedia Indexing, 2018

An Efficient Agglomerative Algorithm Cooperating with Louvain Method for Implementing Image Segmentation.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2018

2017
Fuzzy generalized median graphs computation: Application to content-based document retrieval.
Pattern Recognit., 2017

TouchDoc: A Tool to Bridge the Gap between Physical and Digital Libraries.
Proceedings of the 2017 ACM/IEEE Joint Conference on Digital Libraries, 2017

Impact of OCR Errors on the Use of Digital Libraries: Towards a Better Access to Information.
Proceedings of the 2017 ACM/IEEE Joint Conference on Digital Libraries, 2017

Enhancing Table of Contents Extraction by System Aggregation.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Machine Learning vs Deterministic Rule-Based System for Document Stream Segmentation.
Proceedings of the First Workshop of Machine Learning, 2017

Local Binary Patterns for Document Forgery Detection.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Local Enlacement Histograms for Historical Drop Caps Style Recognition.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

ICDAR2017 Competition on Post-OCR Text Correction.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

SmartDoc 2017 Video Capture: Mobile Document Acquisition in Video Mode.
Proceedings of the 1st International Workshop on Open Services and Tools for Document Analysis, 2017

Extraction of Ancient Map Contents Using Trees of Connected Components.
Proceedings of the Graphics Recognition, Current Trends and Evolutions, 2017

A dataset for forgery detection and spotting in document images.
Proceedings of the Seventh International Conference on Emerging Security Technologies, 2017

2016
Camera-based document image spotting system for complex linguistic maps.
Proceedings of the 2016 IEEE International Conference on Systems, Man, and Cybernetics, 2016

Polygon-shape-based Scale and Rotation Invariant Features for camera-based document image retrieval.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

ICFHR2016 Competition on the Analysis of Handwritten Text in Images of Balinese Palm Leaf Manuscripts.
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

DataTourism: Designing an Architecture to Process Tourism Data.
Proceedings of the Information and Communication Technologies in Tourism 2016, 2016

Delaunay Triangulation-Based Features for Camera-Based Document Image Retrieval System.
Proceedings of the 12th IAPR Workshop on Document Analysis Systems, 2016

Reconnaissance et classification de lettrines à base des descripteurs de bas niveau et de représentation structurelle.
Proceedings of the CORIA 2016 - Conférence en Recherche d'Informations et Applications, 2016

Recherche par le contenu d'images de monnaies de collection.
Proceedings of the CORIA 2016 - Conférence en Recherche d'Informations et Applications, 2016

2015
Towards ontology-based retrieval of historical images.
Appl. Ontology, 2015

A bottom-up method using texture features and a graph-based representation for lettrine recognition and classification.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Camera-based document image retrieval system using local features - comparing SRIF with LLAH, SIFT, SURF and ORB.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

SRIF: Scale and Rotation Invariant Features for camera-based document image retrieval.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Graph matching versus bag of graph: a comparative study for lettrines recognition.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

ICDAR2015 competition on smartphone document capture and OCR (SmartDoc).
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

A System for Camera-Based Retrieval of Heterogeneous-Content Complex Linguistic Map.
Proceedings of the Graphic Recognition. Current Trends and Challenges, 2015

SentiML++: An Extension of the SentiML Sentiment Annotation Scheme.
Proceedings of the Semantic Web: ESWC 2015 Satellite Events - ESWC 2015 Satellite Events Portorož, Slovenia, May 31, 2015

Applying Semantic Web Technologies for Improving the Visibility of Tourism Data.
Proceedings of the Eighth Workshop on Exploiting Semantic Annotations in Information Retrieval, 2015

2014
Segmentation system and its evaluation for gray scale coin documents.
Proceedings of the 4th International Conference on Image Processing Theory, 2014

A multi-layer approach for camera-based complex map image retrieval and spotting system.
Proceedings of the 4th International Conference on Image Processing Theory, 2014

Multi-modal and Cross-Modal for Lecture Videos Retrieval.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Annotation sémantique de documents administratifs.
Proceedings of the 14èmes Journées Francophones Extraction et Gestion des Connaissances, 2014

MAD : une plateforme mobile pour l'annotation de document vers la classification.
Proceedings of the CORIA 2014, 2014

A multi-layer separation based system for camera-based complex map image retrieval.
Proceedings of the CORIA 2014, 2014

2013
Interactive Knowledge Learning for Ancient Images.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Visual Saliency and Terminology Extraction for Document Classification.
Proceedings of the Graphics Recognition. Current Trends and Challenges, 2013

Visual saliency and terminology extraction for document annotation.
Proceedings of the ACM Symposium on Document Engineering 2013, 2013

2012
Extraction of light and specific features for historical image indexing and matching.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

2011
Contribution à l'analyse complexe de documents anciens, application aux lettrines. (Complex analysis of historical documents, application to lettrines).
PhD thesis, 2011

A New Adaptive Structural Signature for Symbol Recognition by Using a Galois Lattice as a Classifier.
IEEE Trans. Syst. Man Cybern. Part B, 2011

Towards historical document indexing: extraction of drop cap letters.
Int. J. Document Anal. Recognit., 2011

Bags of Strokes Based Approach for Classification and Indexing of Drop Caps.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Discrimination of Old Document Images Using Their Style.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Using Ontologies to Reduce the Semantic Gap between Historians and Image Processing Algorithms.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Ancient Documents Denoising and Decomposition Using Aujol and Chambolle Algorithm.
Proceedings of the Graphics Recognition. New Trends and Challenges, 2011

Historical document analysis: A review of French projects and open issues.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Reconnaissance de symboles à partir d'une signature structurelle flexible et d'un classifieur de type treillis de Galois.
Tech. Sci. Informatiques, 2010

Stroke feature extraction for lettrine indexing.
Proceedings of the 2nd International Conference on Image Processing Theory Tools and Applications, 2010

NAVIDOMASS: Structural-based Approaches Towards Handling Historical Documents.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Approche complexe de l'analyse de documents anciens.
Proceedings of the Extraction et gestion des connaissances (EGC'2010), 2010

Analyzing Old Documents Using a Complex Approach: Application to Lettrines Indexing.
Proceedings of the Advances in Knowledge Discovery and Management, 2010

2009
Drop Caps Decomposition for Indexing a New Letter Extraction Method.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

Segmenting and Indexing Old Documents Using a Letter Extraction.
Proceedings of the Graphics Recognition. Achievements, 2009

2007
On the Joint Use of a Structural Signature and a Galois Lattice Classifier for Symbol Recognition.
Proceedings of the Graphics Recognition. Recent Advances and New Opportunities, 2007


  Loading...