Hervé Déjean

ORCID: 0000-0002-9837-5358

According to our database, Hervé Déjean authored at least 54 papers between 1998 and 2024.

Bibliography

2024
A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE.
CoRR, 2024

SPLADE-v3: New baselines for SPLADE.
CoRR, 2024

Two-Step SPLADE: Simple, Efficient and Effective Approximation of SPLADE.
Proceedings of the Advances in Information Retrieval, 2024

2023
A Static Pruning Study on Sparse Neural Retrievers.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Benchmarking Middle-Trained Language Models for Neural Search.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Parameter-Efficient Sparse Retrievers and Rerankers Using Adapters.
Proceedings of the Advances in Information Retrieval, 2023

An Experimental Study on Pretraining Transformers from Scratch for IR.
Proceedings of the Advances in Information Retrieval, 2023

2022
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents.
CoRR, 2022

2019
Transforming scholarship in the archives through handwritten text recognition.
J. Documentation, 2019

Versatile Layout Understanding via Conjugate Graph.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ICDAR 2019 Competition on Table Detection and Recognition (cTDaR).
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Table Rows Segmentation.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

2018
Bench-Marking Information Extraction in Semi-Structured Historical Handwritten Records.
CoRR, 2018

Matching Table Structures of Historical Register Books using Association Graphs.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

Comparing Machine Learning Approaches for Table Recognition in Historical Register Books.
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, 2018

2015
Extracting structured data from unstructured document with incomplete resources.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

2014
Using ancestral layout models for document digitization.
Proceedings of the Digital Access to Textual Cultural Heritage 2014, 2014

2011
XML Processing in the Cloud: Large-Scale Digital Preservation in Small Institutions.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Using Page Breaks for Book Structuring.
Proceedings of the Focused Retrieval of Content and Structure, 2011

Unsupervised method to generate page templates.
Proceedings of the Document Recognition and Retrieval XVIII, 2011

2010
Xerox Launches Document Process Modelling Technology 'Xeproc©'.
ERCIM News, 2010

Xeproc©: A Model-Based Approach towards Document Process Preservation.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2010

Numbered sequence detection in documents.
Proceedings of the Document Recognition and Retrieval XVII, 2010

Reflections on the INEX structure extraction competition.
Proceedings of the Ninth IAPR International Workshop on Document Analysis Systems, 2010

Preserving the Intent behind.
Proceedings of the Automation in Digital Preservation, 18.07. - 23.07.2010, 2010

Document: a useful level for facing noisy data.
Proceedings of the Fourth Workshop on Analytics for Noisy Unstructured Text Data, 2010

2009
On tables of contents and how to recognize them.
Int. J. Document Anal. Recognit., 2009

XRCE Participation to the 2009 Book Structure Task.
Proceedings of the Focused Retrieval and Evaluation, 2009

2008
XRCE Participation to the Book Structure Task.
Proceedings of the Advances in Focused Retrieval, 2008

Versatile page numbering analysis.
Proceedings of the Document Recognition and Retrieval XV, 2008

Combining Multiple Methods for Book Indexing.
Proceedings of the Eighth IAPR International Workshop on Document Analysis Systems, 2008

2007
Logical document conversion: combining functional and formal knowledge.
Proceedings of the 2007 ACM Symposium on Document Engineering, 2007

2006
A System for Converting PDF Documents into Structured XML Format.
Proceedings of the Document Analysis Systems VII, 7th International Workshop, 2006

2005
Automatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval.
Artif. Intell. Medicine, 2005

From Legacy Documents to XML: A Conversion Framework.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2005

Structuring documents according to their table of contents.
Proceedings of the 2005 ACM Symposium on Document Engineering, 2005

2004
A Geometric View on Bilingual Lexicon Extraction from Comparable Corpora.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004

2003
Reducing Parameter Space for Word Alignment.
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, 2003

Report on CLEF-2003 Experiments: Two Ways of Extracting Multilingual Resources from Corpora.
Proceedings of the Working Notes for CLEF 2003 Workshop co-located with the 7th European Conference on Digital Libraries (ECDL 2003), 2003

2002
Learning Rules and Their Exceptions.
J. Mach. Learn. Res., 2002

Combining Labelled and Unlabelled Data: A Case Study on Fisher Kernels and Transductive Inference for Biological Entity Recognition.
Proceedings of the 6th Conference on Natural Language Learning, 2002

An Approach Based on Multilingual Thesauri and Model Combination for Bilingual Lexicon Extraction.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

XRCE Participation in CLEF 2002.
Proceedings of the Working Notes for CLEF 2002 Workshop co-located with the 6th European Conference on Digital Libraries (ECDL 2002), 2002

Assessing Automatically Extracted Bilingual Lexicons for CLIR in Vertical Domains: XRCE Participation in the GIRT Track of CLEF 2002.
Proceedings of the Advances in Cross-Language Information Retrieval, 2002

2001
Introduction to the CoNLL-2001 shared task: clause identification.
Proceedings of the ACL 2001 Workshop on Computational Natural Language Learning, 2001

Learning Computational Grammars.
Proceedings of the ACL 2001 Workshop on Computational Natural Language Learning, 2001

Using ALLiS for clausing.
Proceedings of the ACL 2001 Workshop on Computational Natural Language Learning, 2001

2000
How To Evaluate and Compare Tagsets? A Proposal.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Learning Syntactic Structures with XML.
Proceedings of the Fourth Conference on Computational Natural Language Learning, 2000

ALLiS: a Symbolic Learning System for Natural Language Learning.
Proceedings of the Fourth Conference on Computational Natural Language Learning, 2000

Applying System Combination to Base Noun Phrase Identification.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

Theory Refinement and Natural Language Learning.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

1998
Concepts et algorithmes pour la découverte des structures formelles des langues. (Concepts and Algorithms for Discovering Formal Structures of Languages).
PhD thesis, 1998

Morphemes as Necessary Concept for Structures Discovery from Untagged Corpora.
Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, 1998

