Sanket Biswas

ORCID: 0000-0001-6648-8270

According to our database, Sanket Biswas authored at least 46 papers between 2017 and 2025.

Bibliography

2025
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding.
CoRR, April, 2025

Tricho-Vision: The use of computer vision in trichotaxonomy for enhancing wildlife conservation of priority species.
Ecol. Informatics, 2025

FASTER: A Font-Agnostic Scene Text Editing and Rendering Framework.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ICDAR 2025 Handwritten Notes Understanding Challenge.
Proceedings of the Document Analysis and Recognition - ICDAR 2025, 2025

Doc2GraphFormer: Bridging Structured Graph Learning with Transformer Attention for Efficient Document Understanding.
Proceedings of the Document Analysis and Recognition - ICDAR 2025, 2025

Where Layout Meets Language: Lightweight Spatial Enhancement to Large Language Models for Document Understanding.
Proceedings of the Document Analysis and Recognition - ICDAR 2025, 2025

Doc2Graph-X: A Multilingual Graph-Based Framework for Form Understanding.
Proceedings of the Graph-Based Representations in Pattern Recognition, 2025

2024
A unified representation framework for the evaluation of Optical Music Recognition systems.
Int. J. Document Anal. Recognit., September, 2024

SemiDocSeg: harnessing semi-supervised learning for document layout analysis.
Int. J. Document Anal. Recognit., September, 2024

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks.
CoRR, 2024

Recurrent Few-Shot model for Document Verification.
CoRR, 2024

DocSynthv2: A Practical Autoregressive Modeling for Document Generation.
CoRR, 2024

GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding.
CoRR, 2024

Synthetic dataset of ID and Travel Document.
CoRR, 2024

Beyond Document Page Classification: Design, Datasets, and Challenges.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Recurrent Few-Shot Model for Document Verification.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

LayeredDoc: Domain Adaptive Document Restoration with a Layer Separation Approach.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 Workshops, 2024

DistilDoc: Knowledge Distillation for Visually-Rich Document Applications.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Towards Generative Class Prompt Learning for Fine-grained Visual Recognition.
Proceedings of the 35th British Machine Vision Conference, 2024

2023
The Common Optical Music Recognition Evaluation Framework.
CoRR, 2023

TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language.
CoRR, 2023

Segmentation-Free Alignment of Arbitrary Symbol Transcripts to Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

Can Pre-trained Language Models Help in Understanding Handwritten Symbols?
Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

SelfDocSeg: A Self-supervised Vision-Based Approach Towards Document Segmentation.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Document UnderstanDing of Everything (DUDE).
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement.
CoRR, 2022

DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer.
CoRR, 2022

DocEnTr: An End-to-End Document Image Enhancement Transformer.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

A Few Shot Multi-representation Approach for N-Gram Spotting in Historical Manuscripts.
Proceedings of the Frontiers in Handwriting Recognition - 18th International Conference, 2022

Doc2Graph: A Task Agnostic Document Understanding Framework Based on Graph Neural Networks.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021
Beyond document object detection: instance-level segmentation of complex layouts.
Int. J. Document Anal. Recognit., 2021

Graph-Based Deep Generative Modelling for Document Layout Generation.
Proceedings of the Document Analysis and Recognition, 2021

DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

2018
Fault Area Detection in Leaf Diseases using k-means Clustering.
CoRR, 2018

A Statistical Approach to Adult Census Income Level Prediction.
CoRR, 2018

2017
Prediction of Diabetes Type-II Using a Two-Class Neural Network.
Proceedings of the Computational Intelligence, Communications, and Business Analytics, 2017

