Alert button
Picture for Josep Lladós

Josep Lladós

Alert button

GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding

Add code
Bookmark button
Alert button
May 06, 2024
Nil Biescas, Carlos Boned, Josep Lladós, Sanket Biswas

Viaarxiv icon

SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition

Add code
Bookmark button
Alert button
May 06, 2024
Adarsh Tiwari, Sanket Biswas, Josep Lladós

Viaarxiv icon

SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout

Add code
Bookmark button
Alert button
Mar 30, 2024
Ayan Banerjee, Nityanand Mathur, Josep Lladós, Umapada Pal, Anjan Dutta

Viaarxiv icon

GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation

Add code
Bookmark button
Alert button
Feb 20, 2024
Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Viaarxiv icon

Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes

Add code
Bookmark button
Alert button
Oct 06, 2023
Alloy Das, Sanket Biswas, Umapada Pal, Josep Lladós

Figure 1 for Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes
Figure 2 for Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes
Figure 3 for Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes
Figure 4 for Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes
Viaarxiv icon

Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance

Add code
Bookmark button
Alert button
Oct 06, 2023
Alloy Das, Sanket Biswas, Ayan Banerjee, Saumik Bhattacharya, Josep Lladós, Umapada Pal

Figure 1 for Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
Figure 2 for Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
Figure 3 for Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
Figure 4 for Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
Viaarxiv icon

TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language

Add code
Bookmark button
Alert button
Sep 11, 2023
Souhail Bakkali, Sanket Biswas, Zuheng Ming, Mickael Coustaty, Marçal Rusiñol, Oriol Ramos Terrades, Josep Lladós

Figure 1 for TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language
Figure 2 for TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language
Figure 3 for TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language
Figure 4 for TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language
Viaarxiv icon

SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation

Add code
Bookmark button
Alert button
May 08, 2023
Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Figure 1 for SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
Figure 2 for SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
Figure 3 for SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
Figure 4 for SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
Viaarxiv icon

SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation

Add code
Bookmark button
Alert button
May 02, 2023
Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya, Umapada Pal

Figure 1 for SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation
Figure 2 for SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation
Figure 3 for SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation
Figure 4 for SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation
Viaarxiv icon

Towards Stroke Patients' Upper-limb Automatic Motor Assessment Using Smartwatches

Add code
Bookmark button
Alert button
Dec 09, 2022
Asma Bensalah, Jialuo Chen, Alicia Fornés, Cristina Carmona-Duarte, Josep Lladós, Miguel A. Ferrer

Viaarxiv icon