Journal Track

All accepted journal track papers will have an oral presentation at the conference.

140 | Large Scale Genealogical Information Extraction From Handwritten Quebec Parish Records | Solène Tarride, Martin Maarand, Mélodie Boillet, James McGrath, Eugénie Capel, Hélène Vézina, Christopher Kermorvant
144 | An Accurate Approach to Real-time Machine Readable Zone Detection with Mobile Devices | Alexander Gayer, Daria Ershova, Vladimir V. Arlazarov
147 | Online Handwriting Trajectory Reconstruction from Kinematic Sensors using Temporal Convolutional Network | Wassim Swaileh, Florent Imbert, Yann Soullard, Romain Tavenard, Eric Anquetil
149 | Classification of Incunable Glyphs and Out-of-distribution Detection with Joint Energy-based Models | Florian Kordon, Nikolaus Weichselbaumer, Randall Herz, Stephen Mossman, Edward Potten, Mathias Seuret, Martin Mayr, Vincent Christlein
150 | Historical Document Image Analysis using Controlled Data for Pre-Training | Najoua Rahal, Lars Vögtlin, Rolf Ingold
151 | End-to-end Optical Music Recognition for Pianoform Sheet Music | Antonio Ríos-Vila, David Rizo, José M. Iñesta, Jorge Calvo-Zaragoza
154 | Analyzing the Potential of Active Learning for Document Image Classification | Saifullah Saifullah, Stefan Agne, Andreas Dengel, Sheraz Ahmed
158 | LSTM-Based Siamese Neural Network for Urdu News Story Segmentation | Muhammad Nauman Ahmed Bhatti, Imran Siddiqi, Momina Moetesum
161 | Inv3D: A High-Resolution 3D Invoice Dataset for Template-Guided Single-Image Document Unwarping | Felix Hertlein, Alexander Naumann, Patrick Philipp
162 | Printed Ottoman Text Recognition Using Synthetic Data and Data Augmentation | Esma F. Bilgin Tasdemir
163 | IAMonSense: Multi-level Handwriting Classification using Spatio-temporal Information | Ahmad Mustafid, Junaid Younas, Paul Lukowicz, Sheraz Ahmed
182 | Line Extraction in Handwritten Documents via Instance Segmentation | Adeela Islam, Tayaba Anjum, Nazar Khan

Oral Presentations

286 | Improving Information Extraction from Semi-Structured Documents Using Attention based Semi-variational Graph Auto-encoder | Djedjiga Belhadj, Abdel Belaïd and Yolande Belaïd
424 | Search for Hyphenated Words in Probabilistic Indices: a Machine Learning Approach | José Andrés, Alejandro H. Toselli and Enrique Vidal
673 | Language Independent Neuro-Symbolic Semantic Parsing for Form Understanding | Bhanu Prakash Voutharoja, Lizhen Qu and Fatemeh Shiri
855 | Multi-Stage Fine-tuning Deep Learning Models Improves Automatic Assessment of the Rey-Osterrieth Complex Figure Test | Benjamin Schuster, Florian Kordon, Martin Mayr, Mathias Seuret and Vincent Christlein
897 | SpaDen: Sparse and Dense Keypoint Estimation for Real-World Chart Understanding | Saleem Ahmed, David Doermann, Srirangaraj Setlur, Venu Govindaraju and Pengyu Yan
910 | DocILE Benchmark for Document Information Localization and Extraction | Štěpán Šimsa, Milan Šulc, Michal Uřičář, Yash Patel, Ahmed Hamdi, Matěj Kocián, Matyáš Skalický, Jiří Matas, Antoine Doucet, Mickaël Coustaty and Dimosthenis Karatzas
1070 | A Study on Reproducibility and Replicability of Table Structure Recognition Methods | Kehinde Ajayi, Muntabir Choudhury, Sarah Rajtmajer and Jian Wu
1200 | Key-value information extraction from full handwritten pages | Solène Tarride, Mélodie Boillet and Christopher Kermorvant
1221 | Towards End-to-End Semi-Supervised Table Detection with Deformable Transformer | Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Marcus Liwicki and Muhammad Zeshan Afzal
1250 | Styled Text-to-Text-Content-Image Generation with Latent Diffusion Models | Konstantina Nikolaidou, George Retsinas, Vincent Christlein, Mathias Seuret, Giorgos Sfikas, Elisa Barney Smith, Hamam Mokayed and Marcus Liwicki
1261 | Multi-Teacher Knowledge Distillation for End-to-End Text Image Machine Translation | Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou and Chengqing Zong
1462 | Decoupling Visual-Semantic Features Learning with Dual Masked Autoencoder for Self-Supervised Scene Text Recognition | Zhi Qiao, Zhilong Ji, Ye Yuan and Jinfeng Bai
1580 | Handwritten Text Generation with Character-specific Encoding for Style Imitation | Jan Zdenek and Hideki Nakayama
1641 | Relative position embedding asymmetric siamese network for Offline handwritten mathematical expression recognition | Chunyi Wang, Wei Hu, Xiaqing Rao, Runqi Luohu, Ning Bi and Tan Jun
1653 | Consistent Nested Named Entity Recognition in handwritten documents via Lattice Rescoring | David Villanova-Aparisi, Carlos David Martinez-Hinarejos, Verónica Romero and Moisés Pastor-Gadea
1710 | Optimized Table Tokenization for Table Structure Recognition | Maksym Lysak, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer and Peter Staar
1830 | MemeGraphs: Linking Memes to Knowledge Graphs | Vasiliki Kougia, Simon Fetzel, Thomas Kirchmair, Erion Çano, Sina Baharlou, Sahand Sharifzadeh and Benjamin Roth
2017 | EDSL: An Encoder-Decoder Architecture with Symbol-Level Features for Printed Mathematical Expression Recognition | Yingnan Fu, Tingting Liu, Ming Gao and Aoying Zhou
2113 | TransDocAnalyser: A framework for semi-structured offline handwritten documents analysis with an application to legal domain | Sagar Chakraborty, Gaurav Harit and Saptarshi Ghosh
2326 | Scene Table Structure Recognition with Segmentation and Key Point Collaboration | Li Zhuoming, Peng Fan, Xue Yang, Ni Hao and Jin Lianwen
2427 | ViSA: Visual and Semantic Alignment for Robust Scene Text Recognition | Zhenru Pan, Zhilong Ji, Xiao Liu, Jinfeng Bai and Cheng-Lin Liu
2503 | SET, SORT! A Novel Sub-Stroke Level Transformer for Offline Handwriting to Online Conversion | Elmokhtar Mohamed Moussa, Thibault Lelore and Harold Mouchère
2969 | Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks | Le Xue, Mingfei Gao, Zeyuan Chen, Caiming Xiong and Ran Xu
3025 | FCN-Boosted Historical Map Segmentation with Little Training Data | Josef Baloun, Ladislav Lenc and Pavel Král
3117 | SCI-3000: A Dataset for Figure, Table and Caption Extraction from Scientific PDFs | Filip Darmanović, Allan Hanbury and Markus Zlabinger
3176 | How to Choose Pretrained Handwriting Recognition Models for Single Writer Fine-Tuning | Vittorio Pippi, Silvia Cascianelli, Christopher Kermorvant and Rita Cucchiara
3372 | An End-to-End Local Attention Based Model for Table Recognition | Nam Tuan Ly and Atsuhiro Takasu
3827 | Diffusion-based document layout generation | Liu He, Yijuan Lu, John Corring, Dinei Florencio and Cha Zhang
4206 | Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation | Michael Jungo, Beat Wolf, Andrii Maksai, Claudiu Musat and Andreas Fischer
4261 | Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition | Zhuang Liu, Ye Yuan, Zhilong Ji, Jinfeng Bai and Xiang Bai
4838 | TBM-GAN: Synthetic Document Generation with Degraded Background | Arnab Poddar, Soumyadeep Dey, Pratik Jawanpuria, Jayanta Mukhopadhyay and Prabir Kumar Biswas
4995 | Information Extraction from Documents: Question Answering vs Token Classification in real-world setups | Laurent Lam, Pirashanth Ratnamogan, Joël Tang, William Vanhuffel and Fabien Caspani
5340 | A Holistic Approach for Aligned Music and Lyrics Transcription | Juan C. Martinez-Sevilla, Antonio Rios-Vila, Francisco J. Castellanos and Jorge Calvo-Zaragoza
5871 | The Bullinger Writer Adaptation Challenge | Anna Scius-Bertrand and Andreas Fischer
5959 | HisDoc R-CNN: Robust Chinese Historical Document Text Line Detection with Dynamic Rotational Proposal Network and Iterative Attention Head | Cheng Jian, Lianwen Jin, Lingyu Liang and Chongyu Liu
6936 | Zero-shot Generation of Training Data with Denoising Diffusion Probabilistic Model for Handwritten Chinese Character Recognition | Dongnan Gui, Kai Chen, Haisong Ding and Qiang Huo
7163 | Multimodal Scoring Model for Handwritten Chinese Essay | Tonghua Su, Jifeng Wang, Hongming You and Zhongjie Wang
7277 | Structure Diagram Recognition in Financial Announcements | Meixuan Qiao, Jun Wang, Junfu Xiang, Qiyu Hou and Ruixuan Li
7655 | Keyword Spotting Simplified: A Segmentation-Free Approach using Character Counting and CTC re-scoring | George Retsinas, Giorgos Sfikas and Christophoros Nikou
7705 | Re-thinking Text Clustering for Images with Text | Shwet Kamal Mishra, Soham Joshi and Viswanath Gopalakrishnan
8247 | An Encoder-Decoder Method with Position-Aware for Printed Mathematical Expression Recognition | Quan Hong, Jun Long and Liu Yang
8444 | A multi-level synthesis strategy for online handwritten chemical equation recognition | Haoyang Shen, Jinrong Li, Jianmin Lin and Wei Wu
8654 | BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset | Md. Istiak Hossain Shihab, Md. Rakibul Hasan, Mahfuzur Rahman Emon, Syed Mobassir Hossen, Md. Nazmuddoha Ansary, Intesur Ahmed, Fazle Rabbi Rakib, Shahriar Elahi Dhruvo, Souhardya Saha Dip, Akib Hasan Pavel, Marsia Haque Meghla, Md. Rezwanul Haque, Sayma Sultana Chowdhury, Farig Sadeque, Tahsin Reasat, Ahmed Imtiaz Humayun and Asif Shahriyar Sushmit
8783 | SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation | Ayan Banerjee, Sanket Biswas, Josep Lladós and Umapada Pal
9104 | A Unified Document-level Chinese Discourse Parser on Different Granularity Levels | Weihao Liu, Feng Jiang, Yaxin Fan, Xiaomin Chu, Peifeng Li and Qiaoming Zhu
9420 | Context Aware Document Binarization and Its Application to Information Extraction from Structured Documents | Ján Koloda and Jue Wang
9527 | Context and Structure Understanding Oriented Chart Object Detection | Pengyu Yan, Saleem Ahmed and David Doermann
9548 | SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation | Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya and Umapada Pal
9623 | Generalization of Fine Granular Extractions from Charts | Shubham Singh Paliwal, Manasi Patwardhan and Lovekesh Vig
9627 | A hybrid model for multilingual OCR | David Etter, Cameron Carpenter and Nolan King
9679 | Towards Writer Retrieval for Historical Datasets | Marco Peer, Florian Kleber and Robert Sablatnig
9690 | DTDT: Highly Accurate Dense Text Line Detection in Historical Documents via Dynamic Transformer | Haiyang Li, Chongyu Liu, Jiapeng Wang, Mingxin Huang, Weiying Zhou and Lianwen Jin
9711 | DQ-DETR: Dynamic Queries Enhanced Detection Transformer for Arbitrary Shape Text Detection | Chixiang Ma, Lei Sun, Jiawei Wang and Qiang Huo

Poster Presentations

125 | A Shallow Graph Neural Network with Innovative Node Updating for Online Handwritten Stroke Classification | Yan-Rong Wang, Da-Han Wang, Xiao-Long Yun, Yan-Ming Zhang, Fei Yin and Shunzhi Zhu
171 | Improving Handwritten OCR with Training Samples Generated by Glyph Conditional Denoising Diffusion Probabilistic Model | Haisong Ding, Bozhi Luan, Dongnan Gui, Kai Chen and Qiang Huo
200 | MIDV-Holo: a dataset for ID document hologram detection in a video stream | Leisan Koliaskina, Ekaterina Emelianova, Daniil Tropin, Vladimir Popov, Konstantin Bulatov, Dmitry Nikolaev and Vladimir V. Arlazarov
491 | Aligning benchmark datasets for table structure recognition | Brandon Smock, Rohith Pesala and Robin Abraham
590 | Improved Learning for Online Handwritten Chinese Text Recognition with Convolutional Prototype Network | Yi Chen, Heng Zhang and Cheng-Lin Liu
832 | Vision Conformer: Incorporating Convolutions into Vision Transformer Layers | Brian Kenji Iwana and Akihiro Kusuda
864 | Transductive Learning for Near-Duplicate Image Detection in Scanned Photo Collections | Francesc Net, Marc Folia, Pep Casals-Puig and Lluis Gomez
1118 | Modeling Cross-layer Interaction for Chinese Calligraphy Style Classification | Zhigang Li, Li Liu, Taorong Qiu, Yue Lu and Ching Y. Suen
1120 | Evaluation of different tagging schemes for Named Entity Recognition in Handwritten Documents | David Villanova-Aparisi, Carlos David Martinez-Hinarejos, Verónica Romero and Moisés Pastor-Gadea
1419 | Analyzing the Impact of Tokenization on Multilingual Epidemic Surveillance in Low-resource Languages | Stephen Mutuvi, Emanuela Boros, Antoine Doucet, Adam Jatowt, Gaël Lejeune and Moses Odeo
1429 | Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation | Renshen Wang, Yasuhisa Fujii and Alessandro Bissacco
1442 | Exploring Semantic Word Representations for Recognition-free NLP on Handwritten Document Images | Oliver Tüselmann and Gernot A. Fink
1633 | DAMGCN: Entity Linking in Visually Rich Documents with Dependency-Aware Multimodal Graph Convolutional Network | Yi-Ming Chen, Xiang-Ting Hou, Dong-Fang Lou, Zhi-Lin Liao and Cheng-Lin Liu
1827 | TDAE: Text Detection with Affinity Areas and Evolution Strategies | Kefan Ma, Yuchen Luo, Zheng Huang, Kai Chen, Jie Guo and Weidong Qiu
1887 | OCR Language Models with Custom Vocabularies | Peter Garst, Yasuhisa Fujii and Reeve Ingle
1934 | Incremental Learning and Ambiguity Rejection for Document Classification | Tri-Cong Pham, Mickaël Coustaty, Aurélie Joseph, Vincent Poulain D’Andecy, Muriel Visani and Nicolas Sidere
2013 | LineFormer: Line Chart Data Extraction using Instance Segmentation | Jay Lal, Aditya Mitkari, Mahesh Bhosale and David Doermann
2095 | A Unified Architecture for Urdu Printed and Handwritten Text Recognition | Arooba Maqsood, Nauman Riaz, Adnan Ul-Hasan and Faisal Shafait
2100 | Analysing Textual Information from Financial Statements for Default Prediction | Chinesh Doshi, Himani Shrotriya, Rohit Bhiogade, Himanshu Sharad Bhatt and Abhishek Jha
2111 | Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution | Jianfeng Kuang, Wei Hua, Dingkang Liang, Mingkun Yang, Deqiang Jiang, Bo Ren and Xiang Bai
2121 | Line-of-sight with Graph Attention Parser (LGAP) for Math Formulas | Ayush Kumar Shah and Richard Zanibbi
2194 | EEBO-Verse: Sifting for Poetry in Large Early Modern Corpora using Visual Features | Danlu Chen, Nan Jiang and Taylor Berg-Kirkpatrick
2309 | A Graphical Approach to Document Layout Analysis | Jilin Wang, Michael Krumdick, Baojia Tong, Delphine Vendryes, Hamima Halim, Maxim Sokolov, Vadym Barda and Chris Tanner
2311 | Scene Text Recognition with Image-Text Matching-guided Dictionary | Jiajun Wei, Hongjian Zhan, Xiao Tu, Yue Lu and Umapada Pal
2566 | PyramidTabNet: Transformer based Table Recognition in Image-based Documents | Muhammad Umer, Ahmed Mohsin, Adnan Ul-Hasan and Faisal Shafait
2627 | Gaussian Kernels based Network for Multiple License Plate Number Detection in Day-Night Images | Soumi Das, Shivakumara Palaiahnakote, Umapada Pal and Raghavendra Ramachandra
2678 | Ensuring an error-free transcription on a full engineering tags dataset through unsupervised Post-OCR methods | Mathieu Francois and Véronique Eglin
2745 | Sampling and Ranking for Digital Ink Generation on a tight computational budget | Andrii Maksai, Andrei Afonin, Aleksandr Timofeev and Claudiu Musat
2771 | Unraveling confidence: examining confidence scores as proxy for OCR quality | Mirjam Cuper, Corine van Dongen and Tineke Koster
2850 | E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation | Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou and Chengqing Zong
3015 | RealCQA: Scientific Chart Question Answering as a Test-bed for First-Order Logic | Saleem Ahmed, Bhavin Jawade, Shubham Pandey, Srirangaraj Setlur and Venu Govindaraju
3165 | An Iterative Graph Learning Convolution Network for Key Information Extraction Based on the Document Inductive Bias | Jiyao Deng, Yi Zhang, Xinpeng Zhang, Zhi Tang and Liangcai Gao
3232 | Open-Set Text Recognition via Shape-Awareness Visual Reconstruction | Chang Liu, Chun Yang and Xu-Cheng Yin
3409 | Accelerating Transformer-Based Scene Text Detection and Recognition via Token Pruning | Sergi Garcia-Bordils, Dimosthenis Karatzas and Marçal Rusiñol
3475 | Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings using a Joint Loss Function | Joseph Attieh, Abraham Woubie Zewoudie, Vladimir Vlassov, Adrian Flanagan and Tom Bäckström
3789 | Linguistic Knowledge within Handwritten Text Recognition Models: A Real-World Case Study | Samuel Londner, Yoav Phillips, Hadar Miller, Nachum Dershowitz, Tsvi Kuflik and Moshe Lavee
3792 | FTDNet: Joint Semantic Learning for Scene Text Detection in Adverse Weather Conditions | Jiakun Tian, Gang Zhou, Yangxin Liu, En Deng and Zhenhong Jia
3833 | DocParser: end-to-end OCR-free information extraction from Visually Rich Documents | Mohamed Dhouib, Ghassen Bettaieb and Aymen Shabou
3928 | Ambigram Generation by A Diffusion Model | Takahiro Shirakawa and Seiichi Uchida
4033 | Decoupled Learning for Long-Tailed Oracle Character Recognition | Jing Li, Bin Dong, Qiu-Feng Wang, Lei Ding, Rui Zhang and Kaizhu Huang
4066 | Analyzing Font Style Usage and Contextual Factors in Real Images | Naoya Yasukochi, Hideaki Hayashi, Daichi Haraguchi and Seiichi Uchida
4083 | Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-end Handwritten Document Recognition | Denis Coquenet, Clément Chatelain and Thierry Paquet
4131 | QuOTeS: Query-Oriented Technical Summarization | Juan Antonio Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Axel J. Soto, Flavia P. Zanoto and Evangelos Milios
4178 | MUGS: A Multiple Granularity Semi-Supervised Method for Text Recognition | Qi Song, Qianyi Jiang, Wang Lei, Lingling Zhao and Rui Zhang
4204 | Text Enhancement: Scene Text Recognition in Hazy Weather | En Deng, Gang Zhou, Jiakun Tian, Yangxin Liu and Zhenhong Jia
4287 | Shared-Operation Hypercomplex Networks for Handwritten Text Recognition | Giorgos Sfikas, George Retsinas, Panagiotis Dimitrakopoulos, Basilis Gatos and Christophoros Nikou
4289 | A Hybrid Approach to Document Layout Analysis for Heterogeneous Document Images | Zhuoyao Zhong, Jiawei Wang, Haiqing Sun, Kai Hu, Erhan Zhang, Lei Sun and Qiang Huo
4319 | ColDBin: Cold Diffusion for Document Image Binarization | Saifullah Saifullah, Stefan Agne, Andreas Dengel and Sheraz Ahmed
4485 | You Only Look for a Symbol Once: An Object Detector for Symbols and Regions in Documents | William Smith and Toby Pillatt
4548 | SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition | Junyi Zhang, Chang Liu and Chun Yang
4601 | DSS: Synthesizing long Digital Ink using Data augmentation, Style encoding and Split generation | Aleksandr Timofeev, Anastasiia Fadeeva, Andrii Maksai, Claudiu Musat and Andrei Afonin
4804 | A Benchmark of Nested Named Entity Recognition Approaches in Historical Structured Documents | Solenn Tual, Nathalie Abadie, Bertrand Duménieu, Joseph Chazalon and Edwin Carlinet
5000 | Reading Between the Lanes: Text VideoQA on the Road | George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas and C.V. Jawahar
5003 | Line Graphics Digitization: A Step Towards Full Automation | Omar Moured, Jiaming Zhang, Alina Roitberg, Thorsten Schwarz and Rainer Stiefelhagen
5017 | TACTFUL: A framework for Targeted Active Learning for Document Analysis | Venkatapathy Subramanian, Sagar Poudel, Ganesh Ramakrishnan and Parag Chaudhuri
5117 | “Explain Thyself Bully”: Sentiment Aided Cyberbullying Detection with Explanation | Krishanu Maity, Prince Jha, Raghav Jain, Sriparna Saha and Pushpak Bhattacharyya
5155 | CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data | Michał Turski, Tomasz Stanisławek, Karol Kaczmarek, Paweł Dyda and Filip Graliński
5441 | LayoutGCN: A Lightweight Architecture for Visually Rich Document Understanding | Dengliang Shi, Siliang Liu, Jintao Du and Huijia Zhu
5525 | TPFNet: A Novel Text In-painting Transformer for Text Removal | Onkar Susladkar, Dhruv Makwana, Gayatri Deshmukh, Sparsh Mittal, R Sai Chandra Teja and Rekha Singhal
5671 | Linear Object Detection in Document Images using Multiple Object Tracking | Philippe Bernet, Joseph Chazalon, Edwin Carlinet, Alexandre Bourquelot and Elodie Puybareau
5935 | ESTER-Pt: An Evaluation Suite for TExt Recognition in Portuguese | Moniele Kunrath Santos, Guilherme Bazzo, Lucas Lima de Oliveira and Viviane P. Moreira
5939 | Topic Shift Detection in Chinese Dialogues: Corpus and Benchmark | Jiangyi Lin, Yaxin Fan, Feng Jiang, Xiaomin Chu and Peifeng Li
5951 | End-to-end Multi-line License Plate Recognition with Cascaded Perception | Song-Lu Chen, Qi Liu, Feng Chen and Xu-Cheng Yin
6036 | Precise Segmentation for Children Handwriting Analysis by Combining Multiple Deep Models with Online Knowledge | Simon Corbillé, Éric Anquetil and Élisa Fromont
6077 | Augraphy: A Data Augmentation Library for Document Images | Alexander Groleau, Kok Wei Chee, Stefan Larson, Samay Maini and Jonathan Boarman
6359 | TRACE: Table Reconstruction Aligned to Corner and Edges | Youngmin Baek, Daehyun Nam, Jaeheung Surh, Seung Shin and Seonghyeon Kim
6471 | Fine-tuning Vision Encoder-Decoder Transformers for Handwriting Text Recognition on Historical Documents | Daniel Parres Montoya and Roberto Paredes Palacios
6475 | Detecting Forged Receipts with Domain-specific Ontology-based Entities & Relations | Beatriz Martínez Tornés, Emanuela Boros, Petra Gomez-Krämer, Antoine Doucet and Jean-Marc Ogier
6512 | Evaluating Adversarial Robustness on Document Image Classification | Timothée Fronteau, Arnaud Paran and Aymen Shabou
6516 | UTRNet: High-Resolution Urdu Text Recognition In Printed Documents | Abdur Rahman, Chetan Arora and Arjun Ghosh
6754 | Contour Completion by Transformers and Its Application to Vector Font Data | Yusuke Nagata, Brian Kenji Iwana and Seiichi Uchida
6780 | CED: Catalog Extraction from Documents | Tong Zhu, Guoliang Zhang, Zechang Li, Zijian Yu, Junfei Ren, Mengsong Wu, Zhefeng Wang, Baoxing Huai, Pingfu Chao and Wenliang Chen
7047 | TextREC: a Dataset for Referring Expression Comprehension with Reading Comprehension | Chenyang Gao, Biao Yang, Hao Wang, Mingkun Yang, Wenwen Yu, Yuliang Liu and Xiang Bai
7080 | Layout Analysis of Historical Document Images using a Light Fully Convolutional Network | Najoua Rahal, Lars Vögtlin and Rolf Ingold
7131 | A Character-level Document Key Information Extraction Method with Contrastive Learning | Xinpeng Zhang, Liangcai Gao and Jiyao Deng
7310 | Finetuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition | Jan Kohút and Michal Hradiš
7319 | Combining OCR Models for Reading Early Modern Books | Mathias Seuret, Janne van der Loop, Nikolaus Weichselbaumer, Martin Mayr, Janina Molnar, Tatjana Hass and Vincent Christlein
7403 | Incremental Teacher Model with Mixed Augmentations and Scheduled Pseudo-Label Loss for Handwritten Text Recognition | Masayuki Honda, Hung Tuan Nguyen, Cuong Tuan Nguyen, Cong Kha Nguyen, Ryosuke Odate, Takashi Kanemaru and Masaki Nakagawa
7663 | AFFGANwriting: A handwriting image generation method based on multi-feature fusion | Heng Wang, Yiming Wang and Hongxi Wei
7707 | Towards Making Flowchart Images Machine Interpretable | Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav and Anand Mishra
7741 | SeamFormer: High Precision Text Line Segmentation for Handwritten Documents | Niharika Vadlamudi, Rahul Krishna and Ravi Kiran Sarvadevabhatla
7774 | SIMARA: a database for key-value information extraction from full-page handwritten documents | Solène Tarride, Mélodie Boillet, Jean-François Moufflet and Christopher Kermorvant
7991 | On Web-based Visual Corpus Construction for Visual Document Understanding | DongHyun Kim, Teakgyu Hong, Moonbin Yim, Yoonsik Kim and Geewook Kim
8156 | SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion between Explicit and Implicit Segmentation | Jiarong Huang, Dezhi Peng, Hongliang Li, Hao Ni and Lianwen Jin
8519 | DocImagen: Diffusion Model for Layout Conditioned Document Image Generation | Noman Tanveer, Adnan Ul-Hasan and Faisal Shafait
8595 | Detecting Text on Historical Maps by Selecting Best Candidates of Deep Neural Networks Output | Gerasimos Matidis, Basilis Gatos, Anastasios Kesidis and Panagiotis Kaddas
8630 | Adversarial Attacks on Convolutional Siamese Signature Verification Networks | Maham Jahangir, Muhammad Imran Malik and Faisal Shafait
8652 | EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers | Liufeng Huang, Bangdong Chen, Chongyu Liu, Dezhi Peng, Weiying Zhou, Yaqiang Wu, Hui Li, Hao Ni and Lianwen Jin
8727 | A System for Processing and Recognition of Greek Byzantine and Post-Byzantine Documents | Panagiotis Kaddas, Konstantinos Palaiologos, Basilis Gatos, Vassilis Katsouros and Katerina Christopoulou
8939 | Multimodal Rumour Detection: Catching news that never transpired! | Raghvendra Kumar, Ritika Sinha, Sriparna Saha and Adam Jatowt
9048 | Towards Writing Style Adaptation in Handwriting Recognition | Jan Kohút, Michal Hradiš and Martin Kišš
9308 | Formerge: Recover spanning cells in complex table structure using transformer network | Nam Quan Nguyen, Anh Duy Le, Anh Khoa Lu, Xuan Toan Mai and Tuan Anh Tran
9362 | GriTS: Grid table similarity metric for table structure recognition | Brandon Smock, Rohith Pesala and Robin Abraham
9403 | Semantic triple-assisted learning for question answering passage re-ranking | Dinesh Nagumothu, Bahadorreza Ofoghi and Peter Eklund
9559 | I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection | Yongzhu Chang, Rongsheng Zhang and Jiashu Pu
9669 | Historical document image segmentation combining deep learning and Gabor features | Maroua Mehri, Akrem Sellami and Salvatore Tabbone
9806 | Group, Contrast and Recognize: A Self-supervised Method for Chinese Character Recognition | Xinzhe Jiang, Jun Du, Pengfei Hu, Mobai Xue, Jiefeng Ma, Jiajia Wu and Jianshu Zhang
9867 | Receipt Dataset for Document Forgery Detection | Beatriz Martínez Tornés, Théo Taburet, Emanuela Boros, Kais Rouis, Petra Gomez-Krämer, Nicolas Sidere, Antoine Doucet and Vincent Poulain d’Andecy
9897 | Content-Aware Urdu Handwriting Generation | Zeeshan Memon, Adnan Ul-Hasan and Faisal Shafait
9904 | Weakly supervised information extraction from inscrutable handwritten document images | Sujoy Paul, Gagan Madan, Akankshya Mishra, Narayan Hegde, Pradeep Kumar and Gaurav Aggarwal
9981 | Information Redundancy and Biases in Public Document Information Extraction Benchmarks | Seif Edinne Laatiri, Pirashanth Ratnamogan, Joël Tang, Laurent Lam, William Vanhuffel and Fabien Caspani

Competitions

inv-2 | ICDAR 2023 Competition on Video Text Reading for Dense and Small Text | Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas and Xiang Bai
inv-4 | ICDAR 2023 Competition on Document UnderstanDing of Everything (DUDE) | Jordy Van Landeghem, Rubèn Tito, Łukasz Borchmann, Michał Pietruszka, Dawid Jurkiewicz, Rafał Powalski, Paweł Józiak, Sanket Biswas, Mickaël Coustaty and Tomasz Stanisławek
inv-5 | ICDAR 2023 Competition on Indic Handwriting Text Recognition | Ajoy Mondal and C. V. Jawahar
inv-6 | ICDAR 2023 Competition on Visual Question Answering on Business Document Images | Sachin Raja, Ajoy Mondal and C. V. Jawahar
inv-7 | ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents | Christoph Auer, Ahmed Nassar, Maksym Lysak, Michele Dolfi, Nikolaos Livathinos and Peter Staar
inv-8 | ICDAR 2023 Competition on Hierarchical Text Detection and Recognition | Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii and Michalis Raptis
inv-9 | ICDAR 2023 Competition on Detection and Recognition of Greek Letters on Papyri | Mathias Seuret, Isabelle Marthot-Santaniello, Stephen A. White, Olga Serbaeva Saraogi, Selaudin Agolli, Guillaume Carrière, Dalia Rodriguez-Salas and Vincent Christlein
inv-10 | ICDAR 2023 Competition on Born Digital Video Text Question Answering | Zhibo Yang, Xiaoge Song, Sibo Song, Tong Lu, Xiang Bai, Cheng-Lin Liu, Fei Huang and Cong Yao
inv-11 | ICDAR 2023 Competition on Reading the Seal Title | Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas and Xiang Bai
inv-12 | ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images | Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang and Xiang Bai
inv-13 | ICDAR 2023 CROHME: Competition on Recognition of Handwritten Mathematical Expressions | Yejing Xie, Harold Mouchère, Foteini Simistira Liwicki, Sumit Rakesh, Rajkumar Saini, Masaki Nakagawa, Cuong Tuan Nguyen and Thanh-Nghia Truong
inv-14 | ICDAR 2023 Competition on Recognition of Multi-line Handwritten Mathematical Expressions | Chenyang Gao, Yuliang Liu, Shiyu Yao, Jinfeng Bai, Xiang Bai, Lianwen Jin and Cheng-Lin Liu
inv-15 | ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition | George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas and C V Jawahar
inv-16 | ICDAR 2023 Competition on Detecting Tampered Text in Images | Dongliang Luo, Yu Zhou, Rui Yang, Yuliang Liu, Xianjin Liu, Jishen Zeng, Enming Zhang, Biao Yang, Ziming Huang, Lianwen Jin and Xiang Bai