Remote Presentation Links for the Session and Poster Tracks can be found on the Program Overview.

The ICDAR 2023 main conference session presentation details can be found below.

Note: All times are Pacific Daylight Time (PDT).

Note: Links to individual paper DOIs will begin working once the publisher finalizes the proceedings.

Oral Session 1 – Graphics 1: Graphics Recognition

Chair: Richard Zanibbi
Monday, August 21, 2023 – 10:50-12:30 PDT

O1.1 | 5340 | A Holistic Approach for Aligned Music and Lyrics Transcription | Juan C. Martinez-Sevilla, Antonio Rios-Vila, Francisco J. Castellanos and Jorge Calvo-Zaragoza
O1.2 | J151 | End-to-end Optical Music Recognition for Pianoform Sheet Music (IJDAR Track) | Antonio Ríos-Vila, David Rizo, José M. Iñesta, Jorge Calvo-Zaragoza
O1.3 | 8444 | A multi-level synthesis strategy for online handwritten chemical equation recognition | Haoyang Shen, Jinrong Li, Jianmin Lin and Wei Wu
O1.4 | 9527 | Context and Structure Understanding Oriented Chart Object Detection | Pengyu Yan, Saleem Ahmed and David Doermann
O1.5 | 3117 | SCI-3000: A Dataset for Figure, Table and Caption Extraction from Scientific PDFs | Filip Darmanović, Allan Hanbury and Markus Zlabinger
Oral Session 2 – D-NLP 1: Document NLP

Chair: Rajiv Jain
Monday, August 21, 2023 – 10:50-12:30 PDT

O2.1 | 1653 | Consistent Nested Named Entity Recognition in handwritten documents via Lattice Rescoring | David Villanova-Aparisi, Carlos David Martinez-Hinarejos, Verónica Romero and Moisés Pastor-Gadea
O2.2 | 424 | Search for Hyphenated Words in Probabilistic Indices: a Machine Learning Approach | José Andrés, Alejandro H. Toselli and Enrique Vidal
O2.3 | 9104 | A Unified Document-level Chinese Discourse Parser on Different Granularity Levels | Weihao Liu, Feng Jiang, Yaxin Fan, Xiaomin Chu, Peifeng Li and Qiaoming Zhu
O2.4 | J158 | LSTM-Based Siamese Neural Network for Urdu News Story Segmentation (IJDAR Track) | Muhammad Nauman Ahmed Bhatti, Imran Siddiqi, Momina Moetesum
O2.5 | J140 | Large Scale Genealogical Information Extraction From Handwritten Quebec Parish Records (IJDAR Track) | Solène Tarride, Martin Maarand, Mélodie Boillet, James McGrath, Eugénie Capel, Hélène Vézina, Christopher Kermorvant
Oral Session 3 – Graphics 2: Tables and Charts

Chair: Jean-Christophe Burie
Monday, August 21, 2023 – 16:00-18:00 PDT

O3.1 | 1070 | A Study on Reproducibility and Replicability of Table Structure Recognition Methods | Kehinde Ajayi, Muntabir Choudhury, Sarah Rajtmajer and Jian Wu
O3.2 | 3372 | An End-to-End Local Attention Based Model for Table Recognition | Nam Tuan Ly and Atsuhiro Takasu
O3.3 | 1710 | Optimized Table Tokenization for Table Structure Recognition | Maksym Lysak, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer and Peter Staar
O3.4 | 1221 | Towards End-to-End Semi-Supervised Table Detection with Deformable Transformer | Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Marcus Liwicki and Muhammad Zeshan Afzal
O3.5 | 897 | SpaDen: Sparse and Dense Keypoint Estimation for Real-World Chart Understanding | Saleem Ahmed, David Doermann, Srirangaraj Setlur, Venu Govindaraju and Pengyu Yan
O3.6 | 9623 | Generalization of Fine Granular Extractions from Charts | Shubham Singh Paliwal, Manasi Patwardhan and Lovekesh Vig
Oral Session 4 – D-NLP 2: Information Extraction

Chair: Josep Llados
Monday, August 21, 2023 – 16:00-18:00 PDT

O4.1 | 286 | Improving Information Extraction from Semi-Structured Documents Using Attention based Semi-variational Graph Auto-encoder | Djedjiga Belhadj, Abdel Belaïd and Yolande Belaïd
O4.2 | 673 | Language Independent Neuro-Symbolic Semantic Parsing for Form Understanding | Bhanu Prakash Voutharoja, Lizhen Qu and Fatemeh Shiri
O4.3 | 910 | DocILE Benchmark for Document Information Localization and Extraction | Štěpán Šimsa, Milan Šulc, Michal Uřičář, Yash Patel, Ahmed Hamdi, Matěj Kocián, Matyáš Skalický, Jiří Matas, Antoine Doucet, Mickaël Coustaty and Dimosthenis Karatzas
O4.4 | 2969 | Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks | Le Xue, Mingfei Gao, Zeyuan Chen, Caiming Xiong and Ran Xu
O4.5 | 1200 | Key-value information extraction from full handwritten pages | Solène Tarride, Mélodie Boillet and Christopher Kermorvant
O4.6 | 4995 | Information Extraction from Documents: Question Answering vs Token Classification in real-world setups | Laurent Lam, Pirashanth Ratnamogan, Joël Tang, William Vanhuffel and Fabien Caspani
Oral Session 5 – Applications 1: Medical, Legal, and Financial

Chair: Elisa Barney Smith
Tuesday, August 22, 2023 – 09:00-10:20 PDT

O5.1 | 855 | Multi-Stage Fine-tuning Deep Learning Models Improves Automatic Assessment of the Rey-Osterrieth Complex Figure Test | Benjamin Schuster, Florian Kordon, Martin Mayr, Mathias Seuret and Vincent Christlein
O5.2 | 7277 | Structure Diagram Recognition in Financial Announcements | Meixuan Qiao, Jun Wang, Junfu Xiang, Qiyu Hou and Ruixuan Li
O5.3 | 2113 | TransDocAnalyser: A framework for semi-structured offline handwritten documents analysis with an application to legal domain | Sagar Chakraborty, Gaurav Harit and Saptarshi Ghosh
O5.4 | J161 | Inv3D: A High-Resolution 3D Invoice Dataset for Template-Guided Single-Image Document Unwarping (IJDAR Track) | Felix Hertlein, Alexander Naumann, Patrick Philipp
Oral Session 6 – Handwriting 1: Online Documents

Chair: Gernot Fink
Tuesday, August 22, 2023 – 09:00-10:20 PDT

O6.1 | J147 | Online Handwriting Trajectory Reconstruction from Kinematic Sensors using Temporal Convolutional Network (IJDAR Track) | Wassim Swaileh, Florent Imbert, Yann Soullard, Romain Tavenard, Eric Anquetil
O6.2 | J163 | IAMonSense: Multi-level Handwriting Classification using Spatio-temporal Information (IJDAR Track) | Ahmad Mustafid, Junaid Younas, Paul Lukowicz, Sheraz Ahmed
O6.3 | 2503 | SET, SORT! A Novel Sub-Stroke Level Transformer for Offline Handwriting to Online Conversion | Elmokhtar Mohamed Moussa, Thibault Lelore and Harold Mouchère
O6.4 | 4206 | Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation | Michael Jungo, Beat Wolf, Andrii Maksai, Claudiu Musat and Andreas Fischer
Oral Session 7 – DAR 1: Document Layout Analysis

Chair: Koichi Kise
Tuesday, August 22, 2023 – 10:50-12:30 PDT

O7.1 | 8783 | SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation | Ayan Banerjee, Sanket Biswas, Josep Lladós and Umapada Pal
O7.2 | 8654 | BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset | Md. Istiak Hossain Shihab, Md. Rakibul Hasan, Mahfuzur Rahman Emon, Syed Mobassir Hossen, Md. Nazmuddoha Ansary, Intesur Ahmed, Fazle Rabbi Rakib, Shahriar Elahi Dhruvo, Souhardya Saha Dip, Akib Hasan Pavel, Marsia Haque Meghla, Md. Rezwanul Haque, Sayma Sultana Chowdhury, Farig Sadeque, Tahsin Reasat, Ahmed Imtiaz Humayun and Asif Shahriyar Sushmit
O7.3 | 9548 | SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation | Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya and Umapada Pal
O7.4 | J182 | Line Extraction in Handwritten Documents via Instance Segmentation (IJDAR Track) | Adeela Islam, Tayaba Anjum, Nazar Khan
O7.5 | 3827 | Diffusion-based document layout generation | Liu He, Yijuan Lu, John Corring, Dinei Florencio and Cha Zhang
Oral Session 8 – Handwriting 2: Historical Documents

Chair: Rolf Ingold
Tuesday, August 22, 2023 – 10:50-12:30 PDT

O8.1 | 9690 | DTDT: Highly Accurate Dense Text Line Detection in Historical Documents via Dynamic Transformer | Haiyang Li, Chongyu Liu, Jiapeng Wang, Mingxin Huang, Weiying Zhou and Lianwen Jin
O8.2 | 5871 | The Bullinger Writer Adaptation Challenge | Anna Scius-Bertrand and Andreas Fischer
O8.3 | 9679 | Towards Writer Retrieval for Historical Datasets | Marco Peer, Florian Kleber and Robert Sablatnig
O8.4 | 5959 | HisDoc R-CNN: Robust Chinese Historical Document Text Line Detection with Dynamic Rotational Proposal Network and Iterative Attention Head | Cheng Jian, Lianwen Jin, Lingyu Liang and Chongyu Liu
O8.5 | 7655 | Keyword Spotting Simplified: A Segmentation-Free Approach using Character Counting and CTC re-scoring | George Retsinas, Giorgos Sfikas and Christophoros Nikou
Poster Session 1

Tuesday, August 22, 2023 – 14:30-16:00 PDT

P1.1 | 1120 | Evaluation of different tagging schemes for Named Entity Recognition in Handwritten Documents | David Villanova-Aparisi, Carlos David Martinez-Hinarejos, Verónica Romero and Moisés Pastor-Gadea | D-NLP
P1.2 | 1633 | DAMGCN: Entity Linking in Visually Rich Documents with Dependency-Aware Multimodal Graph Convolutional Network | Yi-Ming Chen, Xiang-Ting Hou, Dong-Fang Lou, Zhi-Lin Liao and Cheng-Lin Liu | D-NLP
P1.3 | 3015 | RealCQA: Scientific Chart Question Answering as a Test-bed for First-Order Logic | Saleem Ahmed, Bhavin Jawade, Shubham Pandey, Srirangaraj Setlur and Venu Govindaraju | D-NLP
P1.4 | 4131 | QuOTeS: Query-Oriented Technical Summarization | Juan Antonio Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Axel J. Soto, Flavia P. Zanoto and Evangelos Milios | D-NLP
P1.5 | 5117 | “Explain Thyself Bully”: Sentiment Aided Cyberbullying Detection with Explanation | Krishanu Maity, Prince Jha, Raghav Jain, Sriparna Saha and Pushpak Bhattacharyya | D-NLP
P1.6 | 5939 | Topic Shift Detection in Chinese Dialogues: Corpus and Benchmark | Jiangyi Lin, Yaxin Fan, Feng Jiang, Xiaomin Chu and Peifeng Li | D-NLP
P1.7 | 6780 | CED: Catalog Extraction from Documents | Tong Zhu, Guoliang Zhang, Zechang Li, Zijian Yu, Junfei Ren, Mengsong Wu, Zhefeng Wang, Baoxing Huai, Pingfu Chao and Wenliang Chen | D-NLP
P1.8 | 8939 | Multimodal Rumour Detection: Catching news that never transpired! | Raghvendra Kumar, Ritika Sinha, Sriparna Saha and Adam Jatowt | D-NLP
P1.9 | 9559 | I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection | Yongzhu Chang, Rongsheng Zhang and Jiashu Pu | D-NLP
P1.10 | 7991 | On Web-based Visual Corpus Construction for Visual Document Understanding | DongHyun Kim, Teakgyu Hong, Moonbin Yim, Yoonsik Kim and Geewook Kim | Data and Synthesis
P1.11 | 4066 | Analyzing Font Style Usage and Contextual Factors in Real Images | Naoya Yasukochi, Hideaki Hayashi, Daichi Haraguchi and Seiichi Uchida | Data and Synthesis
P1.12 | 5935 | ESTER-Pt: An Evaluation Suite for TExt Recognition in Portuguese | Moniele Kunrath Santos, Guilherme Bazzo, Lucas Lima de Oliveira and Viviane P. Moreira | Data and Synthesis
P1.13 | 7047 | TextREC: a Dataset for Referring Expression Comprehension with Reading Comprehension | Chenyang Gao, Biao Yang, Hao Wang, Mingkun Yang, Wenwen Yu, Yuliang Liu and Xiang Bai | Data and Synthesis
P1.14 | 8519 | DocImagen: Diffusion Model for Layout Conditioned Document Image Generation | Noman Tanveer, Adnan Ul-Hasan and Faisal Shafait | Data and Synthesis
P1.15 | 8652 | EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers | Liufeng Huang, Bangdong Chen, Chongyu Liu, Dezhi Peng, Weiying Zhou, Yaqiang Wu, Hui Li, Hao Ni and Lianwen Jin | Data and Synthesis
P1.16 | 491 | Aligning benchmark datasets for table structure recognition | Brandon Smock, Rohith Pesala and Robin Abraham | Graphics
P1.17 | 2121 | Line-of-sight with Graph Attention Parser (LGAP) for Math Formulas | Ayush Kumar Shah and Richard Zanibbi | Graphics
P1.18 | 5003 | Line Graphics Digitization: A Step Towards Full Automation | Omar Moured, Jiaming Zhang, Alina Roitberg, Thorsten Schwarz and Rainer Stiefelhagen | Graphics
P1.19 | 6359 | TRACE: Table Reconstruction Aligned to Corner and Edges | Youngmin Baek, Daehyun Nam, Jaeheung Surh, Seung Shin and Seonghyeon Kim | Graphics
P1.20 | 7707 | Towards Making Flowchart Images Machine Interpretable | Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav and Anand Mishra | Graphics
P1.21 | 9362 | GriTS: Grid table similarity metric for table structure recognition | Brandon Smock, Rohith Pesala and Robin Abraham | Graphics
P1.22 | 171 | Improving Handwritten OCR with Training Samples Generated by Glyph Conditional Denoising Diffusion Probabilistic Model | Haisong Ding, Bozhi Luan, Dongnan Gui, Kai Chen and Qiang Huo | Handwriting
P1.23 | 832 | Vision Conformer: Incorporating Convolutions into Vision Transformer Layers | Brian Kenji Iwana and Akihiro Kusuda | Handwriting
P1.24 | 1442 | Exploring Semantic Word Representations for Recognition-free NLP on Handwritten Document Images | Oliver Tüselmann and Gernot A. Fink | Handwriting
P1.25 | 2095 | A Unified Architecture for Urdu Printed and Handwritten Text Recognition | Arooba Maqsood, Nauman Riaz, Adnan Ul-Hasan and Faisal Shafait | Handwriting
P1.26 | 3789 | Linguistic Knowledge within Handwritten Text Recognition Models: A Real-World Case Study | Samuel Londner, Yoav Phillips, Hadar Miller, Nachum Dershowitz, Tsvi Kuflik and Moshe Lavee | Handwriting
P1.27 | 4083 | Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-end Handwritten Document Recognition | Denis Coquenet, Clément Chatelain and Thierry Paquet | Handwriting
P1.28 | 4601 | DSS: Synthesizing long Digital Ink using Data augmentation, Style encoding and Split generation. | Aleksandr Timofeev, Anastasiia Fadeeva, Andrii Maksai, Claudiu Musat and Andrei Afonin | Handwriting
P1.29 | 6471 | Fine-tuning Vision Encoder-Decoder Transformers for Handwriting Text Recognition on Historical Documents | Daniel Parres Montoya and Roberto Paredes Palacios | Handwriting
P1.30 | 7403 | Incremental Teacher Model with Mixed Augmentations and Scheduled Pseudo-Label Loss for Handwritten Text Recognition | Masayuki Honda, Hung Tuan Nguyen, Cuong Tuan Nguyen, Cong Kha Nguyen, Ryosuke Odate, Takashi Kanemaru and Masaki Nakagawa | Handwriting
P1.31 | 7741 | SeamFormer: High Precision Text Line Segmentation for Handwritten Documents | Niharika Vadlamudi, Rahul Krishna and Ravi Kiran Sarvadevabhatla | Handwriting
P1.32 | 8630 | Adversarial Attacks on Convolutional Siamese Signature Verification Networks | Maham Jahangir, Muhammad Imran Malik and Faisal Shafait | Handwriting
P1.33 | 9048 | Towards Writing Style Adaptation in Handwriting Recognition | Jan Kohút, Michal Hradiš and Martin Kišš | Handwriting
P1.34 | 9806 | Group, Contrast and Recognize: A Self-supervised Method for Chinese Character Recognition | Xinzhe Jiang, Jun Du, Pengfei Hu, Mobai Xue, Jiefeng Ma, Jiajia Wu and Jianshu Zhang | Handwriting
P1.35 | 9904 | Weakly supervised information extraction from inscrutable handwritten document images | Sujoy Paul, Gagan Madan, Akankshya Mishra, Narayan Hegde, Pradeep Kumar and Gaurav Aggarwal | Handwriting
P1.36 | 1827 | TDAE: Text Detection with Affinity Areas and Evolution Strategies | Kefan Ma, Yuchen Luo, Zheng Huang, Kai Chen, Jie Guo and Weidong Qiu | Scene Text
P1.37 | 2311 | Scene Text Recognition with Image-Text Matching-guided Dictionary | Jiajun Wei, Hongjian Zhan, Xiao Tu, Yue Lu and Umapada Pal | Scene Text
P1.38 | 3232 | Open-Set Text Recognition via Shape-Awareness Visual Reconstruction | Chang Liu, Chun Yang and Xu-Cheng Yin | Scene Text
P1.39 | 4204 | Text Enhancement: Scene Text Recognition in Hazy Weather | En Deng, Gang Zhou, Jiakun Tian, Yangxin Liu and Zhenhong Jia | Scene Text
P1.40 | 5525 | TPFNet: A Novel Text In-painting Transformer for Text Removal | Onkar Susladkar, Dhruv Makwana, Gayatri Deshmukh, Sparsh Mittal, R Sai Chandra Teja and Rekha Singhal | Scene Text
P1.41 | 1934 | Incremental Learning and Ambiguity Rejection for Document Classification | Tri-Cong Pham, Mickaël Coustaty, Aurélie Joseph, Vincent Poulain D’Andecy, Muriel Visani and Nicolas Sidere | Text & Document Recognition
P1.42 | 2309 | A Graphical Approach to Document Layout Analysis | Jilin Wang, Michael Krumdick, Baojia Tong, Delphine Vendryes, Hamima Halim, Maxim Sokolov, Vadym Barda and Chris Tanner | Text & Document Recognition
P1.43 | 2678 | Ensuring an error-free transcription on a full engineering tags dataset through unsupervised Post-OCR methods | Mathieu Francois and Véronique Eglin | Text & Document Recognition
P1.44 | 3475 | Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings using a Joint Loss Function | Joseph Attieh, Abraham Woubie Zewoudie, Vladimir Vlassov, Adrian Flanagan and Tom Bäckström | Text & Document Recognition
P1.45 | 3833 | DocParser: end-to-end OCR-free information extraction from Visually Rich Documents | Mohamed Dhouib, Ghassen Bettaieb and Aymen Shabou | Text & Document Recognition
P1.46 | 4289 | A Hybrid Approach to Document Layout Analysis for Heterogeneous Document Images | Zhuoyao Zhong, Jiawei Wang, Haiqing Sun, Kai Hu, Erhan Zhang, Lei Sun and Qiang Huo | Text & Document Recognition
P1.47 | 4485 | You Only Look for a Symbol Once: An Object Detector for Symbols and Regions in Documents | William Smith and Toby Pillatt | Text & Document Recognition
P1.48 | 5017 | TACTFUL: A framework for Targeted Active Learning for Document Analysis | Venkatapathy Subramanian, Sagar Poudel, Ganesh Ramakrishnan and Parag Chaudhuri | Text & Document Recognition
P1.49 | 6512 | Evaluating Adversarial Robustness on Document Image Classification | Timothée Fronteau, Arnaud Paran and Aymen Shabou | Text & Document Recognition
P1.50 | 7080 | Layout Analysis of Historical Document Images using a Light Fully Convolutional Networks | Najoua Rahal, Lars Vögtlin and Rolf Ingold | Text & Document Recognition
P1.51 | 8595 | Detecting Text on Historical Maps by Selecting Best Candidates of Deep Neural Networks Output | Gerasimos Matidis, Basilis Gatos, Anastasios Kesidis and Panagiotis Kaddas | Text & Document Recognition
P1.52 | inv-2 | ICDAR 2023 Competition on Video Text Reading for Dense and Small Text | Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas and Xiang Bai | Competition
P1.53 | inv-10 | ICDAR 2023 Competition on Born Digital Video Text Question Answering | Zhibo Yang, Xiaoge Song, Sibo Song, Tong Lu, Xiang Bai, Cheng-Lin Liu, Fei Huang and Cong Yao | Competition
P1.54 | inv-5 | ICDAR 2023 Competition on Indic Handwriting Text Recognition | Ajoy Mondal and C. V. Jawahar | Competition
P1.55 | inv-11 | ICDAR 2023 Competition on Reading the Seal Title | Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas and Xiang Bai | Competition
P1.56 | inv-16 | ICDAR 2023 Competition on Detecting Tampered Text in Images | Dongliang Luo, Yu Zhou, Rui Yang, Yuliang Liu, Xianjin Liu, Jishen Zeng, Enming Zhang, Biao Yang, Ziming Huang, Lianwen Jin and Xiang Bai | Competition
Oral Session 9 – DAR 2: Camera Images and Scene Text

Chair: Seiichi Uchida
Tuesday, August 22, 2023 – 16:00-18:00 PDT

O9.1 | 2427 | ViSA: Visual and Semantic Alignment for Robust Scene Text Recognition | Zhenru Pan, Zhilong Ji, Xiao Liu, Jinfeng Bai and Cheng-Lin Liu
O9.2 | J144 | An Accurate Approach to Real-time Machine Readable Zone Detection with Mobile Devices (IJDAR Track) | Alexander Gayer, Daria Ershova, Vladimir V. Arlazarov
O9.3 | 9711 | DQ-DETR: Dynamic Queries Enhanced Detection Transformer for Arbitrary Shape Text Detection | Chixiang Ma, Lei Sun, Jiawei Wang and Qiang Huo
O9.4 | 1462 | Decoupling Visual-Semantic Features Learning with Dual Masked Autoencoder for Self-Supervised Scene Text Recognition | Zhi Qiao, Zhilong Ji, Ye Yuan and Jinfeng Bai
O9.5 | 7705 | Re-thinking Text Clustering for Images with Text | Shwet Kamal Mishra, Soham Joshi and Viswanath Gopalakrishnan
O9.6 | 2326 | Scene Table Structure Recognition with Segmentation and Key Point Collaboration | Li Zhuoming, Peng Fan, Xue Yang, Ni Hao and Jin Lianwen
Oral Session 10 – Handwriting 3: Document Synthesis

Chair: Robert Sablatnig
Tuesday, August 22, 2023 – 16:00-18:00 PDT

O10.1 | J150 | Historical Document Image Analysis using Controlled Data for Pre-Training (IJDAR Track) | Najoua Rahal, Lars Vögtlin, Rolf Ingold
O10.2 | 1580 | Handwritten Text Generation with Character-specific Encoding for Style Imitation | Jan Zdenek and Hideki Nakayama
O10.3 | 3176 | How to Choose Pretrained Handwriting Recognition Models for Single Writer Fine-Tuning | Vittorio Pippi, Silvia Cascianelli, Christopher Kermorvant and Rita Cucchiara
O10.4 | 4838 | TBM-GAN: Synthetic Document Generation with Degraded Background | Arnab Poddar, Soumyadeep Dey, Pratik Jawanpuria, Jayanta Mukhopadhyay and Prabir Kumar Biswas
O10.5 | 1250 | Styled Text-to-Text-Content-Image Generation with Latent Diffusion Models | Konstantina Nikolaidou, George Retsinas, Vincent Christlein, Mathias Seuret, Giorgos Sfikas, Elisa Barney Smith, Hamam Mokayed and Marcus Liwicki
O10.6 | 6936 | Zero-shot Generation of Training Data with Denoising Diffusion Probabilistic Model for Handwritten Chinese Character Recognition | Dongnan Gui, Kai Chen, Haisong Ding and Qiang Huo
Oral Session 11 – Competitions

Chair: Kenny Davila
Wednesday, August 23, 2023 – 09:00-10:20 PDT

O11.1 | inv-13 | ICDAR 2023 CROHME: Competition on Recognition of Handwritten Mathematical Expressions | Yejing Xie, Harold Mouchère, Foteini Simistira Liwicki, Sumit Rakesh, Rajkumar Saini, Masaki Nakagawa, Cuong Tuan Nguyen and Thanh-Nghia Truong
O11.2 | inv-8 | ICDAR 2023 Competition on Hierarchical Text Detection and Recognition | Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii and Michalis Raptis
O11.3 | inv-15 | ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition | George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas and C V Jawahar
O11.4 | inv-4 | ICDAR 2023 Competition on Document UnderstanDing of Everything (DUDE) | Jordy Van Landeghem, Rubèn Tito, Łukasz Borchmann, Michał Pietruszka, Dawid Jurkiewicz, Rafał Powalski, Paweł Józiak, Sanket Biswas, Mickaël Coustaty and Tomasz Stanisławek
Oral Session 12 – Graphics 3: Math Recognition

Chair: Harold Mouchere
Wednesday, August 23, 2023 – 09:00-10:20 PDT

O12.1 | 1641 | Relative position embedding asymmetric siamese network for Offline handwritten mathematical expression recognition | Chunyi Wang, Wei Hu, Xiaqing Rao, Runqi Luohu, Ning Bi and Tan Jun
O12.2 | 2017 | EDSL: An Encoder-Decoder Architecture with Symbol-Level Features for Printed Mathematical Expression Recognition | Yingnan Fu, Tingting Liu, Ming Gao and Aoying Zhou
O12.3 | 4261 | Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition | Zhuang Liu, Ye Yuan, Zhilong Ji, Jinfeng Bai and Xiang Bai
O12.4 | 8247 | An Encoder-Decoder Method with Position-Aware for Printed Mathematical Expression Recognition | Quan Hong, Jun Long and Liu Yang
Oral Session 13 – DAR 3: Text and Document Recognition

Chair: Mickael Coustaty
Wednesday, August 23, 2023 – 10:50-12:30 PDT

O13.1 | 9627 | A hybrid model for multilingual OCR | David Etter, Cameron Carpenter and Nolan King
O13.2 | 1261 | Multi-Teacher Knowledge Distillation for End-to-End Text Image Machine Translation | Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou and Chengqing Zong
O13.3 | J162 | Printed Ottoman Text Recognition Using Synthetic Data and Data Augmentation (IJDAR Track) | Esma F. Bilgin Tasdemir
O13.4 | J149 | Classification of Incunable Glyphs and Out-of-distribution Detection with Joint Energy-based Models (IJDAR Track) | Florian Kordon, Nikolaus Weichselbaumer, Randall Herz, Stephen Mossman, Edward Potten, Mathias Seuret, Martin Mayr, Vincent Christlein
O13.5 | J154 | Analyzing the Potential of Active Learning for Document Image Classification (IJDAR Track) | Saifullah Saifullah, Stefan Agne, Andreas Dengel, Sheraz Ahmed
Oral Session 14 – Applications 2: Document Analysis Systems

Chair: Faisal Shafait
Wednesday, August 23, 2023 – 10:50-12:30 PDT

O14.1 | 7163 | Multimodal Scoring Model for Handwritten Chinese Essay | Tonghua Su, Jifeng Wang, Hongming You and Zhongjie Wang
O14.2 | 3025 | FCN-Boosted Historical Map Segmentation with Little Training Data | Josef Baloun, Ladislav Lenc and Pavel Král
O14.3 | 1830 | MemeGraphs: Linking Memes to Knowledge Graphs | Vasiliki Kougia, Simon Fetzel, Thomas Kirchmair, Erion Çano, Sina Baharlou, Sahand Sharifzadeh and Benjamin Roth
O14.4 | J159 | Scheme for Palimpsests Reconstruction Using Synthesized Dataset (IJDAR Track) | Boraq Madi, Reem Alaasam, Raed Shammas and Jihad El-Sana
O14.5 | 9420 | Context Aware Document Binarization and Its Application to Information Extraction from Structured Documents | Ján Koloda and Jue Wang
Poster Session 2

Wednesday, August 23, 2023 – 14:30-16:00 PDT

P2.1 | 1419 | Analyzing the Impact of Tokenization on Multilingual Epidemic Surveillance in Low-resource Languages | Stephen Mutuvi, Emanuela Boros, Antoine Doucet, Adam Jatowt, Gaël Lejeune and Moses Odeo | D-NLP
P2.2 | 2100 | Analysing Textual Information from Financial Statements for Default Prediction | Chinesh Doshi, Himani Shrotriya, Rohit Bhiogade, Himanshu Sharad Bhatt and Abhishek Jha | D-NLP
P2.3 | 3165 | An Iterative Graph Learning Convolution Network for Key Information Extraction Based on the Document Inductive Bias | Jiyao Deng, Yi Zhang, Xinpeng Zhang, Zhi Tang and Liangcai Gao | D-NLP
P2.4 | 4804 | A Benchmark of Nested Named Entity Recognition Approaches in Historical Structured Documents | Solenn Tual, Nathalie Abadie, Bertrand Duménieu, Joseph Chazalon and Edwin Carlinet | D-NLP
P2.5 | 5441 | LayoutGCN: A Lightweight Architecture for Visually Rich Document Understanding | Dengliang Shi, Siliang Liu, Jintao Du and Huijia Zhu | D-NLP
P2.6 | 6475 | Detecting Forged Receipts with Domain-specific Ontology-based Entities & Relations | Beatriz Martínez Tornés, Emanuela Boros, Petra Gomez-Krämer, Antoine Doucet and Jean-Marc Ogier | D-NLP
P2.7 | 7131 | A Character-level Document Key Information Extraction Method with Contrastive Learning | Xinpeng Zhang, Liangcai Gao and Jiyao Deng | D-NLP
P2.8 | 9403 | Semantic triple-assisted learning for question answering passage re-ranking | Dinesh Nagumothu, Bahadorreza Ofoghi and Peter Eklund | D-NLP
P2.9 | 9981 | Information Redundancy and Biases in Public Document Information Extraction Benchmarks | Seif Edinne Laatiri, Pirashanth Ratnamogan, Joël Tang, Laurent Lam, William Vanhuffel and Fabien Caspani | D-NLP
P2.10 | 3928 | Ambigram Generation by A Diffusion Model | Takahiro Shirakawa and Seiichi Uchida | Data and Synthesis
P2.11 | 5155 | CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data | Michał Turski, Tomasz Stanisławek, Karol Kaczmarek, Paweł Dyda and Filip Graliński | Data and Synthesis
P2.12 | 6077 | Augraphy: A Data Augmentation Library for Document Images | Alexander Groleau, Kok Wei Chee, Stefan Larson, Samay Maini and Jonathan Boarman | Data and Synthesis
P2.13 | 7774 | SIMARA: a database for key-value information extraction from full-page handwritten documents | Solène Tarride, Mélodie Boillet, Jean-François Moufflet and Christopher Kermorvant | Data and Synthesis
P2.14 | 9867 | Receipt Dataset for Document Forgery Detection | Beatriz Martínez Tornés, Théo Taburet, Emanuela Boros, Kais Rouis, Petra Gomez-Krämer, Nicolas Sidere, Antoine Doucet and Vincent Poulain d’Andecy | Data and Synthesis
P2.15 | 200 | MIDV-Holo: a dataset for ID document hologram detection in a video stream | Leisan Koliaskina, Ekaterina Emelianova, Daniil Tropin, Vladimir Popov, Konstantin Bulatov, Dmitry Nikolaev and Vladimir V. Arlazarov | Data and Synthesis
P2.16 | 2013 | LineFormer: Line Chart Data Extraction using Instance Segmentation | Jay Lal, Aditya Mitkari, Mahesh Bhosale and David Doermann | Graphics
P2.17 | 2566 | PyramidTabNet: Transformer based Table Recognition in Image-based Documents | Muhammad Umer, Ahmed Mohsin, Adnan Ul-Hasan and Faisal Shafait | Graphics
P2.18 | 5671 | Linear Object Detection in Document Images using Multiple Object Tracking | Philippe Bernet, Joseph Chazalon, Edwin Carlinet, Alexandre Bourquelot and Elodie Puybareau | Graphics
P2.19 | 6754 | Contour Completion by Transformers and Its Application to Vector Font Data | Yusuke Nagata, Brian Kenji Iwana and Seiichi Uchida | Graphics
P2.20 | 9308 | Formerge: Recover spanning cells in complex table structure using transformer network | Nam Quan Nguyen, Anh Duy Le, Anh Khoa Lu, Xuan Toan Mai and Tuan Anh Tran | Graphics
P2.21 | 125 | A Shallow Graph Neural Network with Innovative Node Updating for Online Handwritten Stroke Classification | Yan-Rong Wang, Da-Han Wang, Xiao-Long Yun, Yan-Ming Zhang, Fei Yin and Shunzhi Zhu | Handwriting
P2.22 | 590 | Improved Learning for Online Handwritten Chinese Text Recognition with Convolutional Prototype Network | Yi Chen, Heng Zhang and Cheng-Lin Liu | Handwriting
P2.23 | 1118 | Modeling Cross-layer Interaction for Chinese Calligraphy Style Classification | Zhigang Li, Li Liu, Taorong Qiu, Yue Lu and Ching Y. Suen | Handwriting
P2.24 | 1887 | OCR Language Models with Custom Vocabularies | Peter Garst, Yasuhisa Fujii and Reeve Ingle | Handwriting
P2.25 | 2745 | Sampling and Ranking for Digital Ink Generation on a tight computational budget | Andrii Maksai, Andrei Afonin, Aleksandr Timofeev and Claudiu Musat | Handwriting
P2.26 | 4033 | Decoupled Learning for Long-Tailed Oracle Character Recognition | Jing Li, Bin Dong, Qiu-Feng Wang, Lei Ding, Rui Zhang and Kaizhu Huang | Handwriting
P2.27 | 4287 | Shared-Operation Hypercomplex Networks for Handwritten Text Recognition | Giorgos Sfikas, George Retsinas, Panagiotis Dimitrakopoulos, Basilis Gatos and Christophoros Nikou | Handwriting
P2.28 | 6036 | Precise Segmentation for Children Handwriting Analysis by Combining Multiple Deep Models with Online Knowledge | Simon Corbillé, Éric Anquetil and Élisa Fromont | Handwriting
P2.29 | 7310 | Finetuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition | Jan Kohút and Michal Hradiš | Handwriting
P2.30 | 7663 | AFFGANwriting: A handwriting image generation method based on multi-feature fusion | Heng Wang, Yiming Wang and Hongxi Wei | Handwriting
P2.31 | 8156 | SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion between Explicit and Implicit Segmentation | Jiarong Huang, Dezhi Peng, Hongliang Li, Hao Ni and Lianwen Jin | Handwriting
P2.32 | 8727 | A System for Processing and Recognition of Greek Byzantine and Post-Byzantine Documents | Panagiotis Kaddas, Konstantinos Palaiologos, Basilis Gatos, Vassilis Katsouros and Katerina Christopoulou | Handwriting
P2.33 | 9669 | Historical document image segmentation combining deep learning and Gabor features | Maroua Mehri, Akrem Sellami and Salvatore Tabbone | Handwriting
P2.34 | 9897 | Content-Aware Urdu Handwriting Generation | Zeeshan Memon, Adnan Ul-Hasan and Faisal Shafait | Handwriting
P2.35 | 1429 | Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation | Renshen Wang, Yasuhisa Fujii and Alessandro Bissacco | Scene Text
P2.36 | 2111 | Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution | Jianfeng Kuang, Wei Hua, Dingkang Liang, Mingkun Yang, Deqiang Jiang, Bo Ren and Xiang Bai | Scene Text
P2.37 | 2850 | E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation | Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou and Chengqing Zong | Scene Text
P2.38 | 3409 | Accelerating Transformer-Based Scene Text Detection and Recognition via Token Pruning | Sergi Garcia-Bordils, Dimosthenis Karatzas and Marçal Rusiñol | Scene Text
P2.39 | 5000 | Reading Between the Lanes: Text VideoQA on the Road | George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas and C.V. Jawahar | Scene Text
P2.40 | 864 | Transductive Learning for Near-Duplicate Image Detection in Scanned Photo Collections | Lluis Gomez, Francesc Net, Pep Casals-Puig and Marc Folia | Text & Document Recognition
P2.41 | 2194 | EEBO-Verse: Sifting for Poetry in Large Early Modern Corpora using Visual Features | Danlu Chen, Nan Jiang and Taylor Berg-Kirkpatrick | Text & Document Recognition
P2.42 | 2627 | Gaussian Kernels based Network for Multiple License Plate Number Detection in Day-Night Images | Soumi Das, Shivakumara Palaiahnakote, Umapada Pal and Raghavendra Ramachandra | Text & Document Recognition
P2.43 | 2771 | Unraveling confidence: examining confidence scores as proxy for OCR quality | Mirjam Cuper, Corine van Dongen and Tineke Koster | Text & Document Recognition
P2.44 | 3792 | FTDNet: Joint Semantic Learning for Scene Text Detection in Adverse Weather Conditions | Jiakun Tian, Gang Zhou, Yangxin Liu, En Deng and Zhenhong Jia | Text & Document Recognition
P2.45 | 4178 | MUGS: A Multiple Granularity Semi-Supervised Method for Text Recognition | Qi Song, Qianyi Jiang, Wang Lei, Lingling Zhao and Rui Zhang | Text & Document Recognition
P2.46 | 4319 | ColDBin: Cold Diffusion for Document Image Binarization | Saifullah Saifullah, Stefan Agne, Andreas Dengel and Sheraz Ahmed | Text & Document Recognition
P2.47 | 4548 | SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition | Junyi Zhang, Chang Liu and Chun Yang | Text & Document Recognition
P2.48 | 5951 | End-to-end Multi-line License Plate Recognition with Cascaded Perception | Song-Lu Chen, Qi Liu, Feng Chen and Xu-Cheng Yin | Text & Document Recognition
P2.49 | 6516 | UTRNet: High-Resolution Urdu Text Recognition In Printed Documents | Abdur Rahman, Chetan Arora and Arjun Ghosh | Text & Document Recognition
P2.50 | 7319 | Combining OCR Models for Reading Early Modern Books | Mathias Seuret, Janne van der Loop, Nikolaus Weichselbaumer, Martin Mayr, Janina Molnar, Tatjana Hass and Vincent Christlein | Text & Document Recognition
P2.51 | inv-6 | ICDAR 2023 Competition on Visual Question Answering on Business Document Images | Sachin Raja, Ajoy Mondal and C. V. Jawahar | Competition
P2.52 | inv-7 | ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents | Christoph Auer, Ahmed Nassar, Maksym Lysak, Michele Dolfi, Nikolaos Livathinos and Peter Staar | Competition
P2.53 | inv-12 | ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images | Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang and Xiang Bai | Competition
P2.54 | inv-9 | ICDAR 2023 Competition on Detection and Recognition of Greek Letters on Papyri | Mathias Seuret, Isabelle Marthot-Santaniello, Stephen A. White, Olga Serbaeva Saraogi, Selaudin Agolli, Guillaume Carrière, Dalia Rodriguez-Salas and Vincent Christlein | Competition
P2.55 | inv-14 | ICDAR 2023 Competition on Recognition of Multi-line Handwritten Mathematical Expressions | Chenyang Gao, Yuliang Liu, Shiyu Yao, Jinfeng Bai, Xiang Bai, Lianwen Jin and Cheng-Lin Liu | Competition
P2.56 | N/A | ICDAR 2023 Competition on Document Information Localization and Extraction | Stepan Simsa, Milan Sulc, Matyas Skalicky, Yash Patel and Ahmed Hamdi | Competition
Doctoral Consortium

Tuesday, August 22, 2023 – 14:30-16:00 PDT
Wednesday, August 23, 2023 – 14:30-16:00 PDT

DC.1 | Computer Vision Techniques for Handwritten Optical Music Recognition | Pau Torras
DC.2 | Graph based deep learning research for recognition of on-line handwritten mathematical expression | Yejing Xie
DC.3 | Strokes Trajectory Recovery for Unconstrained Handwritten Documents with Automatic Evaluation | Sidra Hanif
DC.4 | Enabling Deep Document Image Analysis with Generative Models | Konstantina Nikolaidou
DC.5 | Enhancing Information Extraction in Business Documents through Line-Level Analysis and Automation | Eliott Thomas
DC.6 | Writer Retrieval for Historical Documents | Marco Peer
DC.7 | HTR for distant reading of medieval charters | Nicolas Renet
DC.8 | Line-of-Sight Graph Attention and Graph-based Task Interaction (LGATI) for Visual Parsing of Math Formulas and Chemical Diagrams | Ayush Kumar Shah