Construction regulatory document digitalization with layout knowledge-informed object detection and semantic text recognition
Citations

WEB OF SCIENCE

5
Citations

SCOPUS

9

초록

Construction documents, containing extensive project information, are often stored and shared in unstructured paper formats, leading to inefficiencies in retrieval and transfer among stakeholders. There has been a pressing need for digitalizing construction documents by converting Portable Document Format documents into machinereadable, structured texts. However, current optical character recognition technologies struggle with complex layouts commonly found in construction project documents. To address this issue, we propose a construction document digitalization approach integrated with layout knowledge-informed object detection and semantic text recognition, improving recognition accuracy across various layouts and preserving the structural integrity of texts. Results show that our approach can reduce the average word error rate by 5.6 %p with the assistance of layout knowledge and achieve a structural similarity of 78.8 %, while achieving 87.4 % mAP@50 for layout analysis. These findings highlight the positive impacts of layout knowledge on digitalizing construction documents and underscore the practical viability of our approach.

키워드

Construction documentsDigitalizationLayout knowledgeOptical Character Recognition (OCR)Text recognition
제목
Construction regulatory document digitalization with layout knowledge-informed object detection and semantic text recognition
저자
Wang, ShuyiMoon, SeonghyeonFu, YuguangKim, Jinwoo
DOI
10.1016/j.aei.2025.103278
발행일
2025-05
유형
Article
저널명
Advanced Engineering Informatics
65