Construction regulatory document digitalization with layout knowledge-informed object detection and semantic text recognition

Wang, Shuyi; Moon, Seonghyeon; Fu, Yuguang; Kim, Jinwoo

doi:10.1016/j.aei.2025.103278

상세 보기

Construction regulatory document digitalization with layout knowledge-informed object detection and semantic text recognition

Wang, Shuyi;
Moon, Seonghyeon;
Fu, Yuguang;
Kim, Jinwoo

Citations

WEB OF SCIENCE

5

Citations

SCOPUS

9

초록

Construction documents, containing extensive project information, are often stored and shared in unstructured paper formats, leading to inefficiencies in retrieval and transfer among stakeholders. There has been a pressing need for digitalizing construction documents by converting Portable Document Format documents into machinereadable, structured texts. However, current optical character recognition technologies struggle with complex layouts commonly found in construction project documents. To address this issue, we propose a construction document digitalization approach integrated with layout knowledge-informed object detection and semantic text recognition, improving recognition accuracy across various layouts and preserving the structural integrity of texts. Results show that our approach can reduce the average word error rate by 5.6 %p with the assistance of layout knowledge and achieve a structural similarity of 78.8 %, while achieving 87.4 % mAP@50 for layout analysis. These findings highlight the positive impacts of layout knowledge on digitalizing construction documents and underscore the practical viability of our approach.

키워드

Construction documents; Digitalization; Layout knowledge; Optical Character Recognition (OCR); Text recognition

제목: Construction regulatory document digitalization with layout knowledge-informed object detection and semantic text recognition

저자: Wang, Shuyi; Moon, Seonghyeon; Fu, Yuguang; Kim, Jinwoo

DOI: 10.1016/j.aei.2025.103278

발행일: 2025-05

유형: Article

저널명: Advanced Engineering Informatics

권: 65