Layout Analysis of Historic Architectural Program Documents

被引:0
|
作者
Oliaee, Amir Hossein [1 ]
Tripp, Andrew R. [1 ]
机构
[1] Texas A&M Univ, Dept Architecture, College Stn, TX 77843 USA
来源
PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023 | 2023年
关键词
Architectural Programs; Archives; Computer Vision; Data Mining; Layout Analysis; Object Detection;
D O I
10.1145/3573128.3609339
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce and make publicly available the CRS Visual Dataset, a new dataset consisting of 7,029 pages of human-annotated and validated scanned archival documents from the field of 20th-century architectural programming; and ArcLayNet, a fine-tuned machine learning model based on the YOLOv6-S object detection architecture. Architectural programming is an essential professional service in the Architecture, Engineering, Construction, and Operations (AECO) Industry, and the documents it produces are powerful instruments of this service. The documents in this dataset are the product of a creative process; they exhibit a variety of sizes, orientations, arrangements, and modes of content, and are underrepresented in current datasets. This paper describes the dataset and narrates an iterative process of quality control in which several deficiencies were identified and addressed to improve the performance of the model. In this process, our key performance indicators, mAP@0.5 and mAP@0.5:0.95, both improved by approximately 10%.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Page Layout Analysis System for Unconstrained Historic Documents
    Kodym, Oldrich
    Hradis, Michal
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 492 - 506
  • [2] Interactive Layout Analysis and Transcription Systems for Historic Handwritten Documents
    Ramos-Terrades, Oriol
    Tose, Alejandro H.
    Serrano, Nicolas
    Romero, Veronica
    Vidal, Enrique
    Juan, Alfons
    DOCENG2010: PROCEEDINGS OF THE 2010 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2010, : 219 - 222
  • [3] Layout analysis of complex documents
    Watanabe, T
    Sobue, T
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS: APPLICATIONS, ROBOTICS SYSTEMS AND ARCHITECTURES, 2000, : 447 - 450
  • [4] Layout analysis of historical Tibetan documents
    Liu, Huaming
    Bi, Xuehui
    Wangt, Weilan
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019, : 74 - 78
  • [5] MAJOR BREAKTHROUGH IN THE ANALYSIS OF HISTORIC DOCUMENTS
    THOMPSON, AR
    TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 1983, 2 (04) : R5 - R6
  • [6] ARCHITECTURAL AND STRUCTURAL ANALYSIS FOR INTERVENTION IN HISTORIC BUILDINGS
    Coimbra Veloso, Luciana Bracarense
    de Araujo, Ernani Carlos
    ICEM15: 15TH INTERNATIONAL CONFERENCE ON EXPERIMENTAL MECHANICS, 2012,
  • [7] Extraction, layout analysis and classification of diagrams in PDF documents
    Futrelle, RP
    Shao, MY
    Cieslik, C
    Grimes, AE
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1007 - 1014
  • [8] Comparative Study of Layout Analysis of Tabulated Historical Documents
    Liang, Xusheng
    Cheddad, Abbas
    Hall, Johan
    Big Data Research, 2021, 24
  • [9] Graph-based Layout Analysis for PDF Documents
    Xu, Canhui
    Tang, Zhi
    Tao, Xin
    Li, Yun
    Shi, Cao
    IMAGING AND PRINTING IN A WEB 2.0 WORLD IV, 2013, 8664
  • [10] Page layout analysis and classification for complex scanned documents
    Erkilinc, M. Sezer
    Jaber, Mustafa
    Saber, Eli
    Bauer, Peter
    Depalov, Dejan
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXIV, 2011, 8135