Developing computer vision and machine learning strategies to unlock government-created records

被引:0
|
作者
Jansen, Greg [1 ]
Marciano, Richard [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
关键词
Computer vision; Machine learning; Artificial intelligence; 1950 US Census records; Sacramento; WWII Japanese American incarceration;
D O I
10.1007/s00146-025-02231-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper outlines the development of a proof-of-concept workflow using machine learning and computer vision techniques to unlock the data within digitized handwritten US Census forms from the 1950s. The 1950s US Census includes over 6.5 million page images and was only recently made available to the public on April 1, 2022, following a 72-year access restriction period. Our project uses computational treatments to assist researchers in their efforts to recover and preserve the history of the erased Sacramento Japantown. Sacramento once housed the fourth largest Japantown in the United States before experiencing WWII Japanese American Incarceration and the 1950s US Government program of urban renewal. The goal is to augment a researcher's work in selecting a subset of Census pages for further transcription and analysis. We demonstrate a workflow for extracting demographic information using computer vision for image segmentation, and machine learning for handwritten character recognition. The workflow consists of a computational filtering process for Census records and a user interface for page review. These computational techniques are suitable for other cities, states, and communities, and demonstrate new strategies to unlock vital demographic information. The approach highlights the potential benefits of computational techniques for the analysis of form-based historical records of the twentieth century that can have an impact on social justice.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Developing a Machine Learning Tool for Dynamic Cancer Treatment Strategies
    Zeng, Jiaming
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13742 - 13743
  • [42] Data Driven Feature Selection for Machine Learning Algorithms in Computer Vision
    Zhang, Fan
    Li, Wei
    Zhang, Yifan
    Feng, Zhiyong
    IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (06): : 4262 - 4272
  • [43] Goat Leather Quality Classification Using Computer Vision and Machine Learning
    Pereira, Renato F.
    Medeiros, Claudio M. S.
    Reboucas Filho, Pedro P.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [44] Computer Vision and Machine Learning based approaches for Food Security: A Review
    Sood, Shivani
    Singh, Harjeet
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) : 27973 - 27999
  • [45] Realtime Indoor Workout Analysis Using Machine Learning & Computer Vision
    Nagarkoti, Amit
    Teotia, Revant
    Mahale, Amith K.
    Das, Pankaj K.
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 1440 - 1443
  • [46] Detection of slight variations in combustion conditions with machine learning and computer vision
    Compais, Pedro
    Arroyo, Jorge
    Castan-Lascorz, Miguel Angel
    Barrio, Jorge
    Gil, Antonia
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [47] Computer Vision, Machine Learning, and the Promise of Phenomics in Ecology and Evolutionary Biology
    Lurig, Moritz D.
    Donoughe, Seth
    Svensson, Erik I.
    Porto, Arthur
    Tsuboi, Masahito
    FRONTIERS IN ECOLOGY AND EVOLUTION, 2021, 9
  • [48] Adversarial machine learning for cybersecurity and computer vision: Current developments and challenges
    Xi, Bowei
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2020, 12 (05):
  • [49] A Detective and Corrective Exercise Assistant using Computer Vision and Machine Learning
    Grewe, Lynne
    Pham, Dung Nu Thanh
    Jain, Dikshant Pravin
    Mahajan, Ankush
    Shahshahani, Allen
    SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXXI, 2022, 12122
  • [50] Computer vision and machine learning applied in the mushroom industry: A critical review
    Yin, Hua
    Yi, Wenlong
    Hu, Dianming
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 198