Automatic Quality Control of Transportation Reports Using Statistical Language Processing

被引:3
|
作者
Gerber, Matthew S. [1 ]
Tang, Lu [2 ]
机构
[1] Univ Virginia, Dept Syst & Informat Engn, Charlottesville, VA 22904 USA
[2] Univ Virginia, Dept Stat, Charlottesville, VA 22904 USA
关键词
Natural language processing (NLP); quality control; transportation reports; SEARCH;
D O I
10.1109/TITS.2013.2265892
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The processes of developing, monitoring, and maintaining transportation systems produce large volumes of information. Human fieldworkers are often responsible for gathering this information, and despite their best efforts, they will inevitably introduce errors into the collected data. This is a critical problem since: 1) the collected data are used to justify key infrastructure maintenance and development decisions; and 2) the volume of unstructured information (e. g., plain text) makes manual quality control prohibitively expensive. We introduce a solution to this problem in the example domain of vehicle accident reports. First, we analyzed a sample of accident reports and confirmed the existence of many data entry errors. Second, we developed and evaluated a statistical language processing approach that automatically identifies reports containing data entry errors. We tested a variety of system configurations on real-world data and compared their performance with multiple baseline methods. The best configuration achieved a performance score of 84%, far outperforming the baseline methods. Our results and analyses have quality control implications for any data source that pairs structured text (e. g., coded fields) with unstructured text.
引用
收藏
页码:1681 / 1689
页数:9
相关论文
共 50 条
  • [31] Automatic Assessment of Mathematical Creativity using Natural Language Processing
    Marrone, Rebecca
    Cropley, David H.
    Wang, Z.
    CREATIVITY RESEARCH JOURNAL, 2023, 35 (04) : 661 - 676
  • [32] Automatic Review of Construction Specifications Using Natural Language Processing
    Moon, Seonghyeon
    Lee, Gitaek
    Chi, Seokho
    Oh, Hyunchul
    COMPUTING IN CIVIL ENGINEERING 2019: DATA, SENSING, AND ANALYTICS, 2019, : 401 - 407
  • [33] AUTOMATIC PROCESSING OF SURGICAL BIOPSY REPORTS
    BECKER, H
    BEITRAGE ZUR PATHOLOGIE, 1972, 146 (03): : 301 - &
  • [34] Automatic processing and classification of citizens' reports
    Angiani, Giulio
    Fornacciari, Paolo
    Lombardo, Gianfranco
    Mordonini, Monica
    Pietroni, Umberto
    Tomaiuolo, Michele
    GOODTECHS '18: PROCEEDINGS OF THE 4TH EAI INTERNATIONAL CONFERENCE ON SMART OBJECTS AND TECHNOLOGIES FOR SOCIAL GOOD (GOODTECHS), 2018, : 310 - 311
  • [35] Statistical methods in language processing
    Abney, Steven
    WILEY INTERDISCIPLINARY REVIEWS-COGNITIVE SCIENCE, 2011, 2 (03) : 315 - 322
  • [36] Quality Management of Pulmonary Nodule Radiology Reports Based on Natural Language Processing
    Fei, Xiaolu
    Chen, Pengyu
    Wei, Lan
    Huang, Yue
    Xin, Yi
    Li, Jia
    BIOENGINEERING-BASEL, 2022, 9 (06):
  • [37] Representing information in patient reports using natural language processing and the extensible markup language
    Friedman, C
    Hripcsak, G
    Shagina, L
    Liu, HF
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, 6 (01) : 76 - 87
  • [38] Identification of Long Bone Fractures in Radiology Reports Using Natural Language Processing to support Healthcare Quality Improvement
    Grundmeier, Robert W.
    Masino, Aaron J.
    Casper, T. Charles
    Dean, Jonathan M.
    Bell, Jamie
    Enriquez, Rene
    Deakyne, Sara
    Chamberlain, James M.
    Alpern, Elizabeth R.
    APPLIED CLINICAL INFORMATICS, 2016, 7 (04): : 1051 - 1068
  • [39] Digital image processing techniques for automatic textile quality control
    Bennamoun, M.
    Bodnarova, A.
    Systems Analysis Modelling Simulation, 2003, 43 (11): : 1581 - 1614
  • [40] IMPORTANCE OF STATISTICAL QUALITY-CONTROL IN FOOD-PROCESSING
    KLOOS, C
    JOURNAL OF FOOD PROTECTION, 1987, 50 (10) : 891 - 891