Harvest - a System for Creating Structured Rate Filing Data from Filing PDFs

被引:0
|
作者
Tekin, Ender [1 ]
You, Qian [2 ]
Conathan, Devin M. [1 ]
Fung, Glenn M. [1 ]
Kneubuehl, Thomas S. [1 ]
机构
[1] Amer Family Mutual Insurance Co SI, Madison, WI 53783 USA
[2] Coupang Corp, Seoul, South Korea
来源
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a machine-learning-guided process that can efficiently extract factor tables from unstructured rate filing documents. Our approach combines multiple deep-learning-based models that work in tandem to create structured representations of tabular data present in unstructured documents such as pdf files. This process combines CNN's to detect tables, language-based models to extract table metadata and conventional computer vision techniques to improve the accuracy of tabular data on the machine-learning side. The extracted tabular data is validated through an intuitive user interface. This process, which we call Harvest, significantly reduces the time needed to extract tabular information from PDF files, enabling analysis of such data at a speed and scale that was previously unattainable.
引用
收藏
页码:12414 / 12422
页数:9
相关论文
共 50 条
  • [41] An expert-based system to predict population survival rate from health data
    Schwacke, Lori H.
    Thomas, Len
    Wells, Randall S.
    Rowles, Teresa K.
    Bossart, Gregory D.
    Townsend Jr, Forrest
    Mazzoil, Marilyn
    Allen, Jason B.
    Balmer, Brian C.
    Barleycorn, Aaron A.
    Barratclough, Ashley
    Burt, Louise
    De Guise, Sylvain
    Fauquier, Deborah
    Gomez, Forrest M.
    Kellar, Nicholas M.
    Schwacke, John H.
    Speakman, Todd R.
    Stolen, Eric D.
    Quigley, Brian M.
    Zolman, Eric S.
    Smith, Cynthia R.
    CONSERVATION BIOLOGY, 2024, 38 (01)
  • [42] Parametric inference from system lifetime data under a proportional hazard rate model
    Ng, Hon Keung Tony
    Navarro, Jorge
    Balakrishnan, Narayanaswamy
    METRIKA, 2012, 75 (03) : 367 - 388
  • [43] Parametric inference from system lifetime data under a proportional hazard rate model
    Hon Keung Tony Ng
    Jorge Navarro
    Narayanaswamy Balakrishnan
    Metrika, 2012, 75 : 367 - 388
  • [44] ON EVALUATION OF FETAL HEART RATE FROM COMPUTERIZED DATA .1. PROGRAMMED SYSTEM FOR RETRIEVAL OF CLINICAL DATA
    SCHMIDT, H
    MORGENSTERN, J
    GEBURTSHILFE UND FRAUENHEILKUNDE, 1973, 33 (12) : 929 - 930
  • [45] Recording system and data fusion algorithm for enhancing the estimation of the respiratory rate from photoplethysmogram
    Cernat, Roxana A.
    Ciorecan, Silvia I.
    Ungureanu, Constantin
    Arends, Johan
    Strungaru, Rodica
    Ungureanu, G. Mihaela
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 5977 - 5980
  • [46] MODELS FOR INTERPRETING DEPOSITION RATE DATA FROM A CLOSED CHEMICAL VAPOR-DEPOSITION SYSTEM
    CARLSSON, JO
    JOURNAL OF THE LESS-COMMON METALS, 1980, 71 (01): : 15 - 32
  • [47] Using Heart Rate Data From Ambulatory Hemodynamic Monitoring System in Heart Failure Management
    Nitecki, Cassandra
    Sauld, Christina Rivera
    Cheema, Omar
    Hastings, T. Edward
    Roberts, Eric
    Thohan, Vinay
    Sulemanjee, Nasir Z.
    JOURNAL OF CARDIAC FAILURE, 2017, 23 (08) : S98 - S98
  • [48] Upper limits on performance from two hop relaying in a high data rate cellular system
    Dinnis, A. K.
    Thompson, J. S.
    2006 INTERNATIONAL ZURICH SEMINAR ON COMMUNICATIONS: ACCESS - TRANSMISSION - NETWORKING, PROCEEDINGS, 2006, : 194 - +
  • [49] A new framework for discovering knowledge from two-dimensional structured data using layout formal graph system
    Uchida, T
    Itokawa, Y
    Shoudai, T
    Miyahara, T
    Nakamura, Y
    ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2000, 1968 : 141 - 155
  • [50] THE HARVEST OF MICROALGAE FROM THE EFFLUENT OF A SEWAGE FED HIGH-RATE STABILIZATION POND BY TILAPIA-NILOTICA .1. DESCRIPTION OF THE SYSTEM AND THE STUDY OF THE HIGH-RATE POND
    EDWARDS, P
    SINCHUMPASAK, OA
    AQUACULTURE, 1981, 23 (1-4) : 83 - 105