Human-Machine Information Extraction Simulator for Biological Collections

Authors
Alzuru, Icaro [1]
Malladi, Aditi [1]
Matsunaga, Andrea [2]
Tsugawa, Mauricio [2]
Fortes, Jose A. B. [2]
Affiliations
[1] Univ Florida, CISE Dept, Gainesville, FL 32611 USA
[2] Univ Florida, ACIS Lab, Gainesville, FL USA
Funding
US National Science Foundation
Keywords
Information extraction; simulator; human-machine; human-in-the-loop; crowdsourcing; optical character recognition; natural language processing
DOI
Not available
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the last decade, institutions from around the world have implemented initiatives for digitizing biological collections (biocollections) and sharing their information online. The transcription of the metadata from photographs of specimens' labels is performed through human-centered approaches (e.g., crowdsourcing) because fully automated Information Extraction (IE) methods still generate a significant number of errors. The integration of human and machine tasks has been proposed to accelerate the IE from the billions of specimens waiting to be digitized. Nevertheless, to conduct research and try new techniques, IE practitioners need to prepare sets of images, set up crowdsourcing experiments, recruit volunteers, process the transcriptions, generate ground truth values, program automated methods, etc. These research resources and processes require time and effort to be developed and architected into a functional system. In this paper, we present a simulator intended to accelerate experimentation with workflows for extracting Darwin Core (DC) terms from images of specimens. The so-called HuMaIN Simulator includes the engine, the human-machine IE workflows for three DC terms, the code of the automated IE methods, crowdsourced and ground truth transcriptions of the DC terms of three biocollections, and several experiments that exemplify its potential use. The simulator adds human-in-the-loop capabilities for iterative IE and for research on optimal methods. Its practical design permits the quick definition, customization, and implementation of experimental IE scenarios.
Pages: 4565-4572
Number of pages: 8