Periodic Update and Automatic Extraction of Web Data for Creating a Google Earth Based Tool

被引:0
|
作者
Abidin, T. F. [1 ]
Subianto, M. [1 ]
Gani, T. A. [2 ]
Ferdhiana, R. [3 ]
机构
[1] Syiah Kuala Univ, Fac Math & Nat Sci, Dept Informat, Darussalam, Banda Aceh, Indonesia
[2] Syiah Kuala Univ, Fac Engn, Dept Elect Engn, Darussalam, Banda Aceh, Indonesia
[3] Syiah Kuala Univ, Fac Math & Nat Sci, Dept Stat, Darussalam, Banda Aceh, Indonesia
关键词
Google Earth; monitoring system; periodic update and automatic extraction;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A lot of tropical disease cases that occurred in Indonesia are reported online in Indonesian news portals. Online news portals are now becoming great sources of information because online news articles are updated frequently. A rule-based, combined with machine learning algorithm, to identify the location of the cases has been developed. In this paper, a complete flow to routinely search, crawl, clean, classify, extract, and integrate the extracted entities into Google Earth is presented. The algorithm is started by searching for Indonesian news articles using a set of selected queries and Google Site Search API, and then crawling them. After the articles are crawled, they are cleaned and classified. The articles that discuss about tropical disease cases (classified as positive) are further examined to extract the locution of the incidence and to determine the sentences containing the date of occurrence and the number of casualties. The extracted entities are then stored in a relational database and annotated in an XML keyhole markup language notation to create a geographic visualization in Google Earth. The evaluation shows that it takes approximately 6 minutes to search, crawl, clean, classify, extract, and annotate the extracted entities into an XML keyhole markup language notation from 5 Web articles. In other words, it takes about 72.40 seconds to process a new page.
引用
收藏
页码:293 / 296
页数:4
相关论文
共 50 条
  • [11] Quantifying shoreline dynamics in the Indian Sundarban delta with Google Earth Engine (GEE)-based automatic extraction approach
    Santra, Manali
    Dwivedi, Chandra Shekhar
    Pandey, Arvind Chandra
    [J]. TROPICAL ECOLOGY, 2024, 65 (03) : 426 - 442
  • [12] Web Data Mining: Validity of Data from Google Earth for Food Retail Evaluation
    de Menezes, Mariana Carvalho
    de Matos, Vanderlei Pascoal
    de Pina, Maria de Fatima
    de Lima Costa, Bruna Vieira
    Mendes, Larissa Loures
    Pessoa, Milene Cristine
    de Souza-Junior, Paulo Roberto Borges
    de Lima Friche, Amelia Augusta
    Caiaffa, Waleska Teixeira
    de Oliveira Cardoso, Leticia
    [J]. JOURNAL OF URBAN HEALTH-BULLETIN OF THE NEW YORK ACADEMY OF MEDICINE, 2021, 98 (02): : 285 - 295
  • [13] Geolokit: An interactive tool for visualising and exploring geoscientific data in Google Earth
    Triantafyllou, Antoine
    Watlet, Arnaud
    Bastin, Christophe
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2017, 62 : 39 - 46
  • [14] Google Earth elevation data extraction and accuracy assessment for transportation applications
    Wang, Yinsong
    Zou, Yajie
    Henrickson, Kristian
    Wang, Yinhai
    Tang, Jinjun
    Park, Byung-Jung
    [J]. PLOS ONE, 2017, 12 (04):
  • [15] Web Data Mining: Validity of Data from Google Earth for Food Retail Evaluation
    Mariana Carvalho de Menezes
    Vanderlei Pascoal de Matos
    Maria de Fátima de Pina
    Bruna Vieira de Lima Costa
    Larissa Loures Mendes
    Milene Cristine Pessoa
    Paulo Roberto Borges de Souza-Junior
    Amélia Augusta de Lima Friche
    Waleska Teixeira Caiaffa
    Letícia de Oliveira Cardoso
    [J]. Journal of Urban Health, 2021, 98 : 285 - 295
  • [16] Research on the Automatic Extraction Method of Web Data Objects Based on Deep Learning
    Peng, Hao
    Li, Qiao
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2020, 26 (03): : 609 - 616
  • [17] AUTOMATIC EXTRACTION OF SURFACE DYNAMICS USING GOOGLE EARTH ENGINE FOR UNDERSTANDING DROUGHT PHENOMENON
    Polat, A. B.
    Akcay, O.
    [J]. 39TH INTERNATIONAL SYMPOSIUM ON REMOTE SENSING OF ENVIRONMENT ISRSE-39 FROM HUMAN NEEDS TO SDGS, VOL. 48-M-1, 2023, : 559 - 564
  • [18] A robust approach of automatic web data record extraction
    School of Computer Science and Technology, Shandong University, Jinan, China
    不详
    [J]. J. Comput. Inf. Syst., 2009, 6 (1757-1766):
  • [19] Automatic generation of wrapper for data extraction from the Web
    Zhang, SZ
    Lu, ZD
    [J]. WEB ENGINEERING, PROCEEDINGS, 2003, 2722 : 390 - 394
  • [20] Automatic Data Extraction from Web Discussion Forums
    Li, Suke
    Tang, Liyong
    Hu, Jianbin
    Chen, Zhong
    [J]. FCST 2009: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY, 2009, : 219 - 225