Periodic Update and Automatic Extraction of Web Data for Creating a Google Earth Based Tool

被引:0
|
作者
Abidin, T. F. [1 ]
Subianto, M. [1 ]
Gani, T. A. [2 ]
Ferdhiana, R. [3 ]
机构
[1] Syiah Kuala Univ, Fac Math & Nat Sci, Dept Informat, Darussalam, Banda Aceh, Indonesia
[2] Syiah Kuala Univ, Fac Engn, Dept Elect Engn, Darussalam, Banda Aceh, Indonesia
[3] Syiah Kuala Univ, Fac Math & Nat Sci, Dept Stat, Darussalam, Banda Aceh, Indonesia
关键词
Google Earth; monitoring system; periodic update and automatic extraction;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A lot of tropical disease cases that occurred in Indonesia are reported online in Indonesian news portals. Online news portals are now becoming great sources of information because online news articles are updated frequently. A rule-based, combined with machine learning algorithm, to identify the location of the cases has been developed. In this paper, a complete flow to routinely search, crawl, clean, classify, extract, and integrate the extracted entities into Google Earth is presented. The algorithm is started by searching for Indonesian news articles using a set of selected queries and Google Site Search API, and then crawling them. After the articles are crawled, they are cleaned and classified. The articles that discuss about tropical disease cases (classified as positive) are further examined to extract the locution of the incidence and to determine the sentences containing the date of occurrence and the number of casualties. The extracted entities are then stored in a relational database and annotated in an XML keyhole markup language notation to create a geographic visualization in Google Earth. The evaluation shows that it takes approximately 6 minutes to search, crawl, clean, classify, extract, and annotate the extracted entities into an XML keyhole markup language notation from 5 Web articles. In other words, it takes about 72.40 seconds to process a new page.
引用
收藏
页码:293 / 296
页数:4
相关论文
共 50 条
  • [1] Automatic extraction of aquaculture ponds based on Google Earth Engine
    Xia, Zilong
    Guo, Xiaona
    Chen, Ruishan
    [J]. OCEAN & COASTAL MANAGEMENT, 2020, 198
  • [2] AN OVERVIEW OF THE WEB-BASED GOOGLE EARTH COINCIDENT IMAGING TOOL
    Chander, G.
    Killough, B.
    Gowda, S.
    [J]. 2010 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2010, : 1679 - 1682
  • [3] Development of a web-based hydrologic education tool using Google Earth resources
    Department of Civil Engineering, University of Louisiana at Lafayette, P.O. Box 42991, Lafayette, LA 70504, United States
    不详
    [J]. Spec. Pap. Geol. Soc. Am., 1600, (431-439):
  • [4] Google Trends Extraction Tool for Google Trends Extended for Health data
    Raubenheimer, Jacques Eugene
    [J]. SOFTWARE IMPACTS, 2021, 8
  • [5] Automatic Extraction of Complex Web Data
    Zhang, Ming
    Zhou, Ying
    Patrick, Jon
    [J]. PACIFIC ASIA CONFERENCE ON INFORMATION SYSTEMS 2006, SECTIONS 1-8, 2006, : 1436 - 1449
  • [6] PFIME: Parallel automatic deep web data extraction based on hadoop
    Feng, Yong
    Jia, Dongfeng
    Wang, Huijuan
    [J]. Journal of Computational Information Systems, 2014, 10 (09): : 3863 - 3870
  • [7] Automatic Data Extraction from Lists in Web Pages Based on XML
    Xin, Zhou
    Hao, Wang
    [J]. ADVANCED TECHNOLOGY IN TEACHING - PROCEEDINGS OF THE 2009 3RD INTERNATIONAL CONFERENCE ON TEACHING AND COMPUTATIONAL SCIENCE (WTCS 2009), VOL 2: EDUCATION, PSYCHOLOGY AND COMPUTER SCIENCE, 2012, 117 : 915 - 921
  • [8] The Research of automatic extraction dynamic web data
    Qu Jubao
    [J]. 2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2009, : 143 - 146
  • [9] On the automatic extraction of data from the hidden web
    Liddle, SW
    Yau, SH
    Embley, DW
    [J]. CONCEPTUAL MODELING FOR NEW INFORMATION SYSTEMS TECHNOLOGIES, 2002, 2465 : 212 - 226
  • [10] Quantifying shoreline dynamics in the Indian Sundarban delta with Google Earth Engine (GEE)-based automatic extraction approach
    Santra, Manali
    Dwivedi, Chandra Shekhar
    Pandey, Arvind Chandra
    [J]. TROPICAL ECOLOGY, 2024, 65 (03) : 426 - 442