Machine Learning based Classification of Online News Data for Disaster Management

Cited by: 2
Authors
Gopal, Lakshmi S. [1]
Prabha, Rekha [1]
Pullarkatt, Divya [1]
Ramesh, Maneesha Vinodini [1]
Affiliations
[1] Amrita Vishwa Vidyapeetham, Center for Wireless Networks & Applications (WNA), Amritapuri, India
Funding
Natural Environment Research Council (UK);
Keywords
Web Crawling; Hazards; Supervised Learning; Text Classification;
DOI
10.1109/GHTC46280.2020.9342921
Chinese Library Classification (CLC) number
T [Industrial Technology];
Subject classification code
08;
Abstract
The exponential escalation of disaster losses in our country has raised awareness that disaster risks are likely increasing. According to statistics, India has faced 371 natural hazards over the past few decades, with severe casualties and infrastructural, agricultural, and economic damage recorded [1]. Credible, real-time data such as news content are freely accessible on legitimate websites, and their analysis may help administer hazard emergencies, preparedness, and relief efficiently. On these grounds, a data scraping approach is proposed that gathers hazard-relevant news stories from the web by building crawler software and incorporates machine learning approaches to extract insightful information. The developed crawler visits news reporting web pages and extracts news stories related to hazards. News articles are often unstructured, as they include less newsworthy content such as authors' opinions, interview responses, and past studies. Hence, supervised learning based text classification is performed to separate newsworthy content from the rest of each news article, and approximately 70 percent accuracy was achieved.
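The two stages described in the abstract (collecting hazard-related text from news pages, then classifying sentences as newsworthy or not) can be illustrated with a minimal sketch. Everything specific here is an assumption: the abstract does not name the crawler's matching rules, the classifier, or the labels, so the keyword list `HAZARD_TERMS`, the Naive Bayes model, the label names, and the toy training sentences are hypothetical stand-ins rather than the authors' method.

```python
import math
import re
from collections import Counter, defaultdict
from html.parser import HTMLParser

# Hypothetical keyword stems; the abstract does not list the crawler's terms.
HAZARD_TERMS = ("flood", "landslide", "earthquake", "cyclone", "drought")


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


class ParagraphExtractor(HTMLParser):
    """Collects the text found inside each <p> element of a page."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []
        self._in_p = False

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._in_p = True
            self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == "p":
            self._in_p = False

    def handle_data(self, data):
        if self._in_p:
            self.paragraphs[-1] += data


def hazard_paragraphs(html):
    """Return paragraphs containing a token that starts with a hazard stem."""
    parser = ParagraphExtractor()
    parser.feed(html)
    return [
        p.strip()
        for p in parser.paragraphs
        if any(tok.startswith(HAZARD_TERMS) for tok in tokenize(p))
    ]


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (an assumed model)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total = sum(self.class_counts.values())
        scores = {}
        for label, n in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)
            for word in tokenize(text):
                if word in self.vocab:  # ignore words never seen in training
                    score += math.log((counts[word] + 1) / denom)
            scores[label] = score
        return max(scores, key=scores.get)


# Toy, hand-written training sentences (not data from the paper).
TRAIN = [
    ("Floods killed twelve people and damaged forty houses in the district", "newsworthy"),
    ("A landslide blocked the highway and rescue teams evacuated residents", "newsworthy"),
    ("The cyclone destroyed crops and left thousands without power", "newsworthy"),
    ("In my opinion the authorities should have planned better", "other"),
    ("Asked about the response, the minister said he was satisfied", "other"),
    ("A previous study suggested that rainfall patterns may change", "other"),
]

model = NaiveBayes().fit([t for t, _ in TRAIN], [label for _, label in TRAIN])
page = "<p>Floods hit the coastal town.</p><p>Stock markets rose today.</p>"
print(hazard_paragraphs(page))
print(model.predict("The earthquake damaged houses and killed two people"))
```

With only six training sentences the classifier is purely illustrative; a real pipeline would need a labeled corpus and a held-out evaluation set to measure accuracy of the kind the abstract reports.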
Pages: 8