A System for Unstructured Data Mining using Dynamic Ensemble Selection

被引:0
|
作者
Calado, Raquel Bezerra [1 ]
Rodriguez Torres, Leandro Sigfredo [2 ]
Maciel, Alexandre M. A. [1 ]
机构
[1] Univ Pernambuco, Recife, PE, Brazil
[2] Kurier Inteligencia Juridica, Recife, PE, Brazil
关键词
Unstructured Data; Text Mining; Dynamic Ensemble Selection; CLASSIFIER SELECTION; COMPETENCE; FRAMEWORK;
D O I
10.1109/smc42975.2020.9282967
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Unstructured data represent as much as 90% of all business-relevant information. In Brazil, the practice of printing official journals dates back to the 19th century. Today more than 200 official journals in circulation, which together accumulate around 1.4 billion publications without textual standard. This work proposes the development of a system for unstructured data mining using a Dynamic Ensemble Selection. JudEasy implements, added in addition to classic text pre-processing methods, a set of twelve DES and a static method for creating categorized textual models for Brazilian of official journals. As results the DES-KL model obtained the highest accuracy rate of 96.81% and exceptional precision of 0.99.
引用
收藏
页码:1988 / 1993
页数:6
相关论文
共 50 条
  • [21] Data Stream Mining in Fog Computing Environment with Feature Selection Using Ensemble of Swarm Search Algorithms
    Ma, Bin Bin
    Fong, Simon
    Millham, Richard
    2018 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY (ICTAS), 2018,
  • [22] Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study
    Anurag Kumar Verma
    Saurabh Pal
    Surjeet Kumar
    Applied Biochemistry and Biotechnology, 2020, 190 : 341 - 359
  • [23] Mining and fusing unstructured online reviews and structured public index data for hospital selection
    Liao, Huchang
    Qi, Jiaxin
    Zhang, Jiawei
    Zhang, Chonghui
    Liu, Fan
    Ding, Weiping
    INFORMATION FUSION, 2024, 103
  • [24] Dynamic Ensemble Selection for Imbalanced Data Streams With Concept Drift
    Jiao, Botao
    Guo, Yinan
    Gong, Dunwei
    Chen, Qiuju
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 1278 - 1291
  • [25] Solar Irradiance Forecasting Using Dynamic Ensemble Selection
    Santos, Domingos S. de O. Jr Jr
    Neto, Paulo S. G. de Mattos
    de Oliveira, Joao F. L.
    Siqueira, Hugo Valadares
    Barchi, Tathiana Mikamura
    Lima, Aranildo R.
    Madeiro, Francisco
    Dantas, Douglas A. P.
    Converti, Attilio
    Pereira, Alex C.
    de Melo Filho, Jose Bione
    Marinho, Manoel H. N.
    APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [26] A scalable dynamic ensemble selection using fuzzy hyperboxes
    Davtalab, Reza
    Cruz, Rafael M. O.
    Sabourin, Robert
    INFORMATION FUSION, 2024, 102
  • [27] Using Dynamic Data Mining in Association Rule Mining
    Qaddoum, Kifaya
    MESM '2006: 9TH MIDDLE EASTERN SIMULATION MULTICONFERENCE, 2008, : 89 - 92
  • [28] From dynamic classifier selection to dynamic ensemble selection
    Ko, Albert H. R.
    Sabourin, Robert
    Britto, Alceu Souza, Jr.
    PATTERN RECOGNITION, 2008, 41 (05) : 1718 - 1731
  • [29] On Taxonomy and Evaluation of Feature Selection-Based Learning Classifier System Ensemble Approaches for Data Mining Problems
    Debie, Essam
    Shafi, Kamran
    Merrick, Kathryn
    Lokan, Chris
    COMPUTATIONAL INTELLIGENCE, 2017, 33 (03) : 554 - 578
  • [30] A dynamic ensemble learning based data mining framework for medical imbalanced big data
    Rithani, M.
    Kumar, R. Prasanna
    Ali, Altalbe
    KNOWLEDGE-BASED SYSTEMS, 2025, 310