Towards Knowledge Discovery in Big Data

被引:17
|
作者
Lomotey, Richard K. [1 ]
Deters, Ralph [1 ]
机构
[1] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK S7N 0W0, Canada
关键词
Tagging; Filtering; Terms; Topics; Association Rules; Dictionary; Big Data; Unstructured Data Mining; Analytics as a Service;
D O I
10.1109/SOSE.2014.25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Analytics-as-a-Service (AaaS) has become indispensable because it affords stakeholders to discover knowledge in Big Data. Previously, data stored in data warehouses follow some schema and standardization which leads to efficient data mining. However, the Big Data epoch has witnessed the rise of structured, semi-structured, and unstructured data; a trend that motivated enterprises to employ the NoSQL data storages to accommodate the high-dimensional data. Unfortunately, the existing data mining techniques which are designed for schema-oriented storages are non-applicable to the unstructured data style. Thus, the AaaS though still in its infancy, is gaining widespread attention for its ability to provide novel ways and opportunities to mine the heterogeneous data. In this paper, we discuss our AaaS tool that performs terms and topics extraction and organization from unstructured data sources such as NoSQL databases, textual contents (e.g., websites), and structured sources (e.g. SQL). The tool is built on methodologies such as tagging, filtering, association maps, and adaptable dictionary. The evaluation of the tool shows high accuracy in the mining process.
引用
收藏
页码:181 / 191
页数:11
相关论文
共 50 条
  • [1] Towards Differentiating Business Intelligence, Big Data, Data Analytics and Knowledge Discovery
    Dedic, Nedim
    Stanier, Clare
    [J]. INNOVATIONS IN ENTERPRISE INFORMATION SYSTEMS MANAGEMENT AND ENGINEERING, 2017, 285 : 114 - 122
  • [2] Big Data knowledge discovery
    Xhafa, Fatos
    Taniar, David
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 79 : 1 - 2
  • [3] Big data analytics and knowledge discovery
    Bellatreche, Ladjel
    Mohania, Mukesh
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (15): : 3945 - 3947
  • [4] Big Data Analytics and Knowledge Discovery
    Golfarelli, Matteo
    Wrembel, Robert
    [J]. DATA & KNOWLEDGE ENGINEERING, 2023, 146
  • [5] Big Data Trend: Knowledge Discovery on the Unstructured Data
    Abu Muntalib, Shamsiah
    Sidi, Fatimah
    Jabar, Marzanah A.
    Ishak, Iskandar
    [J]. PROCEEDING OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2014, VOLS 1 AND 2, 2014, : 338 - 342
  • [6] 23-bit Metaknowledge Template Towards Big Data Knowledge Discovery and Management
    Bari, Nima
    Vichr, Roman
    Kowsari, Kamran
    Berkovich, Simon
    [J]. 2014 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2014, : 519 - 526
  • [7] Sampling and Evaluating the Big Data for Knowledge Discovery
    Sung, Andrew H.
    Ribeiro, Bernardete
    Liu, Qingzhong
    [J]. IOTBD: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND BIG DATA, 2016, : 378 - 382
  • [8] Unsupervised Knowledge Discovery in 'Big' Materials Data
    Sun, Wenhao
    [J]. ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2021, 77 : C187 - C187
  • [9] Data Analysis & Classification Methodology for Knowledge Discovery in Big Data
    Patil, Shilpa
    Kumar, Ashok P. S.
    Patil, Prasadgouda
    Palagi, Puneet
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2017), 2017, : 40 - 43
  • [10] Assessing reliability of Big Data Knowledge Discovery process
    Safhi, Hicham Moad
    Frikh, Bouchra
    Ouhbi, Brahim
    [J]. SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018), 2019, 148 : 30 - 36