Towards Knowledge Discovery in Big Data

被引:17
|
作者
Lomotey, Richard K. [1 ]
Deters, Ralph [1 ]
机构
[1] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK S7N 0W0, Canada
关键词
Tagging; Filtering; Terms; Topics; Association Rules; Dictionary; Big Data; Unstructured Data Mining; Analytics as a Service;
D O I
10.1109/SOSE.2014.25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Analytics-as-a-Service (AaaS) has become indispensable because it affords stakeholders to discover knowledge in Big Data. Previously, data stored in data warehouses follow some schema and standardization which leads to efficient data mining. However, the Big Data epoch has witnessed the rise of structured, semi-structured, and unstructured data; a trend that motivated enterprises to employ the NoSQL data storages to accommodate the high-dimensional data. Unfortunately, the existing data mining techniques which are designed for schema-oriented storages are non-applicable to the unstructured data style. Thus, the AaaS though still in its infancy, is gaining widespread attention for its ability to provide novel ways and opportunities to mine the heterogeneous data. In this paper, we discuss our AaaS tool that performs terms and topics extraction and organization from unstructured data sources such as NoSQL databases, textual contents (e.g., websites), and structured sources (e.g. SQL). The tool is built on methodologies such as tagging, filtering, association maps, and adaptable dictionary. The evaluation of the tool shows high accuracy in the mining process.
引用
收藏
页码:181 / 191
页数:11
相关论文
共 50 条
  • [41] Making Knowledge Discovery Services Scalable on Clouds for Big Data Mining
    Talia, Domenico
    [J]. PROCEEDINGS 2015 SECOND IEEE INTERNATIONAL CONFERENCE ON SPATIAL DATA MINING AND GEOGRAPHICAL KNOWLEDGE SERVICES (ICSDM 2015), 2015, : 1 - 4
  • [42] SOA Based System for Big Genomic Data Analytics and Knowledge Discovery
    Gancheva, Veska
    Borovska, Plamenka
    [J]. PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 1, 2019, : 536 - 541
  • [43] Applying Knowledge Discovery System to Intelligent Manufacturing based on the Big Data
    Chen, Qing
    Pang, Jun
    Chen, Yefeng
    Yang, Ming
    Luo, Yang
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE & APPLICATION TECHNOLOGY (ICCIA 2017), 2017, 74 : 782 - 787
  • [44] Advanced knowledge discovery techniques from Big Data and Cloud Computing
    Xhafa, Fatos
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2016, 10 (09) : 945 - 946
  • [45] Towards Development of National Health Data Warehouse for Knowledge Discovery
    Khan, Shahidul Islam
    Hoque, Abu Sayed Md Latiful
    [J]. INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 2, 2016, 385 : 413 - 421
  • [46] Big Data, Big Results: Knowledge Discovery in Output from Large-Scale Analytics
    McCormick, Tyler H.
    Ferrell, Rebecca
    Karr, Alan F.
    Ryan, Patrick B.
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2014, 7 (05) : 404 - 412
  • [47] Use of Big Data and Knowledge Discovery to Create Data Backbones for Decision Support Systems
    Renu, Rahul Sharan
    Mocko, Gregory
    Koneru, Abhiram
    [J]. COMPLEX ADAPTIVE SYSTEMS: EMERGING TECHNOLOGIES FOR EVOLVING SYSTEMS: SOCIO-TECHNICAL, CYBER AND BIG DATA, 2013, 20 : 446 - 453
  • [48] Concept of Operations for Knowledge Discovery from "Big Data" Across Enterprise Data Warehouses
    Sukumar, Sreenivas R.
    Olama, Mohammed M.
    McNair, Allen W.
    Nutaro, James J.
    [J]. NEXT-GENERATION ANALYST, 2013, 8758
  • [49] Towards an Architecture for Big Data-Driven Knowledge Management Systems
    Thang Le Dinh
    Thuong-Cang Phan
    Bui, Trung
    [J]. AMCIS 2016 PROCEEDINGS, 2016,
  • [50] Big Data, Big Knowledge: Big Data for Personalized Healthcare
    Viceconti, Marco
    Hunter, Peter
    Hose, Rod
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (04) : 1209 - 1215