Knowledge discovery in sociological databases: An application on general society survey dataset

被引:1
|
作者
Pan Z. [1 ]
Li J. [2 ]
Chen Y. [1 ]
Pacheco J. [3 ]
Dai L. [4 ]
Zhang J. [4 ]
机构
[1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing
[2] High School Affiliated to Renmin University of China, Beijing
[3] Universidad de Sonora, Hermosillo
[4] Information Centre of China Disabled Persons' Federation, Beijing
关键词
Crowdsourced big data and analytics; Data management; Data mining; Knowledge discovery;
D O I
10.1108/IJCS-09-2019-0023
中图分类号
学科分类号
摘要
Purpose: The General Society Survey(GSS) is a kind of government-funded survey which aims at examining the Socio-economic status, quality of life, and structure of contemporary society. GSS data set is regarded as one of the authoritative source for the government and organization practitioners to make data-driven policies. The previous analytic approaches for GSS data set are designed by combining expert knowledges and simple statistics. By utilizing the emerging data mining algorithms, we proposed a comprehensive data management and data mining approach for GSS data sets. Design/methodology/approach: The approach are designed to be operated in a two-phase manner: a data management phase which can improve the quality of GSS data by performing attribute pre-processing and filter-based attribute selection; a data mining phase which can extract hidden knowledge from the data set by performing data mining analysis including prediction analysis, classification analysis, association analysis and clustering analysis. Findings: According to experimental evaluation results, the paper have the following findings: Performing attribute selection on GSS data set can increase the performance of both classification analysis and clustering analysis; all the data mining analysis can effectively extract hidden knowledge from the GSS data set; the knowledge generated by different data mining analysis can somehow cross-validate each other. Originality/value: By leveraging the power of data mining techniques, the proposed approach can explore knowledge in a fine-grained manner with minimum human interference. Experiments on Chinese General Social Survey data set are conducted at the end to evaluate the performance of our approach. © 2019, Zhiwen Pan, Jiangtian Li, Yiqiang Chen, Jesus Pacheco, Lianjun Dai and Jun Zhang.
引用
收藏
页码:315 / 332
页数:17
相关论文
共 50 条
  • [21] Logical calculi for knowledge discovery in databases
    Rauch, J
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1263 : 47 - 57
  • [22] Discovery of knowledge in Dutch dairy databases
    Kroeze, GH
    Lokhorst, C
    van den Berg, JV
    Blok, B
    de Vries, F
    WORK SCIENCES IN SUSTAINABLE AGRICULTURE, 1999, : 142 - 148
  • [23] An XML approach to knowledge discovery in databases
    Kotásek, P
    Zendulka, J
    KNOWLEDGE-BASED SOFTWARE ENGINEERING, 2000, 62 : 141 - 148
  • [24] Discovery of temporal knowledge from databases
    Watanabe, K
    Miura, T
    Shioya, I
    INFORMATION REUSE AND INTEGRATION, 2001, : 13 - 17
  • [25] Knowledge Discovery in Multiple Spatial Databases
    Aleksandar Lazarevic
    Zoran Obradovic
    Neural Computing & Applications, 2002, 10 : 339 - 350
  • [26] A FRAMEWORK FOR KNOWLEDGE DISCOVERY AND EVOLUTION IN DATABASES
    YOON, JP
    KERSCHBERG, L
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) : 973 - 979
  • [27] Novelty framework for knowledge discovery in databases
    Al-Hegami, AS
    Bhatnagar, V
    Kumar, N
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2004, 3181 : 48 - 57
  • [28] Knowledge discovery in bibliographic databases.
    Wool, G
    LIBRARY RESOURCES & TECHNICAL SERVICES, 2000, 44 (04): : 218 - 219
  • [29] COMPUTER AIDED OF KNOWLEDGE DISCOVERY IN DATABASES
    Janoscova, Renata
    INTERNATIONAL CONFERENCE ON MANAGEMENT: TRENDS OF MANAGEMENT IN THE CONTEMPORARY SOCIETY, 2016, : 82 - 86
  • [30] ADVANCES IN KNOWLEDGE DISCOVERY IN DISTRIBUTED DATABASES
    Pupezescu, Valentin
    RETHINKING EDUCATION BY LEVERAGING THE ELEARNING PILLAR OF THE DIGITAL AGENDA FOR EUROPE!, VOL. I, 2015, : 311 - 319