Knowledge discovery in sociological databases: An application on general society survey dataset

被引:1
|
作者
Pan Z. [1 ]
Li J. [2 ]
Chen Y. [1 ]
Pacheco J. [3 ]
Dai L. [4 ]
Zhang J. [4 ]
机构
[1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing
[2] High School Affiliated to Renmin University of China, Beijing
[3] Universidad de Sonora, Hermosillo
[4] Information Centre of China Disabled Persons' Federation, Beijing
关键词
Crowdsourced big data and analytics; Data management; Data mining; Knowledge discovery;
D O I
10.1108/IJCS-09-2019-0023
中图分类号
学科分类号
摘要
Purpose: The General Society Survey(GSS) is a kind of government-funded survey which aims at examining the Socio-economic status, quality of life, and structure of contemporary society. GSS data set is regarded as one of the authoritative source for the government and organization practitioners to make data-driven policies. The previous analytic approaches for GSS data set are designed by combining expert knowledges and simple statistics. By utilizing the emerging data mining algorithms, we proposed a comprehensive data management and data mining approach for GSS data sets. Design/methodology/approach: The approach are designed to be operated in a two-phase manner: a data management phase which can improve the quality of GSS data by performing attribute pre-processing and filter-based attribute selection; a data mining phase which can extract hidden knowledge from the data set by performing data mining analysis including prediction analysis, classification analysis, association analysis and clustering analysis. Findings: According to experimental evaluation results, the paper have the following findings: Performing attribute selection on GSS data set can increase the performance of both classification analysis and clustering analysis; all the data mining analysis can effectively extract hidden knowledge from the GSS data set; the knowledge generated by different data mining analysis can somehow cross-validate each other. Originality/value: By leveraging the power of data mining techniques, the proposed approach can explore knowledge in a fine-grained manner with minimum human interference. Experiments on Chinese General Social Survey data set are conducted at the end to evaluate the performance of our approach. © 2019, Zhiwen Pan, Jiangtian Li, Yiqiang Chen, Jesus Pacheco, Lianjun Dai and Jun Zhang.
引用
收藏
页码:315 / 332
页数:17
相关论文
共 50 条
  • [1] Knowledge discovery in databases: application to chromatography
    Bryant, CH
    Rowe, RC
    TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 1998, 17 (01) : 18 - 24
  • [2] KNOWLEDGE DISCOVERY IN DATABASES
    PIATETSKYSHAPIRO, G
    IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1991, 6 (05): : 74 - 76
  • [3] Knowledge discovery in databases
    Norton, MJ
    LIBRARY TRENDS, 1999, 48 (01) : 9 - 21
  • [4] Knowledge discovery in databases
    Düsing, R
    WIRTSCHAFTSINFORMATIK, 2000, 42 (01): : 74 - 75
  • [5] Knowledge discovery in databases: an application to market segmentation in retail supermarkets
    Endler K.D.
    Scarpin C.T.
    Steiner M.T.A.
    Sfeir T.A.
    Pereira da Veiga C.
    International Journal of Business Intelligence and Data Mining, 2023, 22 (03) : 310 - 332
  • [6] A Survey on Knowledge Discovery of Healthcare Dataset using Graph based approach
    Saravanan, M. S.
    Kumar, R. Sai Manoj
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 : 8 - 13
  • [7] Revisable knowledge discovery in databases
    Narayanan, A
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1996, 11 (02) : 75 - 96
  • [8] Knowledge discovery in endgame databases
    Schlosser, M
    ADVANCES IN INTELLIGENT DATA ANALYSIS: REASONING ABOUT DATA, 1997, 1280 : 423 - 435
  • [9] Relational knowledge discovery in databases
    Blockeel, H
    De Raedt, L
    INDUCTIVE LOGIC PROGRAMMING, 1997, 1314 : 199 - 211
  • [10] SYSTEMS FOR KNOWLEDGE DISCOVERY IN DATABASES
    MATHEUS, CJ
    CHAN, PK
    PIATETSKYSHAPIRO, G
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) : 903 - 913