A comparison of textual data mining methods for sex identification in chat conversations

被引:0
|
作者
Kose, Cemal [1 ]
Ozyurt, Ozcan [1 ]
Ikibas, Cevat [1 ]
机构
[1] Karadeniz Tech Univ, Fac Engn, Dept Comp Engn, TR-61080 Trabzon, Turkey
来源
关键词
mining chat conversations; sex identification; information extraction; text mining; machine learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining textual data in chat mediums is becoming more important because these mediums contain a vast amount of information, which is potentially relevant to a society's current interests, habits, social behaviors, crime tendency and other tendencies. Here, sex identification is taken as a base study in information mining in chat mediums. In order to do this, a simple discrimination function and semantic analysis method are proposed for sex identification in Turkish chat mediums. Then, the proposed sex identification method is compared with the Support Vector Machine (SVM) and Naive Bayes (NB) methods. Finally, results show that the proposed system has achieved accuracy over 90% in sex identification.
引用
收藏
页码:638 / 643
页数:6
相关论文
共 50 条
  • [11] Methods for Mining and Summarizing Text Conversations
    Carenini, Giuseppe
    Murray, Gabriel
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1178 - 1179
  • [12] Reviewing textual data mining in japan
    Ohsumi, N
    Yasuda, A
    [J]. SOCIOLOGICAL THEORY AND METHODS, 2004, 19 (02) : 135 - 159
  • [13] Phase Identification in Distribution Systems by Data Mining Methods
    Ni, F.
    Liu, J. Q.
    Wei, F.
    Zhu, C. D.
    Xie, S. X.
    [J]. 2017 IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2017,
  • [14] Combining two data mining methods for system identification
    Saitta, Sandro
    Raphael, Benny
    Smith, Ian F. C.
    [J]. INTELLIGENT COMPUTING IN ENGINEERING AND ARCHITECTURE, 2006, 4200 : 606 - 614
  • [15] Comparison between document vectorization methods: a case study for textual data
    Kubrusly, Jessica
    Valenotti, Gabriel G. L.
    [J]. SIGMAE, 2024, 13 (01): : 79 - 90
  • [16] Predicting Juvenile Offending: A Comparison of Data Mining Methods
    Ang, Rebecca P.
    Goh, Dion H.
    [J]. INTERNATIONAL JOURNAL OF OFFENDER THERAPY AND COMPARATIVE CRIMINOLOGY, 2013, 57 (02) : 191 - 207
  • [17] Chat Summarization and Sentiment Analysis Techniques in Data Mining
    Rani, Reeta
    Tandon, Sawal
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS), 2018, : 102 - 106
  • [18] Detecting emerging concepts in textual data mining
    Pottenger, WM
    Yang, TH
    [J]. COMPUTATIONAL INFORMATION RETRIEVAL, 2001, : 89 - 105
  • [19] Mining causality knowledge from textual data
    Pechsiri, C
    Kawtrakul, A
    Piriyakul, R
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2006, : 85 - +
  • [20] Mining explanation knowledge from textual data
    Pechsiri, Chaveevan
    Kawtrakul, Asance
    Piriyakul, Rapepun
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER SCIENCE AND TECHNOLOGY, 2006, : 322 - +