A comparison of textual data mining methods for sex identification in chat conversations

Cited by: 0
Authors
Kose, Cemal [1]
Ozyurt, Ozcan [1]
Ikibas, Cevat [1]
Affiliations
[1] Karadeniz Tech Univ, Fac Engn, Dept Comp Engn, TR-61080 Trabzon, Turkey
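Source
Keywords
mining chat conversations; sex identification; information extraction; text mining; machine learning
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract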
Mining textual data from chat media is becoming increasingly important because these media contain a vast amount of information that is potentially relevant to a society's current interests, habits, social behaviors, criminal tendencies and other inclinations. Here, sex identification is taken as a base study in information mining from chat media. To this end, a simple discrimination function and a semantic analysis method are proposed for sex identification in Turkish chat media. The proposed sex identification method is then compared with the Support Vector Machine (SVM) and Naive Bayes (NB) methods. The results show that the proposed system achieves an accuracy of over 90% in sex identification.
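
The abstract names Naive Bayes (NB) as one of the baseline methods. As a rough illustration of how such a baseline classifies a tokenized chat message, here is a minimal sketch of a multinomial Naive Bayes classifier with add-one smoothing. It is not the authors' implementation: the function names, the labels "A" and "B", and the toy tokens are all invented for this example, and the record gives no details of the features or data actually used.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(samples):
    """Count labels and per-label token frequencies from (tokens, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for tokens, label in samples:
        label_counts[label] += 1
        word_counts[label].update(tokens)
        vocabulary.update(tokens)
    return label_counts, word_counts, vocabulary


def classify(model, tokens):
    """Return the label with the highest log-posterior for the given tokens."""
    label_counts, word_counts, vocabulary = model
    total_samples = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in label_counts.items():
        score = math.log(count / total_samples)  # log prior
        total_words = sum(word_counts[label].values())
        for token in tokens:
            if token not in vocabulary:
                continue  # ignore tokens never seen during training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            numerator = word_counts[label][token] + 1
            denominator = total_words + len(vocabulary)
            score += math.log(numerator / denominator)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented toy data: placeholder tokens and labels, not taken from the paper.
TOY_SAMPLES = [
    (["alpha", "beta", "gamma"], "A"),
    (["alpha", "delta", "epsilon"], "A"),
    (["zeta", "eta", "gamma"], "B"),
    (["eta", "theta", "zeta"], "B"),
]
MODEL = train_naive_bayes(TOY_SAMPLES)
```

Calling `classify(MODEL, ["alpha", "epsilon"])` picks the label whose training messages contained those tokens more often. A real comparison like the one described in the abstract would need Turkish-specific tokenization and a held-out test set, neither of which is specified in this record.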
Pages: 638-643
Page count: 6