A comparison of textual data mining methods for sex identification in chat conversations

被引:0
|
作者
Kose, Cemal [1 ]
Ozyurt, Ozcan [1 ]
Ikibas, Cevat [1 ]
机构
[1] Karadeniz Tech Univ, Fac Engn, Dept Comp Engn, TR-61080 Trabzon, Turkey
来源
关键词
mining chat conversations; sex identification; information extraction; text mining; machine learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining textual data in chat mediums is becoming more important because these mediums contain a vast amount of information, which is potentially relevant to a society's current interests, habits, social behaviors, crime tendency and other tendencies. Here, sex identification is taken as a base study in information mining in chat mediums. In order to do this, a simple discrimination function and semantic analysis method are proposed for sex identification in Turkish chat mediums. Then, the proposed sex identification method is compared with the Support Vector Machine (SVM) and Naive Bayes (NB) methods. Finally, results show that the proposed system has achieved accuracy over 90% in sex identification.
引用
收藏
页码:638 / 643
页数:6
相关论文
共 50 条
  • [1] Mining chat conversations for sex identification
    Koese, Cemal
    Oezyurt, Oezcan
    Amanmyradov, Guychmyrat
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 45 - 55
  • [2] Chat mining: Automatically determination of chat conversations' topic in Turkish text based chat mediums
    Oezyurt, Oezcan
    Koese, Cemal
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (12) : 8705 - 8710
  • [3] A Comparison of Data Mining Methods in Microfinance
    Wu, Jia
    Vadera, Sunil
    Dayson, Karl
    Burridge, Diane
    Clough, Ian
    [J]. 2010 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND FINANCIAL ENGINEERING (ICIFE), 2010, : 499 - 502
  • [4] Identification and Prediction of Human Behavior through Mining of Unstructured Textual Data
    Davahli, Mohammad Reza
    Karwowski, Waldemar
    Gutierrez, Edgar
    Fiok, Krzysztof
    Wrobel, Grzegorz
    Taiar, Redha
    Ahram, Tareq
    [J]. SYMMETRY-BASEL, 2020, 12 (11): : 1 - 23
  • [5] Mining the Students' Chat Conversations in a Personalized e-Learning Environment
    Al-Abri, Amal
    Al-Khanjari, Zuhoor
    Jamoussi, Yassine
    Kraiem, Naoufel
    [J]. INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2019, 14 (23) : 98 - 124
  • [6] Websom for Textual Data Mining
    Krista Lagus
    Timo Honkela
    Samuel Kaski
    Teuvo Kohonen
    [J]. Artificial Intelligence Review, 1999, 13 : 345 - 364
  • [7] Websom for textual data mining
    Lagus, K
    Honkela, T
    Kaski, S
    Kohonen, T
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 1999, 13 (5-6) : 345 - 364
  • [8] Textual data mining by parsing
    Bellacicco, A
    [J]. DATA MINING III, 2002, 6 : 311 - 319
  • [9] A Comparison of Data Mining Methods in Analyzing Educational Data
    Jung, Euihyun
    [J]. ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2017, 421 : 173 - 178
  • [10] Applying data mining methods to structural identification
    Michaela, Horalova Kalinova
    German, Michalconok
    Darja, Gabriska
    [J]. ADVANCED DESIGNS AND RESEARCHES FOR MANUFACTURING, PTS 1-3, 2013, 605-607 : 2279 - 2283