Author Profiles Prediction Using Syntactic and Content-Based Features

被引:0
|
作者
Reddy, T. Raghunadha [1 ]
Srilatha, M. [2 ]
Sreenivas, M. [3 ]
Rajasekhar, N. [4 ]
机构
[1] Vardhaman Coll Engn, Dept IT, Hyderabad, India
[2] VR Siddhartha Engn Coll, Dept CSE, Vijayawada, India
[3] Sreenidhi Inst Sci & Technol, Dept IT, Hyderabad, India
[4] Gokaraju Rangaraju Inst Engn & Technol, Dept IT, Hyderabad, India
关键词
Gender prediction; Author profiling; PDW model; Syntactic features; Content-based features;
D O I
10.1007/978-981-15-1097-7_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In digital forensics, the forensic analysts raised the major questions about the details of the author of a document like identity, demographic information of authors and the documents which were related these documents. To answer these questions, the researchers proposed a new research field of stylometry which uses the set of linguistic features and machine learning algorithms. Information extraction from the textual documents has become a popular research area in the last few years to know the details of the authors. In this context, author profiling is one research area concentrated by the several researchers to know the authors' demographic profiles like age, gender, and location by examining their style of writing. Several researchers proposed various types of stylistic features to analyze the style of the authors writing. In this paper, the experiment was performed with combination of syntactic features and content-based features. Various machine learning classifiers were used to evaluate the performance of the prediction of gender of reviews dataset. The proposed method achieved best accuracy for profiles prediction in author profiling.
引用
收藏
页码:265 / 273
页数:9
相关论文
共 50 条
  • [1] Using Content-Based Features for Author Profiling of Vietnamese Forum Posts
    Duc Tran Duong
    Son Bao Pham
    Hanh Tan
    RECENT DEVELOPMENTS IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2016, 642 : 287 - 296
  • [2] Augmenting content-based rating prediction with link stream features
    Viard, Tiphaine
    Fournier-S'niehotta, Raphael
    COMPUTER NETWORKS, 2019, 150 : 127 - 133
  • [3] Content-based image retrieval using composite features
    Kauniskangas, H
    Sauvola, J
    Pietikainen, M
    Doermann, D
    SCIA '97 - PROCEEDINGS OF THE 10TH SCANDINAVIAN CONFERENCE ON IMAGE ANALYSIS, VOLS 1 AND 2, 1997, : 35 - 42
  • [4] Figure Plagiarism Detection Using Content-Based Features
    Eisa, Taiseer
    Salim, Naomie
    Alzahrani, Salha
    RECENT DEVELOPMENTS IN INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, ICCD 2016, 2017, 555 : 17 - 20
  • [5] Content-based image retrieval using multiple features
    Zhang, Chi
    Huang, Lei
    Journal of Computing and Information Technology, 2014, 22 (SpecialIssue) : 1 - 10
  • [6] Content-based image retrieval using texture features
    Honda, MO
    Azevedo-Marques, PM
    Rodrigues, JAH
    CARS 2002: COMPUTER ASSISTED RADIOLOGY AND SURGERY, PROCEEDINGS, 2002, : 1036 - 1036
  • [7] Content-based microarray search using differential expression profiles
    Jesse M Engreitz
    Alexander A Morgan
    Joel T Dudley
    Rong Chen
    Rahul Thathoo
    Russ B Altman
    Atul J Butte
    BMC Bioinformatics, 11
  • [8] Content-based microarray search using differential expression profiles
    Engreitz, Jesse M.
    Morgan, Alexander A.
    Dudley, Joel T.
    Chen, Rong
    Thathoo, Rahul
    Altman, Russ B.
    Butte, Atul J.
    BMC BIOINFORMATICS, 2010, 11
  • [9] Code Authorship Attribution using content-based and non-content-based features
    Bayrami, Parinaz
    Rice, Jacqueline E.
    2021 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2021,
  • [10] Content-based image retrieval using colour and shape features
    Park, YoungJae
    Park, KeeHong
    Kim, GyeYoung
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2013, 48 (02) : 155 - 161