Author Profiles Prediction Using Syntactic and Content-Based Features

被引:0
|
作者
Reddy, T. Raghunadha [1 ]
Srilatha, M. [2 ]
Sreenivas, M. [3 ]
Rajasekhar, N. [4 ]
机构
[1] Vardhaman Coll Engn, Dept IT, Hyderabad, India
[2] VR Siddhartha Engn Coll, Dept CSE, Vijayawada, India
[3] Sreenidhi Inst Sci & Technol, Dept IT, Hyderabad, India
[4] Gokaraju Rangaraju Inst Engn & Technol, Dept IT, Hyderabad, India
关键词
Gender prediction; Author profiling; PDW model; Syntactic features; Content-based features;
D O I
10.1007/978-981-15-1097-7_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In digital forensics, the forensic analysts raised the major questions about the details of the author of a document like identity, demographic information of authors and the documents which were related these documents. To answer these questions, the researchers proposed a new research field of stylometry which uses the set of linguistic features and machine learning algorithms. Information extraction from the textual documents has become a popular research area in the last few years to know the details of the authors. In this context, author profiling is one research area concentrated by the several researchers to know the authors' demographic profiles like age, gender, and location by examining their style of writing. Several researchers proposed various types of stylistic features to analyze the style of the authors writing. In this paper, the experiment was performed with combination of syntactic features and content-based features. Various machine learning classifiers were used to evaluate the performance of the prediction of gender of reviews dataset. The proposed method achieved best accuracy for profiles prediction in author profiling.
引用
收藏
页码:265 / 273
页数:9
相关论文
共 50 条
  • [31] Content-based image retrieval system using ORB and SIFT features
    Payal Chhabra
    Naresh Kumar Garg
    Munish Kumar
    Neural Computing and Applications, 2020, 32 : 2725 - 2733
  • [32] Content-based image retrieval system using ORB and SIFT features
    Chhabra, Payal
    Garg, Naresh Kumar
    Kumar, Munish
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07): : 2725 - 2733
  • [33] Content-based image authentication using sparse features and SOM trajectories
    Peinado, Alberto
    Ortiz, Andres
    Cotrina, Guillermo
    LOGIC JOURNAL OF THE IGPL, 2016, 24 (01) : 4 - 15
  • [34] Content-Based Image Retrieval Using Multiresolution Color and Texture Features
    Chun, Young Deok
    Kim, Nam Chul
    Jang, Ick Hoon
    IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (06) : 1073 - 1084
  • [35] Content-Based Image Retrieval Using a Combination of Texture and Color Features
    Bu, Hee-Hyung
    Kim, Nam-Chul
    Kim, Sung-Ho
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2021, 11
  • [36] Content-based image retrieval using Gabor-Zernike features
    Fu, X.
    Li, Y.
    Harrison, R.
    Belkasim, S.
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 417 - +
  • [37] Structure features for content-based image retrieval
    Brunner, G
    Burkhardt, H
    PATTERN RECOGNITION, PROCEEDINGS, 2005, 3663 : 425 - 433
  • [38] Benchmarking of image features for content-based retrieval
    Ma, WY
    Zhang, HJ
    CONFERENCE RECORD OF THE THIRTY-SECOND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 253 - 257
  • [39] Content-based Video Retrieval with Multi Features
    Lin, Jhih-Long
    Chien, Ou-Yang
    Yu, Han-Yen
    Chen, Jiann-Jone
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 1248 - 1257
  • [40] Prosemantic Features for Content-Based Image Retrieval
    Ciocca, Gianluigi
    Cusano, Claudio
    Santini, Simone
    Schettini, Raimondo
    ADAPTIVE MULTIMEDIA RETRIEVAL: UNDERSTANDING MEDIA AND ADAPTING TO THE USER, 2011, 6535 : 87 - +