Author Profiles Prediction Using Syntactic and Content-Based Features

被引:0
|
作者
Reddy, T. Raghunadha [1 ]
Srilatha, M. [2 ]
Sreenivas, M. [3 ]
Rajasekhar, N. [4 ]
机构
[1] Vardhaman Coll Engn, Dept IT, Hyderabad, India
[2] VR Siddhartha Engn Coll, Dept CSE, Vijayawada, India
[3] Sreenidhi Inst Sci & Technol, Dept IT, Hyderabad, India
[4] Gokaraju Rangaraju Inst Engn & Technol, Dept IT, Hyderabad, India
关键词
Gender prediction; Author profiling; PDW model; Syntactic features; Content-based features;
D O I
10.1007/978-981-15-1097-7_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In digital forensics, the forensic analysts raised the major questions about the details of the author of a document like identity, demographic information of authors and the documents which were related these documents. To answer these questions, the researchers proposed a new research field of stylometry which uses the set of linguistic features and machine learning algorithms. Information extraction from the textual documents has become a popular research area in the last few years to know the details of the authors. In this context, author profiling is one research area concentrated by the several researchers to know the authors' demographic profiles like age, gender, and location by examining their style of writing. Several researchers proposed various types of stylistic features to analyze the style of the authors writing. In this paper, the experiment was performed with combination of syntactic features and content-based features. Various machine learning classifiers were used to evaluate the performance of the prediction of gender of reviews dataset. The proposed method achieved best accuracy for profiles prediction in author profiling.
引用
收藏
页码:265 / 273
页数:9
相关论文
共 50 条
  • [21] Content-Based Image Retrieval Using Color Features of Partitioned Images
    Fathian, Mohsen
    Tab, Fardin Akhlaghian
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [22] Content-based image retrieval using visually significant point features
    Banerjee, Minakshi
    Kundu, Malay K.
    Maji, Pradipta
    FUZZY SETS AND SYSTEMS, 2009, 160 (23) : 3323 - 3341
  • [23] Hybrid Content-based Trademark Retrieval using Region and Contour Features
    Hong, Zhiling
    Jiang, Qingshan
    2008 22ND INTERNATIONAL WORKSHOPS ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOLS 1-3, 2008, : 1163 - +
  • [24] Content-Based Image Retrieval Using Invariant Color and Texture Features
    Afifi, Ahmed J.
    Ashour, Wesam M.
    2012 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING TECHNIQUES AND APPLICATIONS (DICTA), 2012,
  • [25] Content-based image retrieval using color and texture fused features
    Yue, Jun
    Li, Zhenbo
    Liu, Lu
    Fu, Zetian
    MATHEMATICAL AND COMPUTER MODELLING, 2011, 54 (3-4) : 1121 - 1127
  • [26] Secure Content-Based Image Retrieval Using Combined Features in Cloud
    Anju, J.
    Shreelekshmi, R.
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY (ICDCIT 2020), 2020, 11969 : 179 - 197
  • [27] CONTENT-BASED IMAGE RETRIEVAL USING COLOR FEATURES OF SALIENT REGIONS
    An, Jaehyun
    Lee, Sang Hwa
    Cho, Nam Ik
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3042 - 3046
  • [28] Content-based Image Retrieval using Visual Attention Point Features
    Wang, Xiang-Yang
    Li, Yong-Wei
    Niu, Pan-Pan
    Yang, Hong-Ying
    Li, Dong-Ming
    FUNDAMENTA INFORMATICAE, 2014, 135 (03) : 309 - 329
  • [29] Content-based image retrieval using a fusion of global and local features
    Bu, Hee Hyung
    Kim, Nam Chul
    Kim, Sung Ho
    ETRI JOURNAL, 2023, 45 (03) : 505 - 518
  • [30] Content-based mobile spam classification using stylistically motivated features
    Sohn, Dae-Neung
    Lee, Jung-Tae
    Han, Kyoung-Soo
    Rim, Hae-Chang
    PATTERN RECOGNITION LETTERS, 2012, 33 (03) : 364 - 369