Author Profiles Prediction Using Syntactic and Content-Based Features

Citations: 0
Authors
Reddy, T. Raghunadha [1]
Srilatha, M. [2]
Sreenivas, M. [3]
Rajasekhar, N. [4]
Affiliations
[1] Vardhaman Coll Engn, Dept IT, Hyderabad, India
[2] VR Siddhartha Engn Coll, Dept CSE, Vijayawada, India
[3] Sreenidhi Inst Sci & Technol, Dept IT, Hyderabad, India
[4] Gokaraju Rangaraju Inst Engn & Technol, Dept IT, Hyderabad, India
Keywords
Gender prediction; Author profiling; PDW model; Syntactic features; Content-based features;
DOI
10.1007/978-981-15-1097-7_23
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In digital forensics, analysts raise major questions about the author of a document, such as the author's identity, the author's demographic information, and which other documents are related to it. To answer these questions, researchers proposed the field of stylometry, which uses sets of linguistic features and machine learning algorithms. Extracting information from text documents to learn about their authors has become a popular research area in the last few years. In this context, author profiling is a research area that many researchers have concentrated on, with the aim of determining authors' demographic profiles, such as age, gender, and location, by examining their writing style. Researchers have proposed various types of stylistic features to analyze authors' writing styles. In this paper, the experiment was performed with a combination of syntactic features and content-based features. Various machine learning classifiers were used to evaluate the performance of gender prediction on a reviews dataset. The proposed method achieved the best accuracy for profile prediction in author profiling.
Pages: 265-273
Page count: 9