Website Classification Using Latent Dirichlet Allocation and its Application for Internet Advertising

被引:0
|
作者
Katsumata, Sotaro [1 ]
Motohashi, Eiji [2 ]
Nishimoto, Akihiro [3 ]
Toyosawa, Eiji [4 ]
机构
[1] Osaka Univ, Graduage Sch Econ, 1-7 Machikaneyama, Toyonaka, Osaka 5650043, Japan
[2] Yokohama Natl Univ, Grad Sch Int Social Sci, Hodogaya Ku, 79-4 Tokiwadai, Yokohama, Kanagawa 2408501, Japan
[3] Kwansei Gakuin Univ, Sch Business Adm, 1-155 Uegahara Ichiban Cho, Nishinomiya, Hyogo 6628501, Japan
[4] F N Commun Inc, Shibuya Ku, Aoyama Diamond Bldg,Recept 2nd Floor, Tokyo 1500002, Japan
关键词
D O I
10.1109/ICDMW.2016.141
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study proposes a model for website classification using website content, and discusses applications for internet advertising (ad) strategies. Internet ad agencies have many ad-spaces embedded in many websites and can choose where to place advertisements. Therefore, ad agencies have to know the properties and topics of each website in order to optimize advertising submission strategy. However, since website content is in natural languages, they have to convert these qualitative sentences into quantitative data if they want to classify websites using statistical models. To address this issue, this study applies statistical analysis to website information written in natural languages. We apply a dictionary of neologisms in order to decompose website sentences into words and create a dataset of {0, 1} indicator matrices to classify the websites. From the dataset, we estimate the topics of each website using latent Dirichlet allocation. Finally, we discuss how to apply the results obtained to optimize ad strategies.
引用
收藏
页码:538 / 544
页数:7
相关论文
共 50 条
  • [21] A New Latent generalized Dirichlet Allocation Model for Image Classification
    Ihou, Koffi Eddy
    Bouguila, Nizar
    PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
  • [22] Classification of Indonesian News Articles based on Latent Dirichlet Allocation
    Kusumaningrum, Retno
    Adhy, Satriyo
    Wiedjayanto, M. Ihsan Aji
    Suryono
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2016,
  • [23] Topic-Based User Segmentation for Online Advertising with Latent Dirichlet Allocation
    Tu, Songgao
    Lu, Chaojun
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 259 - 269
  • [24] The Contents-Based Website Classification for the Internet Advertising Planning: An Empirical Application of the Natural Language Analysis
    Sotaro Katsumata
    Eiji Motohashi
    Akihiro Nishimoto
    Eiji Toyosawa
    The Review of Socionetwork Strategies, 2017, 11 (2) : 129 - 142
  • [25] Comparing Hierarchical Dirichlet Process with Latent Dirichlet Allocation in Bug Report Multiclass Classification
    Limsettho, Nachai
    Hata, Hideaki
    Matsumoto, Ken-ichi
    2014 15TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2014, : 137 - 142
  • [26] A PERCEPTUAL HASHING ALGORITHM USING LATENT DIRICHLET ALLOCATION
    Vretos, Nicholas
    Nikolaidis, Nikos
    Pitas, Ioannis
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 362 - 365
  • [27] Using Latent Dirichlet Allocation for Automatic Categorization of Software
    Tian, Kai
    Revelle, Meghan
    Poshyvanyk, Denys
    2009 6TH IEEE INTERNATIONAL WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES, 2009, : 163 - 166
  • [28] Topic Modeling Using Latent Dirichlet allocation: A Survey
    Chauhan, Uttam
    Shah, Apurva
    ACM COMPUTING SURVEYS, 2021, 54 (07)
  • [29] A Review of Cyberattack Research using Latent Dirichlet Allocation
    Xiao, Ming
    Dhillon, Gurpreet
    Smith, Kane J.
    28th Americas Conference on Information Systems, AMCIS 2022, 2022,
  • [30] Unsupervised Language Filtering using the Latent Dirichlet Allocation
    Zhang, Wei
    Clark, Robert A. J.
    Wang, Yongyuan
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1268 - 1272