Website Classification Using Latent Dirichlet Allocation and its Application for Internet Advertising

被引:0
|
作者
Katsumata, Sotaro [1 ]
Motohashi, Eiji [2 ]
Nishimoto, Akihiro [3 ]
Toyosawa, Eiji [4 ]
机构
[1] Osaka Univ, Graduage Sch Econ, 1-7 Machikaneyama, Toyonaka, Osaka 5650043, Japan
[2] Yokohama Natl Univ, Grad Sch Int Social Sci, Hodogaya Ku, 79-4 Tokiwadai, Yokohama, Kanagawa 2408501, Japan
[3] Kwansei Gakuin Univ, Sch Business Adm, 1-155 Uegahara Ichiban Cho, Nishinomiya, Hyogo 6628501, Japan
[4] F N Commun Inc, Shibuya Ku, Aoyama Diamond Bldg,Recept 2nd Floor, Tokyo 1500002, Japan
关键词
D O I
10.1109/ICDMW.2016.141
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study proposes a model for website classification using website content, and discusses applications for internet advertising (ad) strategies. Internet ad agencies have many ad-spaces embedded in many websites and can choose where to place advertisements. Therefore, ad agencies have to know the properties and topics of each website in order to optimize advertising submission strategy. However, since website content is in natural languages, they have to convert these qualitative sentences into quantitative data if they want to classify websites using statistical models. To address this issue, this study applies statistical analysis to website information written in natural languages. We apply a dictionary of neologisms in order to decompose website sentences into words and create a dataset of {0, 1} indicator matrices to classify the websites. From the dataset, we estimate the topics of each website using latent Dirichlet allocation. Finally, we discuss how to apply the results obtained to optimize ad strategies.
引用
收藏
页码:538 / 544
页数:7
相关论文
共 50 条
  • [1] Semi-Supervised Latent Dirichlet Allocation and its Application for Document Classification
    Wang, Di
    Thint, Marcus
    Al-Rubaie, Ahmad
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, : 306 - 310
  • [2] Latent Dirichlet Allocation for Classification using Gene Expression Data
    Yalamanchili, Hima Bindu
    Kho, Soon Jye
    Raymer, Michael L.
    2017 IEEE 17TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2017, : 39 - 44
  • [3] Latent Dirichlet Allocation for Internet Price War
    Li, Chenchen
    Yan, Xiang
    Deng, Xiaotie
    Qi, Yuan
    Chu, Wei
    Song, Le
    Qiao, Junlong
    He, Jianshan
    Xiong, Junwu
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 639 - 646
  • [4] Feature Substitution Using Latent Dirichlet Allocation for Text Classification
    Mathivanan, Norsyela Muhammad Noor
    Janor, Roziah Mohd
    Abd Razak, Shukor
    Ghani, Nor Azura Md.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 1087 - 1098
  • [5] Latent Dirichlet Allocation Models for Image Classification
    Rasiwasia, Nikhil
    Vasconcelos, Nuno
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) : 2665 - 2679
  • [6] Latent Dirichlet Allocation Based Multilevel Classification
    Bhutada, Sunil
    Balaram, V. V. S. S. S.
    Bulusu, Vishnu Vardhan
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 1020 - 1024
  • [7] Inference Algorithms in Latent Dirichlet Allocation for Semantic Classification
    Zubir, Wan Mohammad Aflah Mohammad
    Aziz, Izzatdin Abdul
    Jaafar, Jafreezal
    Hasan, Mohd Hilmi
    APPLIED COMPUTATIONAL INTELLIGENCE AND MATHEMATICAL METHODS: COMPUTATIONAL METHODS IN SYSTEMS AND SOFTWARE 2017, VOL. 2, 2018, 662 : 173 - 184
  • [8] THE VARIANT OF LATENT DIRICHLET ALLOCATION FOR NATURAL SCENE CLASSIFICATION
    Tang Yingjun
    COMPUTING AND INFORMATICS, 2011, 30 (02) : 311 - 319
  • [9] A Hybrid Latent Dirichlet Allocation Approach for Topic Classification
    Hsu, Chi-I
    Chiu, Chaochang
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 312 - 315