Query sampler: generating query sets for analyzing search engines using keyword research tools

被引:2
|
作者
Schultheiss, Sebastian [1 ]
Lewandowski, Dirk [1 ,2 ]
von Mach, Sonja [1 ]
Yagci, Nurce [1 ]
机构
[1] Hamburg Univ Appl Sci, Dept Informat, Hamburg, Germany
[2] Univ Duisburg Essen, Dept Comp Sci & Appl Cognit Sci, Duisburg, Germany
关键词
Search engines; Queries; Query set; Keyword research tool; INFORMATION; INTERNET; WEB;
D O I
10.7717/peerj-cs.1421
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Search engine queries are the starting point for studies in different fields, such as health or political science. These studies usually aim to make statements about social phenomena. However, the queries used in the studies are often created rather unsystematically and do not correspond to actual user behavior. Therefore, the evidential value of the studies must be questioned. We address this problem by developing an approach (query sampler) to sample queries from commercial search engines, using keyword research tools designed to support search engine marketing. This allows us to generate large numbers of queries related to a given topic and derive information on how often each keyword is searched for, that is, the query volume. We empirically test our approach with queries from two published studies, and the results show that the number of queries and total search volume could be considerably expanded. Our approach has a wide range of applications for studies that seek to draw conclusions about social phenomena using search engine queries. The approach can be applied flexibly to different topics and is relatively straightforward to implement, as we provide the code for querying Google Ads API. Limitations are that the approach needs to be tested with a broader range of topics and thoroughly checked for problems with topic drift and the role of close variants provided by keyword research tools.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Using query reformulation to compare learning behaviors in Web search engines
    Tibau, Marcelo
    Siqueira, Sean W. M.
    Nunes, Bernardo Pereira
    Nurmikko-Fuller, Terhi
    Manrique, Ruben Francisco
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2019), 2019, : 219 - 223
  • [32] Intent Identification by Semantically Analyzing the Search Query
    Sultana, Tangina
    Mandal, Ashis Kumar
    Saha, Hasi
    Sultan, Md. Nahid
    Hossain, Md. Delowar
    MODELLING, 2024, 5 (01): : 292 - 314
  • [33] Research on Query Results Cache Based on Log Analysis in Web Search Engines
    Ma, Hongyuan
    2013 3RD INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, COMMUNICATIONS AND NETWORKS (CECNET), 2013, : 551 - 554
  • [34] RESEARCH ON THE SECOND-LEVEL QUERY RESULTS CACHE IN WEB SEARCH ENGINES
    Ma, Hongyuan
    Zhang, Xuesong
    Wang, Bin
    2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 619 - 623
  • [35] Query Processing Using Negative and Temporal Tuples in Stream Query Engines
    Gorawski, Marcin
    Chroszcz, Aleksander
    ADVANCES IN SOFTWARE ENGINEERING TECHNIQUES, 2012, 7054 : 70 - 83
  • [36] Generating Distributed Query Plans Using Modified Cuckoo Search Algorithm
    Kumar, T. V. Vijay
    Yadav, Monika
    PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2016), VOL 1, 2017, 546 : 128 - 140
  • [37] Location-aware query reformulation for search engines
    Zhipeng Huang
    Yuqiu Qian
    Nikos Mamoulis
    GeoInformatica, 2018, 22 : 869 - 893
  • [38] Efficient query subscription processing for prospective search engines
    Irmak, Utku
    Mihaylov, Svilen
    Suel, Torsten
    Ganguly, Samrat
    Izmailov, Rauf
    USENIX ASSOCIATION PROCEEDINGS OF THE 2006 USENIX ANNUAL TECHNICAL CONFERENCE, 2006, : 375 - 380
  • [39] Space Efficient Caching of Query Results in Search Engines
    Ozcan, Rifat
    Altingovde, Ismail Sengor
    Ulusoy, Oezguer
    23RD INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2008, : 558 - 563
  • [40] Location-aware query reformulation for search engines
    Huang, Zhipeng
    Qian, Yuqiu
    Mamoulis, Nikos
    GEOINFORMATICA, 2018, 22 (04) : 869 - 893