Query sampler: generating query sets for analyzing search engines using keyword research tools

被引:2
|
作者
Schultheiss, Sebastian [1 ]
Lewandowski, Dirk [1 ,2 ]
von Mach, Sonja [1 ]
Yagci, Nurce [1 ]
机构
[1] Hamburg Univ Appl Sci, Dept Informat, Hamburg, Germany
[2] Univ Duisburg Essen, Dept Comp Sci & Appl Cognit Sci, Duisburg, Germany
关键词
Search engines; Queries; Query set; Keyword research tool; INFORMATION; INTERNET; WEB;
D O I
10.7717/peerj-cs.1421
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Search engine queries are the starting point for studies in different fields, such as health or political science. These studies usually aim to make statements about social phenomena. However, the queries used in the studies are often created rather unsystematically and do not correspond to actual user behavior. Therefore, the evidential value of the studies must be questioned. We address this problem by developing an approach (query sampler) to sample queries from commercial search engines, using keyword research tools designed to support search engine marketing. This allows us to generate large numbers of queries related to a given topic and derive information on how often each keyword is searched for, that is, the query volume. We empirically test our approach with queries from two published studies, and the results show that the number of queries and total search volume could be considerably expanded. Our approach has a wide range of applications for studies that seek to draw conclusions about social phenomena using search engine queries. The approach can be applied flexibly to different topics and is relatively straightforward to implement, as we provide the code for querying Google Ads API. Limitations are that the approach needs to be tested with a broader range of topics and thoroughly checked for problems with topic drift and the role of close variants provided by keyword research tools.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Query recommendation using query logs in search engines
    BaezaYates, R
    Hurtado, C
    Mendoza, M
    CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2004 WORKSHOPS, PROCEEDINGS, 2004, 3268 : 588 - 596
  • [2] Query recommendation using query logs in search engines
    Baeza-Yates, Ricardo
    Hurtado, Carlos
    Mendoza, Marcelo
    De Chile, Universidad
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3268 : 588 - 596
  • [3] Comparative performance evaluation of keyword and semantic search engines using different query set categories
    Jatwani P.
    Tomar P.
    Dhingra V.
    Recent Advances in Computer Science and Communications, 2020, 13 (05) : 1057 - 1070
  • [4] Combining Query Translation with Query Answering for Efficient Keyword Search
    Ladwig, Guenter
    Tran, Thanh
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PT 2, PROCEEDINGS, 2010, 6089 : 288 - 303
  • [5] RDF Keyword Search by Query Computation
    Ma, Zongmin
    Lin, Xiaoqing
    Yan, Li
    Zhao, Zhen
    JOURNAL OF DATABASE MANAGEMENT, 2018, 29 (04) : 1 - 27
  • [6] Improving the Effectiveness of Keyword Search in Databases Using Query Logs
    Zhou, Jing
    Liu, Yang
    Yu, Ziqiang
    WEB-AGE INFORMATION MANAGEMENT (WAIM 2015), 2015, 9098 : 193 - 206
  • [7] Improving the effectiveness of keyword search in databases using query logs
    Yu, Ziqiang
    Abraham, Ajith
    Yu, Xiaohui
    Liu, Yang
    Zhou, Jing
    Ma, Kun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 81 : 169 - 179
  • [8] Query Reformulation Using Ontology and Keyword for Durian Web Search
    Azizan, Azilawati
    Abu Bakar, Zainab
    Noah, Shahrul Azman
    2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP), 2016, : 94 - 100
  • [9] Improving search engines by query clustering
    Baeza-Yates, Ricardo
    Hurtado, Carlos
    Mendoza, Marcelo
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2007, 58 (12): : 1793 - 1804
  • [10] Integrating keyword search into XML query processing
    Florescu, D
    Kossmann, D
    Manolescu, I
    COMPUTER NETWORKS, 2000, 33 (1-6) : 119 - 135