Multi-objective Topic Modeling for Exploratory Search in Tech News

被引:5
|
作者
Ianina, Anastasia [1 ]
Golitsyn, Lev [2 ]
Vorontsov, Konstantin [1 ]
机构
[1] Moscow Inst Phys & Technol, Moscow, Russia
[2] Integrated Syst, Moscow, Russia
关键词
Information retrieval; Exploratory search; Relevance feedback; Topic modeling; Additive regularization for topic modeling; ARTM; BigARTM; ADDITIVE REGULARIZATION;
D O I
10.1007/978-3-319-71746-3_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploratory search is a paradigm of information retrieval, in which the user's intention is to learn the subject domain better. To do this the user repeats "query-browse-refine" interactions with the search engine many times. We consider typical exploratory search tasks formulated by long text queries. People usually solve such a task in about half an hour and find dozens of documents using conventional search facilities iteratively. The goal of this paper is to reduce the time-consuming multi-step process to one step without impairing the quality of the search. Probabilistic topic modeling is a suitable text mining technique to retrieve documents, which are semantically relevant to a long text query. We use the additive regularization of topic models (ARTM) to build a model that meets multiple objectives. The model should have sparse, diverse and interpretable topics. Also, it should incorporate meta-data and multimodal data such as n-grams, authors, tags and categories. Balancing the regularization criteria is an important issue for ARTM. We tackle this problem with coordinate-wise optimization technique, which chooses the regularization trajectory automatically. We use the parallel online implementation of ARTM from the open source library BigARTM. Our evaluation technique is based on crowdsourcing and includes two tasks for assessors: the manual exploratory search and the explicit relevance feedback. Experiments on two popular tech news media show that our topic-based exploratory search outperforms assessors as well as simple baselines, achieving precision and recall of about 85-92%.
引用
收藏
页码:181 / 193
页数:13
相关论文
共 50 条
  • [31] Hybrid multi-objective cuckoo search with dynamical local search
    Maoqing Zhang
    Hui Wang
    Zhihua Cui
    Jinjun Chen
    Memetic Computing, 2018, 10 : 199 - 208
  • [32] Multi-Objective Stochastic Fractal Search: a powerful algorithm for solving complex multi-objective optimization problems
    Soheyl Khalilpourazari
    Bahman Naderi
    Saman Khalilpourazary
    Soft Computing, 2020, 24 : 3037 - 3066
  • [33] Multi-objective interior search algorithm for optimization: A new multi-objective meta-heuristic algorithm
    Torabi, Navid
    Tavakkoli-Moghaddam, Reza
    Najafi, Esmaiel
    Lotfi, Farhad Hosseinzadeh
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (03) : 3307 - 3319
  • [34] Multi-Objective Stochastic Fractal Search: a powerful algorithm for solving complex multi-objective optimization problems
    Khalilpourazari, Soheyl
    Naderi, Bahman
    Khalilpourazary, Saman
    SOFT COMPUTING, 2020, 24 (04) : 3037 - 3066
  • [35] Multi-objective sparrow search algorithm: A novel algorithm for solving complex multi-objective optimisation problems
    Li, Bin
    Wang, Honglei
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 210
  • [36] Multi-Objective Neural Architecture Search by Learning Search Space Partitions
    Zhao, Yiyang
    Wang, Linnan
    Guo, Tian
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [37] An exploratory study of mono and multi-objective metaheuristics to ensemble of classifiers
    Feitosa Neto, Antonino A.
    Canuto, Anne M. P.
    APPLIED INTELLIGENCE, 2018, 48 (02) : 416 - 431
  • [38] Multi-Objective Neighborhood Search Algorithm Based on Decomposition for Multi-Objective Minimum Weighted Vertex Cover Problem
    Hu, Shuli
    Wu, Xiaoli
    Liu, Huan
    Wang, Yiyuan
    Li, Ruizhi
    Yin, Minghao
    SUSTAINABILITY, 2019, 11 (13)
  • [39] An exploratory study of mono and multi-objective metaheuristics to ensemble of classifiers
    Antonino A. Feitosa Neto
    Anne M. P. Canuto
    Applied Intelligence, 2018, 48 : 416 - 431
  • [40] Multi-Objective Ergodic Search for Dynamic Information Maps
    Rao, Ananya
    Breitfeld, Abigail
    Candela, Alberto
    Jensen, Benjamin
    Wettergreen, David
    Choset, Howie
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10197 - 10204