Reducing the user labeling effort in effective high recall tasks by fine-tuning active learning

被引：3

作者：

Dal Bianco, Guilherme ^{[1
]}

Duarte, Denio ^{[1
]}

Goncalves, Marcos Andre ^{[2
]}

机构：

[1] Univ Fed Fronteira Sul, Campus Chapeco, Chapeco, Brazil

[2] Univ Fed Minas Gerais, Dept Ciencia Comp, Belo Horizonte, Brazil

来源：

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS | 2023年 / 61卷 / 02期

关键词：

Information retrieval; Hire; Active learning; SSAR; Labeling process; Supervised classifier; SELECTION;

D O I：

10.1007/s10844-022-00772-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

High recall Information REtrieval (HIRE) aims at identifying only and (almost) all relevant documents for a given query. HIRE is paramount in applications such as systematic literature review, medicine, legal jurisprudence, among others. To address the HIRE goals, active learning methods have proven valuable in determining informative and non-redundant documents to reduce user effort for manual labeling. We propose a new active learning framework for the HIRE task. REVEAL-HIRE selects a very reduced set of documents to be labeled, significantly mitigating the user's effort. The proposed approach selects the most representative documents by exploiting a novel, specifically designed active learning strategy for HIRE, called REVEAL (RelEVant rulE-based Active Learning). REVEAL aims at selecting the maximum number of relevant documents for a given query based on discriminative rule-based patterns and a penalization factor. The method is applied to the top-ranked documents to choose the most informative ones to be labeled, a hard task due to data skewness - most documents are irrelevant for a given query. The enhanced active learning process is repeated incrementally until a stopping point is achieved, using REVEAL to identify the point in the process when relevant documents should stop to be sampled. Experimental results in several standard benchmark datasets (e.g. 20-Newsgroups, Trec Total Recall, and CLEF eHealth) demonstrate that REVEAL-HIRE can reduce the user labeling effort up to 3 times (320% of reduction) in comparison with state-of-the-art baselines while keeping the effectiveness at the highest levels.

引用

页码：453 / 472

页数：20

共 26 条

[21] Fine-Tuning Active Layer Morphology via Modification of Both Side Chains and Terminal Groups toward High-Performance Organic Solar Cells
Huang, Jinfeng
Gao, Cai-Yan
Fan, Xin-Heng
Zhu, Xiaozhang
Yang, Lian-Ming
ENERGY TECHNOLOGY, 2022, 10 (02)
[22] Fine-Tuning Alkyl Chains on Quinoxaline Nonfullerene Acceptors Enables High-Efficiency Ternary Organic Solar Cells with Optimizing Molecular Stacking and Reducing Energy Loss
Guo, Yuntong
Chen, Zhenyu
Ge, Jinfeng
Zhu, Jintao
Zhang, Jinna
Meng, Yuanyuan
Ye, Qinrui
Wang, Shijie
Chen, Fei
Ma, Wei
Ge, Ziyi
ADVANCED FUNCTIONAL MATERIALS, 2023, 33 (47)
[23] Fine-tuning convolutional neural network with transfer learning for semantic segmentation of ground-level oilseed rape images in a field with high weed pressure
Abdalla, Alwaseela
Cen, Haiyan
Wan, Liang
Rashid, Reem
Weng, Haiyong
Zhou, Weijun
He, Yong
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2019, 167
[24] Fine-Tuning Graph Neural Networks via Active Learning: Unlocking the Potential of Graph Neural Networks Trained on Nonaqueous Systems for Aqueous CO2 Reduction
Jiao, Zihao
Mao, Yu
Lu, Ruihu
Liu, Ya
Guo, Liejin
Wang, Ziyun
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2025, 21 (06) : 3176 - 3186
[25] Electron-rich active sites created by fine-tuning the electronic structure of Co(II) porphyrin frameworks for high-performance CO2 electroreduction
He, Qizhe
Huang, Shengsheng
Li, Hongwei
Li, Ting -Ting
INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2024, 51 : 1347 - 1356
[26] Domain Adaptation and Fine-Tuning of a Deep Learning Segmentation Model of Small Agricultural Burn Area Detection Using High-Resolution Sentinel-2 Observations: A Case Study of Punjab, India
Anand, Anamika
Imasu, Ryoichi
Dhaka, Surendra K.
Patra, Prabir K.
REMOTE SENSING, 2025, 17 (06)

← 1 2 3 →