Improving automatic query classification via semi-supervised learning

被引:31
|
作者
Beitzel, SM
Jensen, EC
Frieder, O
Lewis, DD
Chowdhury, A
Kolcz, A
机构
关键词
D O I
10.1109/ICDM.2005.80
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate topical classification of user queries allows for increased effectiveness and efficiency in general-purpose web search systems. Such classification becomes critical if the system is to return results not just from a general web collection but from topic-specific back-end databases as well. Maintaining sufficient classification recall is very difficult as web queries are typically short, yielding few features per query. This feature sparseness coupled with the high query volumes typical for a large-scale search service makes manual and supervised learning approaches alone insufficient. We use an application of computational linguistics to develop an approach for mining the vast amount of unlabeled data in web query logs to improve automatic topical web query classification. We show that our approach in combination with manual matching and supervised learning allows its to classify a substantially larger proportion of queries than any single technique. We examine the performance of each approach on a real web query stream and show that our combined method accurately classifies 46% of queries, out performing the recall of best single approach by nearly 20% with a 7% improvement in overall effectiveness.
引用
收藏
页码:42 / 49
页数:8
相关论文
共 50 条
  • [21] Semi-Supervised Classification via Hypergraph Convolutional Extreme Learning Machine
    Liu, Zhewei
    Zhang, Zijia
    Cai, Yaoming
    Miao, Yilin
    Chen, Zhikun
    APPLIED SCIENCES-BASEL, 2021, 11 (09):
  • [22] DisenSemi: Semi-Supervised Graph Classification via Disentangled Representation Learning
    Wang, Yifan
    Luo, Xiao
    Chen, Chong
    Hua, Xian-Sheng
    Zhang, Ming
    Ju, Wei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [23] Scalable Semi-Supervised Query Classification Using Matrix Sketching
    Kim, Young-Bum
    Stratos, Karl
    Sarikaya, Ruhi
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 8 - 13
  • [24] A semi-supervised deep-learning approach for automatic crystal structure classification
    Lolla, Satvik
    Liang, Haotong
    Kusne, A. Gilad
    Takeuchi, Ichiro
    Ratcliff, William
    JOURNAL OF APPLIED CRYSTALLOGRAPHY, 2022, 55 : 882 - 889
  • [25] Improving Landmark Localization with Semi-Supervised Learning
    Honari, Sina
    Molchanov, Pavlo
    Tyree, Stephen
    Vincent, Pascal
    Pal, Christopher
    Kautz, Jan
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1546 - 1555
  • [26] Improving Multi-class Classification for Endomicroscopic Images by Semi-supervised Learning
    Wu, Hang
    Tong, Li
    Wang, May D.
    2017 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2017, : 5 - 8
  • [27] Image Classification via Semi-Supervised pLSA
    Zhuang, Liansheng
    She, Lanbo
    Jiang, Yuning
    Tang, Ketan
    Yu, Nenghai
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS (ICIG 2009), 2009, : 205 - 208
  • [28] A review of semi-supervised learning for text classification
    José Marcio Duarte
    Lilian Berton
    Artificial Intelligence Review, 2023, 56 : 9401 - 9469
  • [29] A Semi-Supervised Learning Algorithm for Data Classification
    Kuo, Cheng-Chien
    Shieh, Horng-Lin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (05)
  • [30] Semi-supervised tensor learning for image classification
    Zhang, Jianguang
    Han, Yahong
    Jiang, Jianmin
    MULTIMEDIA SYSTEMS, 2017, 23 (01) : 63 - 73