Text-based approaches for non-topical image categorization

被引:8
|
作者
Sable C.L. [1 ]
Hatzivassiloglou V. [1 ]
机构
[1] Department of Computer Science, Columbia University, New York, NY 10027, 450 Computer Science Building
关键词
Evaluation in the presence of uncertainty; High-level image features; Image categorization; Probabilistic TF*IDF; Text similarity features;
D O I
10.1007/s007990000038
中图分类号
学科分类号
摘要
The rapid expansion of multimedia digital collections brings to the fore the need for classifying not only text documents but their embedded non-textual parts as well. We propose a model for basing classification of multimedia on broad, non-topical features, and show how information on targeted nearby pieces of text can be used to effectively classify photographs on a first such feature, distinguishing between indoor and outdoor images. We examine several variations to a TF*IDFbased approach for this task, empirically analyze their effects, and evaluate our system on a large collection of images from current news newsgroups. In addition, we investigate alternative classification and evaluation methods, and the effects that secondary features have on indoor/outdoor classification. Using density estimation over the raw TF*IDF values, we obtain a classification accuracy of 82%, a number that outperforms baseline estimates and earlier, image-based approaches, at least in the domain of news articles, and that nears the accuracy of humans who perform the same task with access to comparable information. © Springer-Verlag 2000.
引用
收藏
页码:261 / 275
页数:14
相关论文
共 50 条
  • [1] Text-based approaches for the categorization of images
    Sable, CL
    Hatzivassiloglou, V
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 1999, 1696 : 19 - 38
  • [2] Topical and Non-Topical Approaches to Measure Similarity between Arabic Questions
    Daoud, Mohammad
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (03)
  • [3] Integration of manual and automatic text categorization. A categorization workbench for text-based email and spam
    Sun, Q
    Schommer, C
    Lang, A
    KI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3238 : 156 - 167
  • [4] Text-Based Image Segmentation Methodology
    Mehul, Gupta
    Ankita, Patel
    Namrata, Dave
    Rahul, Goradia
    Sheth, Saurin
    2ND INTERNATIONAL CONFERENCE ON INNOVATIONS IN AUTOMATION AND MECHATRONICS ENGINEERING, ICIAME 2014, 2014, 14 : 465 - 472
  • [5] Text-based Sequential Image Generation
    Efimova, Valeria
    Filchenkov, Andrey
    FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
  • [6] Image Sense Classification in Text-Based Image Retrieval
    Chang, Yih-Chen
    Chen, Hsin-Hsi
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 124 - 135
  • [7] Hierarchical approaches to Text-based Offense Classification
    Choi, Jay
    Kilmer, David
    Mueller-Smith, Michael
    Taheri, Sema A.
    SCIENCE ADVANCES, 2023, 9 (09)
  • [8] Non-Topical Classification of Healthcare Information on the Web
    Hirokawa, Sachio
    Ishita, Emi
    SMART DIGITAL FUTURES 2014, 2014, 262 : 237 - 247
  • [9] Text-based Image Style Transfer and Synthesis
    He, Yifan
    Li, Jian
    Zhu, Anna
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 8TH INTERNATIONAL WORKSHOP ON CAMERA-BASED DOCUMENT ANALYSIS AND RECOGNITION, VOL 4, 2019, : 43 - 48
  • [10] Image Captioning with Text-Based Visual Attention
    Chen He
    Haifeng Hu
    Neural Processing Letters, 2019, 49 : 177 - 185