Text-based approaches for non-topical image categorization

被引:8
|
作者
Sable C.L. [1 ]
Hatzivassiloglou V. [1 ]
机构
[1] Department of Computer Science, Columbia University, New York, NY 10027, 450 Computer Science Building
关键词
Evaluation in the presence of uncertainty; High-level image features; Image categorization; Probabilistic TF*IDF; Text similarity features;
D O I
10.1007/s007990000038
中图分类号
学科分类号
摘要
The rapid expansion of multimedia digital collections brings to the fore the need for classifying not only text documents but their embedded non-textual parts as well. We propose a model for basing classification of multimedia on broad, non-topical features, and show how information on targeted nearby pieces of text can be used to effectively classify photographs on a first such feature, distinguishing between indoor and outdoor images. We examine several variations to a TF*IDFbased approach for this task, empirically analyze their effects, and evaluate our system on a large collection of images from current news newsgroups. In addition, we investigate alternative classification and evaluation methods, and the effects that secondary features have on indoor/outdoor classification. Using density estimation over the raw TF*IDF values, we obtain a classification accuracy of 82%, a number that outperforms baseline estimates and earlier, image-based approaches, at least in the domain of news articles, and that nears the accuracy of humans who perform the same task with access to comparable information. © Springer-Verlag 2000.
引用
收藏
页码:261 / 275
页数:14
相关论文
共 50 条
  • [31] Old fashion text-based image retrieval using FCA
    Ahmad, I
    Jang, TS
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 33 - 36
  • [32] Text-Based and Content-Based Image Retrieval on Flickr: DEMO
    Manuel Barrios, Juan
    Diaz-Espinoza, Diego
    Bustos, Benjamin
    SISAP 2009: 2009 SECOND INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2009, : 156 - 157
  • [33] Spam detection proposal in regular and text-based image emails
    Issac, Biju
    Raman, Valliappan
    TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 1624 - +
  • [34] Exploiting Visual Concepts to Improve Text-Based Image Retrieval
    Tollari, Sabrina
    Detyniecki, Marcin
    Marsala, Christophe
    Fakeri-Tabrizi, Ali
    Amini, Massih-Reza
    Gallinari, Patrick
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 701 - 705
  • [35] Imagic: Text-Based Real Image Editing with Diffusion Models
    Kawar, Bahjat
    Zada, Shiran
    Lang, Oran
    Tov, Omer
    Chang, Huiwen
    Dekel, Tali
    Mosseri, Inbar
    Irani, Michal
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6007 - 6017
  • [36] Document Expansion for Text-Based Image Retrieval at CLEF 2009
    Min, Jinming
    Wilkins, Peter
    Leveling, Johannes
    Jones, Gareth J. F.
    MULTILINGUAL INFORMATION ACCESS EVALUATION II: MULTIMEDIA EXPERIMENTS, PT II, 2010, 6242 : 172 - 176
  • [37] Detected text-based image retrieval approach for textual images
    Unar, Salahuddin
    Wang, Xingyuan
    Zhang, Chuan
    Wang, Chunpeng
    IET IMAGE PROCESSING, 2019, 13 (03) : 515 - 521
  • [38] Text-based informatics
    Valdes-Perez, RE
    SCIENTIST, 1998, 12 (14): : 10 - 10
  • [39] Machine learning based approaches to text categorization with resampling methods
    Jo, T
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 93 - 98
  • [40] Comparing text-based and dependence-based approaches for determining the origins of bugs
    Davies, Steven
    Roper, Marc
    Wood, Murray
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2014, 26 (01) : 107 - 139