Detecting the modality of a medical image using visual and textual features

被引:1
|
作者
Miranda, Diana [1 ]
Thenkanidiyoor, Veena [1 ]
Dinesh, Dileep Aroor [2 ]
机构
[1] Natl Inst Technol Goa, Goa, India
[2] Indian Inst Technol Mandi, Mandi, India
关键词
Modality classification; Biomedical word2vec models; Convolutional neural networks; CLASSIFICATION; SCALE; RETRIEVAL; KERNEL; FUSION; COLOR;
D O I
10.1016/j.bspc.2022.104035
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Knowing the modality of a medical image is crucial in understanding the characteristics of the image. Therefore, it is important to classify medical images as per their modality. The image and its accompanying text caption contain information that could help in identifying the modality of a given medical image. This work proposes an approach for medical image modality classification using visual and textual features. The proposed approach uses convolutional neural networks to extract visual features from a medical image. Word embeddings obtained from biomedical word2vec models are used to generate textual features from the image captions. Support vector machine based classifiers are then used to classify medical images using these features. We propose to use the late fusion approach to combine visual and textual features. The proposed approach performs better than the state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] MIRACLE at ImageCLEFmed 2007: Merging Textual and Visual Strategies to Improve Medical Image Retrieval
    Villena-Roman, Julio
    Lana-Serrano, Sara
    Carlos Gonzalez-Cristobal, Jose
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 593 - +
  • [42] Clustering presentation of web image retrieval results using textual information and image features
    Uematsu, Y
    Kataoka, R
    Takeno, H
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON INTERNET AND MULTIMEDIA SYSTEMS AND APPLICATIONS, 2006, : 217 - +
  • [43] Late fusion of deep learning and handcrafted visual features for biomedical image modality classification
    Lee, Sheng Long
    Zare, Mohammad Reza
    Muller, Henning
    IET IMAGE PROCESSING, 2019, 13 (02) : 382 - 391
  • [44] Genetic Approach Based Image Retrieval by Using CCM and Textual Features
    Shrivas, Parul
    Lilhore, Umesh
    Agrawal, Nitin
    2017 6TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO), 2017, : 364 - 369
  • [45] Modality matters: Testing bilingual irony comprehension in the textual, auditory, and audio-visual modality
    Bromberek-Dyzman, Katarzyna
    Jankowiak, Katarzyna
    Chelminiak, Pawel
    JOURNAL OF PRAGMATICS, 2021, 180 : 219 - 231
  • [46] Fusion of Textural and Visual Information for Medical Image Modality Retrieval Using Deep Learning-Based Feature Engineering
    Iqbal, Saeed
    Qureshi, Adnan N.
    Alhussein, Musaed
    Choudhry, Imran Arshad
    Aurangzeb, Khursheed
    Khan, Tariq M.
    IEEE ACCESS, 2023, 11 : 93238 - 93253
  • [47] Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
    Mafla, Andres
    Dey, Sounak
    Biten, Ali Furkan
    Gomez, Lluis
    Karatzas, Dimosthenis
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2939 - 2948
  • [48] Predicting Sentiment toward Transportation in Social Media using Visual and Textual Features
    Giancristofaro, Gabriel T.
    Panangadan, Anand
    2016 IEEE 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2016, : 2113 - 2118
  • [49] Twitter Photo Geo-Localization Using Both Textual and Visual Features
    Matsuo, Shin
    Shimoda, Wataru
    Yanai, Keiji
    2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 22 - 25
  • [50] Integrating visual and textual cues for image classification
    Gevers, T
    Aldershoff, F
    Geusebroek, JM
    ADVANCES IN VISUAL INFORMATION SYSTEMS, PROCEEDINGS, 2000, 1929 : 419 - 429