Multimodal concept fusion using semantic closeness for image concept disambiguation

Authors
Ahmad Adel Abu-Shareha
Rajeswari Mandava
Latifur Khan
Dhanesh Ramachandram
Affiliations
[1] Universiti Sains Malaysia, School of Computer Science
[2] University of Texas at Dallas, Department of Computer Science
Keywords
Disambiguation; Multi-modal data; Ontology; Path length; Semantic closeness
DOI
Not available
Abstract
In this paper we show how to resolve the ambiguity of concepts extracted from the visual stream with the help of concepts identified in the associated textual stream. Disambiguation is performed at the concept level, based on semantic closeness over the domain ontology. Semantic closeness is a function of the distance in the ontology between the concept to be disambiguated and selected associated concepts. In this process, an image concept can be disambiguated using any associated concept from the image, the text, or both. The ability of text concepts to resolve ambiguity in image concepts varies. The best case occurs when the same concept is stated clearly in both image and text; the worst case occurs when the image concept is isolated, with no semantically close text concept. The domain ontology used in the disambiguation process is constructed from WordNet and the image labels with selected senses. The improved accuracy shown in the results demonstrates the effectiveness of the proposed disambiguation process.
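The abstract describes semantic closeness as a function of ontology distance but does not give the function itself. The sketch below is one possible reading, not the authors' method: it assumes closeness is 1 / (1 + shortest path length), assumes a candidate sense is scored by summing its closeness to every associated concept, and uses a small hypothetical ontology in place of the WordNet-derived one. All names (`EDGES`, `path_length`, `closeness`, `disambiguate`) are illustrative.

```python
from collections import deque

# Hypothetical toy ontology (undirected is-a links); the paper builds its
# ontology from WordNet and image labels, which is not reproduced here.
EDGES = [
    ("entity", "animal"),
    ("entity", "artifact"),
    ("animal", "bird"),
    ("bird", "crane_bird"),
    ("artifact", "machine"),
    ("machine", "crane_machine"),
    ("artifact", "building"),
]


def build_graph(edges):
    """Return an adjacency map {concept: set of neighbouring concepts}."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def path_length(graph, start, goal):
    """Shortest number of edges between two concepts, or None if unreachable."""
    if start not in graph or goal not in graph:
        return None
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == goal:
            return dist
        for nxt in graph[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None


def closeness(graph, a, b):
    """Assumed closeness: 1 / (1 + path length); 0.0 for isolated concepts."""
    dist = path_length(graph, a, b)
    return 0.0 if dist is None else 1.0 / (1.0 + dist)


def disambiguate(graph, candidates, context):
    """Pick the candidate sense with the highest total closeness to the context."""
    return max(candidates, key=lambda c: sum(closeness(graph, c, t) for t in context))


GRAPH = build_graph(EDGES)
SENSES = ["crane_bird", "crane_machine"]

# Text mentions animals: the bird sense of the image label "crane" wins.
print(disambiguate(GRAPH, SENSES, ["bird", "animal"]))
# Text mentions a building: the machine sense wins.
print(disambiguate(GRAPH, SENSES, ["building"]))
```

Under these assumptions, a concept that appears in both image and text has path length 0 and the maximum closeness of 1, which mirrors the best case described in the abstract, while a concept with no path to any text concept scores 0, which mirrors the isolated worst case.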
Pages: 69-86
Page count: 17