Multi-modal remote perception learning for object sensory data

被引:0
|
作者
Almujally, Nouf Abdullah [1 ]
Rafique, Adnan Ahmed [2 ]
Al Mudawi, Naif [3 ]
Alazeb, Abdulwahab [3 ]
Alonazi, Mohammed [4 ]
Algarni, Asaad [5 ]
Jalal, Ahmad [6 ,7 ]
Liu, Hui [8 ]
机构
[1] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh, Saudi Arabia
[2] Univ Poonch Rawalakot, Dept Comp Sci & IT, Rawalakot, Pakistan
[3] Najran Univ, Coll Comp Sci & Informat Syst, Dept Comp Sci, Najran, Saudi Arabia
[4] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Engn, Al Kharj, Saudi Arabia
[5] Northern Border Univ, Fac Comp & Informat Technol, Dept Comp Sci, Rafha, Saudi Arabia
[6] Air Univ, Fac Comp Sci, Islamabad, Pakistan
[7] Korea Univ, Coll Informat, Dept Comp Sci & Engn, Seoul, South Korea
[8] Univ Bremen, Cognit Syst Lab, Bremen, Germany
来源
关键词
multi-modal; sensory data; objects recognition; visionary sensor; simulation environment multi-modal; simulation environment; RECOGNITION;
D O I
10.3389/fnbot.2024.1427786
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Introduction When it comes to interpreting visual input, intelligent systems make use of contextual scene learning, which significantly improves both resilience and context awareness. The management of enormous amounts of data is a driving force behind the growing interest in computational frameworks, particularly in the context of autonomous cars.Method The purpose of this study is to introduce a novel approach known as Deep Fused Networks (DFN), which improves contextual scene comprehension by merging multi-object detection and semantic analysis.Results To enhance accuracy and comprehension in complex situations, DFN makes use of a combination of deep learning and fusion techniques. With a minimum gain of 6.4% in accuracy for the SUN-RGB-D dataset and 3.6% for the NYU-Dv2 dataset.Discussion Findings demonstrate considerable enhancements in object detection and semantic analysis when compared to the methodologies that are currently being utilized.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Lightweight Multi-modal Representation Learning for RGB Salient Object Detection
    Xiao, Yun
    Huang, Yameng
    Li, Chenglong
    Liu, Lei
    Zhou, Aiwu
    Tang, Jin
    COGNITIVE COMPUTATION, 2023, 15 (06) : 1868 - 1883
  • [22] Lightweight Multi-modal Representation Learning for RGB Salient Object Detection
    Yun Xiao
    Yameng Huang
    Chenglong Li
    Lei Liu
    Aiwu Zhou
    Jin Tang
    Cognitive Computation, 2023, 15 : 1868 - 1883
  • [23] Multi-Modal Object Tracking and Image Fusion With Unsupervised Deep Learning
    LaHaye, Nicholas
    Ott, Jordan
    Garay, Michael J.
    El-Askary, Hesham Mohamed
    Linstead, Erik
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (08) : 3056 - 3066
  • [24] Learning Adaptive Fusion Bank for Multi-Modal Salient Object Detection
    Wang, Kunpeng
    Tu, Zhengzheng
    Li, Chenglong
    Zhang, Cheng
    Luo, Bin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7344 - 7358
  • [25] Adaptive Automatic Object Recognition in Single and Multi-Modal Sensor Data
    Khuon, Timothy
    Rand, Robert
    Truslow, Eric
    2014 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2014,
  • [26] Multi-modal Data Fusion for People Perception in the Social Robot Haru
    Ragel, Ricardo
    Rey, Rafael
    Paez, Lvaro
    Ponce, Javier
    Nakamura, Keisuke
    Caballero, Fernando
    Merino, Luis
    Gomez, Randy
    SOCIAL ROBOTICS, ICSR 2022, PT I, 2022, 13817 : 174 - 187
  • [27] Unsupervised Multi-modal Learning
    Iqbal, Mohammed Shameer
    ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091 : 343 - 346
  • [28] Learning Multi-modal Similarity
    McFee, Brian
    Lanckriet, Gert
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 491 - 523
  • [29] A Massive Multi-Modal Perception Data Classification Method Using Deep Learning Based on Internet of Things
    Linli Jiang
    Chunmei Wu
    International Journal of Wireless Information Networks, 2020, 27 : 226 - 233
  • [30] A Massive Multi-Modal Perception Data Classification Method Using Deep Learning Based on Internet of Things
    Jiang, Linli
    Wu, Chunmei
    INTERNATIONAL JOURNAL OF WIRELESS INFORMATION NETWORKS, 2020, 27 (02) : 226 - 233