Prompt Learning for Light Field Semantic Segmentation in the Consumer-Centric Internet of Intelligent Computing Things

被引:0
|
作者
Jia, Chen [1 ,2 ]
Shi, Fan [1 ,2 ]
Liu, Xiufeng [3 ]
Cheng, Xu [1 ,2 ]
Zhang, Zixuan [1 ,2 ]
Zhao, Meng [1 ,2 ]
Chen, Shengyong [1 ,2 ]
机构
[1] Tianjin Univ Technol, Engn Res Ctr Learning Based Intelligent Syst, Minist Educ, Key Lab Comp Vis & Syst, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China
[3] Tech Univ Denmark, Dept Technol Management & Econ, DK-2800 Lyngby, Denmark
基金
中国国家自然科学基金;
关键词
Transformers; Semantic segmentation; Light fields; Feature extraction; Consumer electronics; Data models; Computational modeling; Light field imaging; semantic segmentation; CIoT; intelligent computing; explicit angle prompting;
D O I
10.1109/TCE.2024.3436010
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Light field semantic segmentation accurately identifies the semantic information of the scene, providing solutions for various intelligent computing tasks in consumer electronics and CIoT, such as portrait segmentation, image editing and environmental perception. However, the high dimensionality, redundancy and computational cost of light field 4D data limit its application. To address this challenge, we decode highly interleaved data into multi-scale macro-pixel images and propose a prompt-based light field semantic segmentation network. This network incorporates an efficient transformer architecture to capture and learn global and long-range dependencies. Unlike the previous implicit embedding method, we introduce a visual prompt component called Explicit Angle Prompting (EAP) in the model. The key insight is to adaptively generate crucial angle-based visual prompts explicitly during the training stage, enhancing the understanding of geometric information such as object shape and structure. Furthermore, the self-attention design in the encoder and the multi-scale feature fusion mechanism ensure that the network comprehends the global and contextual information of the image from the perspective of the light field spatial dimensions. Extensive experiments showcase the potential of light field prompt learning for semantic segmentation, demonstrating the model's capability to efficiently and accurately segment objects. The code will be released.
引用
收藏
页码:5493 / 5505
页数:13
相关论文
共 25 条
  • [21] Project-based learning model based on intelligent computing of the internet of things: characteristics, hidden worries, and beyond
    Hu, Liangliang
    NEURAL COMPUTING & APPLICATIONS, 2023, 37 (12): : 7897 - 7908
  • [22] A Semantic Segmentation Method for Three- Dimensional Image of Internet Data Center Machine Room Constructed by Intelligent Power Internet of Things Based on Fabric Architecture
    Li, Qiang
    Qin, Long
    Lu, Weizhi
    Zhang, Yaqiang
    Wang, Xu
    JOURNAL OF NANOELECTRONICS AND OPTOELECTRONICS, 2022, 17 (11) : 1426 - 1434
  • [23] Graph Structure Learning-Based Multivariate Time Series Anomaly Detection in Internet of Things for Human-Centric Consumer Applications
    He, Shiming
    Li, Genxin
    Yi, Tongzhijian
    Alfarraj, Osama
    Tolba, Amr
    Sangaiah, Arun Kumar
    Sherratt, R. Simon
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (03) : 5419 - 5431
  • [24] Toward intelligent resource management in dynamic Fog Computing-based Internet of Things environment with Deep Reinforcement Learning: A survey
    Gupta, Shally
    Singh, Nanhay
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2023, 36 (04)
  • [25] Artificial-Intelligence-Based Low-Light Marine Image Enhancement for Semantic Segmentation in Edge-Intelligence-Empowered Internet of Things Environment
    Im, Su Jin
    Yun, Chaeyeong
    Lee, Sung Jae
    Park, Kang Ryoung
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 4086 - 4114