Agent-Driven Generative Semantic Communication With Cross-Modality and Prediction

Authors
Yang, Wanting [1]
Xiong, Zehui [1]
Yuan, Yanli [2]
Jiang, Wenchao [1]
Quek, Tony Q. S. [1]
Debbah, Merouane [3, 4]
Affiliations
[1] Singapore Univ Technol & Design, Pillar Informat Syst Technol & Design, Singapore 487372, Singapore
[2] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China
[3] Khalifa Univ Sci & Technol, KU 6G Res Ctr, Abu Dhabi, U Arab Emirates
[4] Univ Paris Saclay, CentraleSupelec, F-91192 Gif Sur Yvette, France
Funding
National Research Foundation, Singapore
Keywords
Semantics; Decoding; Surveillance; 6G mobile communication; Wireless communication; Semantic communication; Real-time systems; Layout; Training; Symbols; video streaming; diffusion model; deep reinforcement learning; semantic sampling; DEEP; SYSTEMS;
DOI
10.1109/TWC.2024.3519325
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
In the era of 6G, with compelling visions of intelligent transportation systems and digital twins, remote surveillance is poised to become a ubiquitous practice. The substantial data volume and frequent updates involved present challenges for wireless networks. To address these challenges, we propose a novel agent-driven generative semantic communication (A-GSC) framework based on reinforcement learning. In contrast to existing research on semantic communication (SemCom), which mainly focuses on either semantic extraction or semantic sampling, we integrate both by jointly considering the intrinsic attributes of the source information and the contextual information regarding the task. Notably, the introduction of generative artificial intelligence (GAI) enables the independent design of semantic encoders and decoders. In this work, we develop an agent-assisted semantic encoder with cross-modality capability, which tracks semantic changes and channel conditions to perform adaptive semantic extraction and sampling. Accordingly, we design a semantic decoder with both predictive and generative capabilities, consisting of two tailored modules. Moreover, the effectiveness of the designed models is verified using the UA-DETRAC dataset, demonstrating the performance gains of the overall A-GSC framework in both energy saving and reconstruction accuracy.
Pages: 2233-2248
Number of pages: 16