Semantic-Electromagnetic Inversion With Pretrained Multimodal Generative Model

被引:0
|
作者
Chen, Yanjin [1 ]
Zhang, Hongrui [1 ]
Ma, Jie [1 ]
Cui, Tie Jun [2 ,3 ]
del Hougne, Philipp [4 ]
Li, Lianlin [1 ,3 ]
机构
[1] Peking Univ, Sch Elect, State Key Lab Adv Opt Commun Syst & Networks, Beijing 100871, Peoples R China
[2] Southeast Univ, State Key Lab Millimeter Waves, Nanjing 210096, Peoples R China
[3] Pazhou Lab Huangpu, Guangzhou 510555, Peoples R China
[4] Univ Rennes, CNRS, IETR, UMR 6164, F-35000 Rennes, France
基金
中国国家自然科学基金;
关键词
inverse scattering; microwave imaging; pretrained large-capacity foundation models; semantic-electromagnetic inverse problem; RADAR TOMOGRAPHY;
D O I
10.1002/advs.202406793
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Across diverse domains of science and technology, electromagnetic (EM) inversion problems benefit from the ability to account for multimodal prior information to regularize their inherent ill-posedness. Indeed, besides priors that are formulated mathematically or learned from quantitative data, valuable prior information may be available in the form of text or images. Besides handling semantic multimodality, it is furthermore important to minimize the cost of adapting to a new physical measurement operator and to limit the requirements for costly labeled data. Here, these challenges are tackled with a frugal and multimodal semantic-EM inversion technique. The key ingredient is a multimodal generator of reconstruction results that can be pretrained, being agnostic to the physical measurement operator. The generator is fed by a multimodal foundation model encoding the multimodal semantic prior and a physical adapter encoding the measured data. For a new physical setting, only the lightweight physical adapter is retrained. The authors' architecture also enables a flexible iterative step-by-step solution to the inverse problem where each step can be semantically controlled. The feasibility and benefits of this methodology are demonstrated for three EM inverse problems: a canonical two-dimensional inverse-scattering problem in numerics, as well as three-dimensional and four-dimensional compressive microwave meta-imaging experiments. This work presents a semantic-EM inversion method capable of incorporating multimodal semantic priors in a flexible and frugal manner. It shows great advantages in handling semantic multimodality through a semantic-guided step-by-step manner and minimizing the cost of adapting to a new physical measurement operator and to limit the requirements for costly labeled training data. image
引用
收藏
页数:11
相关论文
共 50 条
  • [41] DocLLM: A Layout-Aware Generative Language Model for Multimodal Document Understanding
    Wang, Dongsheng
    Raman, Natraj
    Sibue, Mathieu
    Ma, Zhiqiang
    Babkin, Petr
    Kaur, Simerjot
    Pei, Yulong
    Nourbakhsh, Armineh
    Liu, Xiaomo
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 8529 - 8548
  • [42] Hysteresis compensation in electromagnetic actuators through Preisach model inversion
    Mittal, S
    Menq, CH
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2000, 5 (04) : 394 - 409
  • [43] Estimating ocean tide model uncertainties for electromagnetic inversion studies
    Saynisch, Jan
    Irrgang, Christopher
    Thomas, Maik
    ANNALES GEOPHYSICAE, 2018, 36 (04) : 1009 - 1014
  • [44] Modelling and inversion of electromagnetic data using an approximate plate model
    Pirttijärvi, M
    Pietilä, R
    Hattula, A
    Hjelt, SE
    GEOPHYSICAL PROSPECTING, 2002, 50 (05) : 425 - 440
  • [45] Center-enhanced video captioning model with multimodal semantic alignment
    Zhang, Benhui
    Gao, Junyu
    Yuan, Yuan
    NEURAL NETWORKS, 2024, 180
  • [46] GenSC: Generative Semantic Communication Systems Using BART-Like Model
    Chang, Min-Kuan
    Hsu, Chun-Tse
    Yang, Guu-Chang
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (10) : 2298 - 2302
  • [47] DEPAS: De-novo Pathology Semantic Masks using a Generative Model
    Larey, Ariel
    Daniel, Nati
    Aknin, Eliel
    Fisher, Yael
    Savir, Yonatan
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [48] Motion Graphs++: a Compact Generative Model for Semantic Motion Analysis and Synthesis
    Min, Jianyuan
    Chai, Jinxiang
    ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (06):
  • [49] Simple linear inversion of soil electromagnetic properties from analytical model of electromagnetic induction sensor
    Vasic, Darko
    Ambrus, Davorin
    Bilas, Vedran
    2014 IEEE SENSORS APPLICATIONS SYMPOSIUM (SAS), 2014, : 15 - 19
  • [50] Fast Electromagnetic Inversion Solver Based on Conditional Generative Adversarial Network for High-Contrast and Heterogeneous Scatterers
    Yao, He Ming
    Zhang, Huan Huan
    Jiang, Lijun
    Ng, Michael Kwok Po
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2024, 72 (04) : 3485 - 3494