Multimodal feature assessment using multibranch 3D CNN to BI-LSTM for feature level multi-polarization SAR image data fusion and vehicle identification
被引:1
|
作者:
Arnous, Ferris, I
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Dept Engr Sci & Mech, University Pk, PA 16802 USAPenn State Univ, Dept Engr Sci & Mech, University Pk, PA 16802 USA
Arnous, Ferris, I
[1
]
Narayanan, Ram M.
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Dept Elect Engr, University Pk, PA 16802 USAPenn State Univ, Dept Engr Sci & Mech, University Pk, PA 16802 USA
Narayanan, Ram M.
[2
]
机构:
[1] Penn State Univ, Dept Engr Sci & Mech, University Pk, PA 16802 USA
[2] Penn State Univ, Dept Elect Engr, University Pk, PA 16802 USA
Image data fusion;
SAR target recognition;
multimodal image fusion;
machine learning;
vehicle identification;
feature fusion;
D O I:
10.1117/12.2663374
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Deep machine learning computer vision algorithms have been widely explored for the purpose of multisensory data fusion. The ability to combine feature, pixel, and decision level information from multiple sensors in order to enhance accurate assessments and decisions made by the platforms has been a significant point of interest for the remote sensing community. In this paper, we propose a dual branch 3D convolutional neural network (CNN) to bi-long short-term memory network (BILSTM) algorithm that seeks to fuse sparse multiresolution, multi-pose and multimodal VV and HV polarizations of synthetic aperture radar (SAR) vehicle image information to enhance vehicle identification in unfamiliar and uncoherent environments. We cultivated and explored the proposed algorithm using the SDMS CV Data Domes repository of 14,430 augmented images per modality, equally represented over ten vehicle classes under similar and dissimilar vehicle pose augmentations with low to high levels of added testing set noise via zero-mean white Gaussian noise. Our results indicated that the local individual modality 3D convolution fusion of multiple poses and resolutions as well as dual-modality fusion of both polarizations enhanced the developed algorithm's ability to classify SAR vehicle image information in unfamiliar pose, elevation angle and moderate to low noise environments.
机构:
Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
Chinese Acad Sci, Key Lab Optoelect Informat Proc, Shenyang 110016, Peoples R China
Key Lab Image Understanding & Comp Vis, Shenyang 110016, Liaoning, Peoples R ChinaChinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
Wang, Ende
Xue, Lei
论文数: 0引用数: 0
h-index: 0
机构:
Shenyang Ligong Univ, Sch Informat Sci & Engn, Shenyang 110159, Peoples R ChinaChinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
Xue, Lei
Li, Yong
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R ChinaChinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
Li, Yong
Zhang, Zhenxin
论文数: 0引用数: 0
h-index: 0
机构:
Capital Normal Univ, Key Lab 3D Informat Acquisit & Applicat, MOE, Beijing 100048, Peoples R China
Capital Normal Univ, Coll Resource Environm & Tourism, Beijing 100048, Peoples R ChinaChinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
Zhang, Zhenxin
Hou, Xukui
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
Chinese Acad Sci, Key Lab Optoelect Informat Proc, Shenyang 110016, Peoples R China
Key Lab Image Understanding & Comp Vis, Shenyang 110016, Liaoning, Peoples R ChinaChinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China