A New Method of 3D Scene Recognition from Still Images

被引:0
|
作者
Zheng Li-ming [1 ]
Wang Xing-song [1 ]
机构
[1] Southeast Univ, Sch Mech Engn, Nanjing 210096, Jiangsu, Peoples R China
关键词
Unsupervised learning; monocular visual; 3D scene recognition; superpixels; spectral clustering; CAMERA CALIBRATION; SPECTRAL METHODS; GROUND SURFACE; DISTANCE; KERNEL; RECONSTRUCTION; REPRESENTATION; SHIFT;
D O I
10.1117/12.2064179
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most methods of monocular visual three dimensional (3D) scene recognition involve supervised machine learning. However, these methods often rely on prior knowledge. Specifically, they learn the image scene as part of a training dataset. For this reason, when the sampling equipment or scene is changed, monocular visual 3D scene recognition may fail. To cope with this problem, a new method of unsupervised learning for monocular visual 3D scene recognition is here proposed. First, the image is made using superpixel segmentation based on the CIELAB color space values L, a, and b and on the coordinate values x and y of pixels, forming a superpixel image with a specific density. Second, a spectral clustering algorithm based on the superpixels' color characteristics and neighboring relationships was used to reduce the dimensions of the superpixel image. Third, the fuzzy distribution density functions representing sky, ground, and facade are multiplied with the segment pixels, where the expectations of these segments are obtained. A preliminary classification of sky, ground, and facade is generated in this way. Fourth, the most accurate classification images of sky, ground, and facade were extracted through the tier-1 wavelet sampling and Manhattan direction feature. Finally, a depth perception map is generated based on the pinhole imaging model and the linear perspective information of ground surface. Here, 400 images of Make3D Image data from the Cornell University website were used to test the algorithm. The experimental results showed that this unsupervised learning method provides a more effective monocular visual 3D scene recognition model than other methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A New Shadow Detection and Depth Removal Method for 3D Text Recognition in Scene Images
    Zhong, Wencan
    Raj, Alex Noel Joseph
    Shivakumara, Palaiahnakote
    Zhuang, Zhemin
    Lu, Tong
    Pal, Umapada
    [J]. PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 277 - 281
  • [2] 3D scene retrieval and recognition with Depth Gradient Images
    Adan, Antonio
    Merchan, Pilar
    Salamanca, Santiago
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (09) : 1337 - 1353
  • [3] 3D SCENE RECONSTRUCTION FROM RGB IMAGES
    Rotaru, Razvan-Paul
    Gradinaru, Alexandru
    Moldoveanu, Florica
    [J]. UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2024, 86 (02): : 101 - 112
  • [4] 3D SCENE RECONSTRUCTION FROM RGB IMAGES
    Rotaru, Răzvan-Paul
    Grădinaru, Alexandru
    Moldoveanu, Florica
    [J]. UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2024, 86 (02): : 101 - 112
  • [5] Pose Invariant Method for Emotion Recognition from 3D Images
    Suja, P.
    Krishnasri, D.
    Tripathi, Shikha
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [6] Reconstructing 3D garment from Still Images
    Zhong, Yueqi
    [J]. 2009 IEEE 10TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED INDUSTRIAL DESIGN & CONCEPTUAL DESIGN, VOLS 1-3: E-BUSINESS, CREATIVE DESIGN, MANUFACTURING - CAID&CD'2009, 2009, : 1900 - 1905
  • [7] Indoor Scene Recognition in 3D
    Huang, Shengyu
    Usvyatsov, Mikhail
    Schindler, Konrad
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 8041 - 8048
  • [8] A method of face recognition using 3D images
    Yuan, X
    Lu, JM
    Yahagi, T
    [J]. 2004 47TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2004, : 221 - 224
  • [9] 3D scene reconstruction from cylindrical panoramic images
    Bunschoten, R
    Kröse, B
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2002, 41 (2-3) : 111 - 118
  • [10] An Interactive Registration Method for Images to the 3D Urban Scene Model
    Shen, Xiaorong
    Hong, Peng
    Xiu, Quanfa
    Zhang, Tianlong
    [J]. PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 176 - 179