Photo-realistic 3D model based accurate visual positioning system for large-scale indoor spaces

被引:2
|
作者
Hyeon, Janghun [1 ]
Jang, Bumchul [2 ,3 ]
Choi, Hyunga [3 ]
Kim, Joohyung [2 ]
Kim, Dongwoo [4 ]
Doh, Nakju [3 ,5 ]
机构
[1] Korea Univ, Semicond Res Inst, Seoul 02841, South Korea
[2] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea
[3] TeeLabs, Seoul 02857, South Korea
[4] Hyundai Mobis, Seoul 16891, South Korea
[5] Korea Univ, Inst Convergence Sci, Seoul 02841, South Korea
基金
新加坡国家研究基金会;
关键词
Visual localization; Visual positioning systems; Camera pose estimation; Image retrieval; Place recognition; Indoor spaces;
D O I
10.1016/j.engappai.2023.106256
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study presents a novel and reliable visual positioning system (VPS), KR-Net, for kidnap recovery tasks, which predicts an accurate position when a robot is first initiated. KR-Net is based on a hierarchical visual localization method and demonstrates significant robustness in large-scale indoor environments. The proposed VPS utilizes a photo-realistic 3D model to generate a dense database of any camera pose and incorporates a novel global descriptor for indoor spaces, i-GeM, that outperforms existing methods in terms of robustness. Additionally, the proposed combinatorial pooling approach overcomes the limitations of previous single image-based predictions in large-scale indoor environments, allowing for accurate discrimination between similar locations. Extensive evaluations were performed on six large-scale indoor datasets to demonstrate the contributions of each component. To the best of our knowledge, KR-Net is the first system to estimate wake-up positions with a near 100% confidence level within a 1.0 m distance error threshold.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] On Photo-Realistic 3D Reconstruction of Large-Scale and Arbitrary-Shaped Environments
    Bondarev, Egor
    Heredia, Francisco
    Favier, Rafael
    Ma, Lingni
    de With, Peter H. N.
    2013 IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE (CCNC), 2013, : 621 - 624
  • [2] Photo-realistic 3D model reconstruction
    Se, Stephen
    Jasiobedzki, Piotr
    2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 3076 - +
  • [3] Spatial template-based geometric complexity reduction method for photo-realistic modeling of large-scale indoor spaces
    Hyeon, Janghun
    Kim, Joohyung
    Choi, Hyunga
    Jang, Bumchul
    Kang, Jaehyeon
    Doh, Nakju
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [4] Automatic Spatial Template Generation for Realistic 3D Modeling of Large-Scale Indoor Spaces
    Hyeon, Janghun
    Choi, Hyunga
    Kim, JooHyung
    Jang, Bumchul
    Kang, Jaehyeon
    Doh, Nakju
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4221 - 4228
  • [5] PreSim: A 3D Photo-Realistic Environment Simulator for Visual AI
    Yuan, Honglin
    Veltkamp, Remco C.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 2501 - 2508
  • [6] 3D Semantic Parsing of Large-Scale Indoor Spaces
    Armeni, Iro
    Sener, Ozan
    Zamir, Amir R.
    Jiang, Helen
    Brilakis, Ioannis
    Fischer, Martin
    Savarese, Silvio
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1534 - 1543
  • [7] Photo-realistic 3D model coding in MPEG-4
    Aspert, N
    Ebrahimi, T
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1111 - 1114
  • [8] Hand Pose Understanding With Large-Scale Photo-Realistic Rendering Dataset
    Deng, Xiaoming
    Zhang, Yinda
    Shi, Jian
    Zhu, Yuying
    Cheng, Dachuan
    Zuo, Dexin
    Cui, Zhaopeng
    Tan, Ping
    Chang, Liang
    Wang, Hongan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4275 - 4290
  • [9] Pose Correction for Highly Accurate Visual Localization in Large-scale Indoor Spaces
    Hyeon, Janghun
    Kim, Joohyung
    Doh, Nakju
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15954 - 15963
  • [10] Text Driven 3D Photo-Realistic Talking Head
    Wang, Lijuan
    Han, Wei
    Soong, Frank K.
    Huo, Qiang
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3314 - 3315