Adapt-Net: A Unified Object Detection Framework for Mobile Augmented Reality

Cited by: 0
Authors
Zeng, Xiangyun [1 ]
Tan, Siok Yee [1 ]
Nasrudin, Mohammad Faidzul [1 ]
Affiliations
[1] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ctr Artificial Intelligence Technol, Bangi 43600, Malaysia
Source
IEEE Access | 2024 / Vol. 12
Keywords
Adaptation models; Object detection; Computational modeling; Accuracy; Data models; Feature extraction; Training; Contrastive learning; Deep learning; Mutual information; Knowledge management; Augmented reality; deep mutual learning; masked generative knowledge distillation; mobile augmented reality; object detection; unsupervised domain adaptation;
DOI
10.1109/ACCESS.2024.3447043
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Object detection is a crucial task in mobile augmented reality (MAR), where achieving both speed and accuracy with limited computational resources is essential. However, applying object detection models to new domains or reducing the model size tends to lower their performance. To address this problem, this research introduces a unified object detection framework called Adapt-Net. The framework incorporates contrastive learning techniques for unsupervised domain adaptation, a teacher-student generative compression model with masking, and deep mutual learning between student models, all built upon the YOLOv8 architecture. Adapt-Net's key novelty lies in its unified framework, which combines three models: two student models and one teacher model. Each model comprises a feature-extracting backbone and an adapter network. The student models' backbones are trained using deep mutual learning and a contrastive learning loss to ensure domain-invariant feature generation. Unsupervised domain adaptation and masked generative knowledge distillation modules facilitate knowledge transfer from the teacher to the student models, enhancing their ability to generalize to unfamiliar objects. Masked generative knowledge distillation guides the student models to reconstruct the teacher's features from a masked input in a generative manner, rather than merely imitating the teacher's output. This generative approach improves the student models' representation capabilities. Adapt-Net thus enables the student models not only to learn domain-invariant features but also to generalize better to new objects. Extensive experiments conducted on benchmark datasets demonstrate that our proposed approach surpasses state-of-the-art object detection methods by 6.8 mAP in detection accuracy on the Microsoft COCO dataset. Notably, the model size remains a compact 3.2M, enabling fast inference speeds, lower computational resource consumption, and enhanced resilience to domain variations. Adapt-Net represents a promising approach to object detection that combines accuracy with efficiency.
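The abstract names two training signals without giving their loss forms: deep mutual learning between the two students, and masked generative distillation from the teacher. The following is a minimal sketch of how such terms are commonly written, not the authors' implementation. It assumes a symmetric KL mimicry term between the students' class predictions and a mean-squared reconstruction error against the teacher's features. All function names are hypothetical, the features are flat lists rather than YOLOv8 feature maps, and `generator` is a placeholder for whatever learned generation block the paper uses.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def mutual_learning_losses(logits_a, logits_b):
    """Mimicry terms for two peer students (assumed form).

    Student A is pulled toward B's prediction via KL(p_b || p_a),
    and student B toward A's prediction via KL(p_a || p_b).
    """
    p_a, p_b = softmax(logits_a), softmax(logits_b)
    return kl_divergence(p_b, p_a), kl_divergence(p_a, p_b)


def random_mask(features, mask_ratio, rng):
    """Zero each feature entry independently with probability mask_ratio."""
    return [0.0 if rng.random() < mask_ratio else f for f in features]


def masked_generative_loss(student_feat, teacher_feat, generator, mask_ratio, rng):
    """Mask the student's features, regenerate them, compare to the teacher.

    `generator` is a hypothetical stand-in for a learned generation block.
    The loss is the mean squared error between the regenerated features
    and the teacher's features (an assumed choice of distance).
    """
    masked = random_mask(student_feat, mask_ratio, rng)
    reconstructed = generator(masked)
    squared = [(r - t) ** 2 for r, t in zip(reconstructed, teacher_feat)]
    return sum(squared) / len(teacher_feat)


if __name__ == "__main__":
    rng = random.Random(0)
    loss_a, loss_b = mutual_learning_losses([2.0, 0.5, -1.0], [1.5, 0.7, -0.5])
    identity = lambda feats: list(feats)  # trivial generator, for illustration only
    mgd = masked_generative_loss([0.2, 0.4, 0.6], [0.2, 0.4, 0.6], identity, 0.5, rng)
    print(loss_a, loss_b, mgd)
```

In a real training loop these terms would be added to the detection loss of each student with weighting coefficients. The abstract does not state those weights, so none are assumed here.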
Pages: 120788-120803
Number of pages: 16