RingGesture: A Ring-Based Mid-Air Gesture Typing System Powered by a Deep-Learning Word Prediction Framework

Cited by: 0
Authors
Shen, Junxiao [1 ,2 ]
Boldu, Roger [1 ]
Kalla, Arpit [1 ]
Glueck, Michael [1 ]
Surale, Hemant Bhaskar [1 ]
Karlson, Amy [1 ]
Affiliations
[1] Meta, Reality Labs Research, Menlo Park, CA 94025 USA
[2] University of Bristol, Bristol, England
Keywords
Tracking; Keyboards; Context modeling; Wrist; Predictive models; Decoding; Trajectory; Text entry; augmented reality; word prediction; language models; INPUT; PERFORMANCE;
DOI
10.1109/TVCG.2024.3456163
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Text entry is a critical capability for any modern computing experience, and lightweight augmented reality (AR) glasses are no exception. Because they are designed for all-day wearability, lightweight AR glasses cannot accommodate the multiple cameras needed for hand tracking across an extensive field of view. This constraint underscores the need for an additional input device. We propose a system to address this gap: RingGesture, a ring-based mid-air gesture typing technique that uses electrodes to mark the start and end of gesture trajectories and inertial measurement unit (IMU) sensors for hand tracking. This method offers an intuitive experience similar to the raycast-based mid-air gesture typing found in VR headsets, allowing hand movements to translate seamlessly into cursor navigation. To enhance both accuracy and input speed, we propose a novel deep-learning word prediction framework, Score Fusion, comprising three key components: a) a word-gesture decoding model, b) a spatial spelling correction model, and c) a lightweight contextual language model. The framework fuses the scores from the three models to predict the most likely words with higher precision. We conduct comparative and longitudinal studies that demonstrate two key findings. First, RingGesture is effective overall, achieving an average text entry speed of 27.3 words per minute (WPM) and a peak performance of 47.9 WPM. Second, the Score Fusion framework outperforms a conventional word prediction framework, Naive Correction, with a 28.2% improvement in uncorrected Character Error Rate, leading to a 55.2% improvement in text entry speed for RingGesture. Additionally, RingGesture received a System Usability Scale score of 83, signifying excellent usability.
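The abstract states that Score Fusion combines the scores of three models but does not say how. As a rough illustration only, the sketch below assumes a weighted sum of log-probabilities, which is one common way to fuse model scores and is not confirmed as the authors' method. The function name `fuse_scores`, the weights, the probability floor, and the toy candidate scores are all hypothetical.

```python
import math

def fuse_scores(decoder, speller, language_model,
                weights=(1.0, 1.0, 1.0), floor=1e-9):
    """Rank candidate words by a weighted sum of log-probabilities.

    Each argument is a dict mapping a candidate word to a probability from
    one model. The log-linear fusion rule, the weights, and the floor are
    assumptions made for illustration; the abstract does not specify them.
    """
    candidates = set(decoder) | set(speller) | set(language_model)
    fused = {}
    for word in candidates:
        # A word missing from one model gets the floor probability
        # instead of being discarded outright.
        probs = (decoder.get(word, floor),
                 speller.get(word, floor),
                 language_model.get(word, floor))
        fused[word] = sum(w * math.log(max(p, floor))
                          for w, p in zip(weights, probs))
    # Highest fused score first.
    return sorted(fused, key=fused.get, reverse=True)

# Toy scores (made up) for a gesture traced roughly over "hello".
decoder = {"hello": 0.5, "jello": 0.4, "help": 0.1}          # word-gesture decoding
speller = {"hello": 0.6, "jello": 0.3, "help": 0.1}          # spatial spelling correction
language_model = {"hello": 0.7, "help": 0.25, "jello": 0.05} # contextual language model

ranking = fuse_scores(decoder, speller, language_model)
print(ranking)
```

In this toy case the gesture decoder alone barely separates "hello" from "jello", while the language model strongly prefers "hello"; summing the log scores lets the models' evidence reinforce each other, which is the general intuition behind fusing scores rather than applying the models one after another.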
Pages: 7441-7451
Number of pages: 11