共 3 条
Intake Gesture Detection With IMU Sensor in Free-Living Environments: The Effects of Measuring Two-Hand Intake and Down-Sampling
被引:0
|作者:
Wang, Chunzhuo
[1
,2
,3
]
Kong, Jiaze
[1
,2
]
Cai, Yutong
[1
,2
]
Kumar, T. Sunil
[1
,2
]
De Raedt, Walter
[3
]
Camps, Guido
[4
,5
]
Hallez, Hans
[6
]
Vanrumste, Bart
[1
,2
]
机构:
[1] Katholieke Univ Leuven, eMedia Res Lab, B-3000 Leuven, Belgium
[2] Katholieke Univ Leuven, ESAT STADIUS Div, B-3000 Leuven, Belgium
[3] IMEC, Dept Life Sci, B-3001 Heverlee, Belgium
[4] Wageningen Univ & Res, Div Human Nutr & Hlth, Dept Agrotechnol & Food Sci, Wageningen, Netherlands
[5] OnePlanet Res Ctr, Wageningen, Netherlands
[6] Katholieke Univ Leuven, MGrp, DistriNet, Dept Comp Sci, B-8200 Sint Michiels, Belgium
关键词:
eating gesture detection;
free-living;
food intake monitoring;
hand mirroring;
down-sampling;
D O I:
10.1109/BSN58485.2023.10331032
中图分类号:
TP39 [计算机的应用];
学科分类号:
081203 ;
0835 ;
摘要:
Food intake monitoring plays an important role in personal dietary systems. Numerous approaches have been proposed to automatically detect eating gestures using various sensors and machine learning. However, existing eating gesture detection approaches mainly focus on meal sessions. Such a task is still challenging in free-living environments due to longer monitoring duration and more non-feeding activities. This paper proposes a wearable Inertial Measurement Unit (IMU) based method to detect eating and drinking gestures in free-living environments. Two important factors that impede intake gesture detection in free-living environments are addressed: 1) how to handle IMU data from two hands, and 2) what is the impact of downsampling sensor data on performance. To integrate two-hand data, we propose a solution that combines hand mirroring and temporal concatenation techniques. The multi-stage temporal convolutional network (MS-TCN) is applied to effectively recognise intake gestures. A dataset contains 12 subjects with 67.5 h data is collected for validation. Moreover, IMU data with different sampling frequencies are processed to test performance. Validated by Leave-One-Subject-Out (LOSO) method, our approach (with 16 Hz sampling frequency) achieves a segmental F1-score of 0.826 and 0.893 for recognizing eating and drinking gestures, respectively. Results show that the proposed solution outperforms existing two-hand data combination approaches. Moreover, in our case, a higher sampling frequency does not always mean better performance.
引用
收藏
页数:4
相关论文