How to treat mixed behavior segments in supervised machine learning of behavioural modes from inertial measurement data

被引:4
|
作者
Resheff, Yehezkel S. [1 ]
Bensch, Hanna M. [2 ,3 ]
Zottl, Markus [2 ,3 ]
Harel, Roi [4 ,5 ,6 ,7 ]
Matsumoto-Oda, Akiko [8 ]
Crofoot, Margaret C. [4 ,5 ,6 ,7 ]
Gomez, Sara [9 ]
Borger, Luca [9 ]
Rotics, Shay [10 ,11 ,12 ]
机构
[1] Hebrew Univ Jerusalem, Hebrew Univ, Business Sch, Jerusalem, Israel
[2] Linnaeus Univ, Ctr Ecol & Evolut Microbial Model Syst EEMIS, Dept Biol & Environm Sci, S-39182 Kalmar, Sweden
[3] Kalahari Res Ctr, Kuruman River Reserve, Van Zylsrus, South Africa
[4] Max Planck Inst Anim Behav, Dept Ecol Anim Soc, Constance, Germany
[5] Univ Konstanz, Dept Biol, Constance, Germany
[6] Univ Konstanz, Ctr Adv Study Collect Behav, Constance, Germany
[7] Mpala Res Ctr, Nanyuki, Kenya
[8] Univ Ryukyus, Grad Sch Tourism Sci, Nishihara, Okinawa, Japan
[9] Swansea Univ, Dept Biosci, Swansea, Wales
[10] Tel Aviv Univ, Fac Life Sci, Sch Zool, Tel Aviv, Israel
[11] Tel Aviv Univ, Steinhardt Museum Nat Hist, Tel Aviv, Israel
[12] Kalahari Res Ctr, Kuruman River Reserve, Van Zylsrus, South Africa
来源
MOVEMENT ECOLOGY | 2024年 / 12卷 / 01期
基金
欧洲研究理事会; 以色列科学基金会;
关键词
Body-acceleration; Bio-logging; Machine learning; Animal behaviour; ACCELEROMETERS; MIGRATION;
D O I
10.1186/s40462-024-00485-7
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The application of supervised machine learning methods to identify behavioural modes from inertial measurements of bio-loggers has become a standard tool in behavioural ecology. Several design choices can affect the accuracy of identifying the behavioural modes. One such choice is the inclusion or exclusion of segments consisting of more than a single behaviour (mixed segments) in the machine learning model training data. Currently, the common practice is to ignore such segments during model training. In this paper we tested the hypothesis that including mixed segments in model training will improve accuracy, as the model would perform better in identifying them in the test data. We test this hypothesis using a series of data simulations on four datasets of accelerometer data coupled with behaviour observations, obtained from four study species (Damaraland mole-rats, meerkats, olive baboons, polar bears). Results show that when a substantial proportion of the test data are mixed behaviour segments (above similar to 10%), including mixed segments in machine learning model training improves the accuracy of classification. These results were consistent across the four study species, and robust to changes in segment length, sample size, and degree of mixture within the mixed segments. However, we also find that in some cases (particularly in baboons) models trained with mixed segments show reduced accuracy in classifying test data containing only single behaviour (pure) segments, compared to models trained without mixed segments. Based on these results, we recommend that when the classification model is expected to deal with a substantial proportion of mixed behaviour segments (> 10%), it is beneficial to include them in model training, otherwise, it is unnecessary but also not harmful. The exception is when there is a basis to assume that the training data contains a higher rate of mixed segments than the actual (unobserved) data to be classified-such a situation may occur particularly when training data are collected in captivity and used to classify data from the wild. In this case, excess inclusion of mixed segments in training data should probably be avoided.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Automatically evaluating balance using machine learning and data from a single inertial measurement unit
    Fahad Kamran
    Kathryn Harrold
    Jonathan Zwier
    Wendy Carender
    Tian Bao
    Kathleen H. Sienko
    Jenna Wiens
    Journal of NeuroEngineering and Rehabilitation, 18
  • [2] Automatically evaluating balance using machine learning and data from a single inertial measurement unit
    Kamran, Fahad
    Harrold, Kathryn
    Zwier, Jonathan
    Carender, Wendy
    Bao, Tian
    Sienko, Kathleen H.
    Wiens, Jenna
    JOURNAL OF NEUROENGINEERING AND REHABILITATION, 2021, 18 (01)
  • [3] Predicting Diabetes Diseases Using Mixed Data and Supervised Machine Learning Algorithms
    Daanouni, Othmane
    Cherradi, Bouchaib
    Tmiri, Amal
    4TH INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA' 19), 2019,
  • [4] Incremental semi-supervised Extreme Learning Machine for Mixed data stream classification
    Li, Qiude
    Xiong, Qingyu
    Ji, Shengfen
    Yu, Yang
    Wu, Chao
    Gao, Min
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 185
  • [5] Automatic recognition of excavator working cycles using supervised learning and motion data obtained from inertial measurement units (IMUs)
    Amirmasoud Molaei
    Antti Kolu
    Kalle Lahtinen
    Marcus Geimer
    Construction Robotics, 2024, 8 (2)
  • [6] Identifying Rare Diseases from Behavioural Data: A Machine Learning Approach
    MacLeod, Haley
    Yang, Shuo
    Oakes, Kim
    Connelly, Kay
    Natarajan, Sriraam
    2016 IEEE FIRST INTERNATIONAL CONFERENCE ON CONNECTED HEALTH: APPLICATIONS, SYSTEMS AND ENGINEERING TECHNOLOGIES (CHASE), 2016, : 130 - 139
  • [7] DeepEthogram, a machine learning pipeline for supervised behavior classification from raw pixels
    Bohnslav, James P.
    Wimalasena, Nivanthika K.
    Clausing, Kelsey J.
    Dai, Yu Y.
    Yarmolinsky, David A.
    Cruz, Tomas
    Kashlan, Adam D.
    Chiappe, M. Eugenia
    Orefice, Lauren L.
    Woolf, Clifford J.
    Harvey, Christopher D.
    ELIFE, 2021, 10
  • [8] Supervised learning from noisy observations: Combining machine-learning techniques with data assimilation
    Gottwald, Georg A.
    Reich, Sebastian
    PHYSICA D-NONLINEAR PHENOMENA, 2021, 423
  • [9] Learning from Privacy Preserved Encrypted Data on Cloud Through Supervised and Unsupervised Machine Learning
    Khan, Ahmad Neyaz
    Fan, Ming Yu
    Malik, Asad
    Memon, Raheel Ahmed
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING, MATHEMATICS AND ENGINEERING TECHNOLOGIES (ICOMET), 2019,
  • [10] Robust Data-Driven Framework for Driver Behavior Profiling Using Supervised Machine Learning
    Abdelrahman, Abdalla Ebrahim
    Hassanein, Hossam S.
    Abu-Ali, Najah
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (04) : 3336 - 3350