Classification of bird species from video using appearance and motion features

被引:11
|
作者
Atanbori, John [1 ]
Duan, Wenting [2 ]
Shaw, Edward [4 ]
Appiah, Kofi [3 ]
Dickinson, Patrick [2 ]
机构
[1] Univ Nottingham, Sch Comp Sci, Nottingham, England
[2] Univ Lincoln, Sch Comp Sci, Lincoln, England
[3] Sheffield Hallam Univ, Dept Comp, Sheffield, S Yorkshire, England
[4] Don Catchment Rivers Trust, St Catherines House,Woodfield Pk, Doncaster, England
基金
英国工程与自然科学研究理事会;
关键词
Appearance features; Motion features; Feature extraction; Feature selection; Bird species classification; Fine-grained classification; WAVELET; BATS;
D O I
10.1016/j.ecoinf.2018.07.005
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The monitoring of bird populations can provide important information on the state of sensitive ecosystems; however, the manual collection of reliable population data is labour-intensive, time-consuming, and potentially error prone. Automated monitoring using computer vision is therefore an attractive proposition, which could facilitate the collection of detailed data on a much larger scale than is currently possible. A number of existing algorithms are able to classify bird species from individual high quality detailed images often using manual inputs (such as a priori parts labelling). However, deployment in the field necessitates fully automated in-flight classification, which remains an open challenge due to poor image quality, high and rapid variation in pose, and similar appearance of some species. We address this as a fine-grained classification problem, and have collected a video dataset of thirteen bird classes (ten species and another with three colour variants) for training and evaluation. We present our proposed algorithm, which selects effective features from a large pool of appearance and motion features. We compare our method to others which use appearance features only, including image classification using state-of-the-art Deep Convolutional Neural Networks (CNNs). Using our algorithm we achieved an 90% correct classification rate, and we also show that using effectively selected motion and appearance features together can produce results which outperform state-of-the-art single image classifiers. We also show that the most significant motion features improve correct classification rates by 7% compared to using appearance features alone.
引用
收藏
页码:12 / 23
页数:12
相关论文
共 50 条
  • [21] Detection & Classification of Imperceptible Motion Using Video Decomposition
    Bhattacharya, Saumik
    Venkatsh, K. S.
    Gupta, Sumana
    2016 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2016, : 296 - 299
  • [22] RECOGNIZING ACTIVITIES FROM EGOCENTRIC IMAGES WITH APPEARANCE AND MOTION FEATURES
    Chen, Yanhua
    Pei, Mingtao
    Nie, Zhengang
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [23] Internet Video Traffic Classification using QoS Features
    Wang Zai-jian
    Dong, Yu-ning
    Shi, Hai-xian
    Yang Lingyun
    Tang Pingping
    2016 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2016,
  • [24] Hierarchical Classification of Bird Species Using Their Audio Recorded Songs
    Silla, Carlos N., Jr.
    Kaestner, Celso A. A.
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 1895 - 1900
  • [25] Bird Species Classification Using Deep Learning: A Comparative Study
    Mutlu Bilgin, Mehtap
    Ozdem, Kevser
    Akcayol, M. Ali
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2022, 25 (03): : 1251 - 1260
  • [26] Video salient event classification using audio features
    Corchs, Silvia
    Ciocca, Gianluigi
    Fiori, Massimiliano
    Gasparini, Francesca
    IMAGING AND MULTIMEDIA ANALYTICS IN A WEB AND MOBILE WORLD 2014, 2014, 9027
  • [27] Medical video event classification using shared features
    Cao, Yu
    Liu, Shih-Hsi 'Alex'
    Li, Ming
    Baang, Sung
    Hu, Sanqing
    ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 266 - +
  • [28] Ciliary motion features from digitized video photography
    Ben-Shimol, Yehuda
    Dinstein, Its'hak
    Meisels, Amnon
    Priel, Zvi
    Journal of Computer-Assisted Microscopy, 1992, 3 (03): : 103 - 116
  • [29] Video Genre Estimation from Relationship between Motion and Facial Features using SLPCCA
    Sasaka, Yuma
    Ogawa, Takahiro
    Haseyama, Miki
    2015 IEEE 4TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2015, : 250 - 251
  • [30] News sports video shot classification with sports play field and motion features
    Wang, DH
    Tian, Q
    Gao, S
    Sung, WK
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2247 - 2250