Classification of bird species from video using appearance and motion features

被引:11
|
作者
Atanbori, John [1 ]
Duan, Wenting [2 ]
Shaw, Edward [4 ]
Appiah, Kofi [3 ]
Dickinson, Patrick [2 ]
机构
[1] Univ Nottingham, Sch Comp Sci, Nottingham, England
[2] Univ Lincoln, Sch Comp Sci, Lincoln, England
[3] Sheffield Hallam Univ, Dept Comp, Sheffield, S Yorkshire, England
[4] Don Catchment Rivers Trust, St Catherines House,Woodfield Pk, Doncaster, England
基金
英国工程与自然科学研究理事会;
关键词
Appearance features; Motion features; Feature extraction; Feature selection; Bird species classification; Fine-grained classification; WAVELET; BATS;
D O I
10.1016/j.ecoinf.2018.07.005
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The monitoring of bird populations can provide important information on the state of sensitive ecosystems; however, the manual collection of reliable population data is labour-intensive, time-consuming, and potentially error prone. Automated monitoring using computer vision is therefore an attractive proposition, which could facilitate the collection of detailed data on a much larger scale than is currently possible. A number of existing algorithms are able to classify bird species from individual high quality detailed images often using manual inputs (such as a priori parts labelling). However, deployment in the field necessitates fully automated in-flight classification, which remains an open challenge due to poor image quality, high and rapid variation in pose, and similar appearance of some species. We address this as a fine-grained classification problem, and have collected a video dataset of thirteen bird classes (ten species and another with three colour variants) for training and evaluation. We present our proposed algorithm, which selects effective features from a large pool of appearance and motion features. We compare our method to others which use appearance features only, including image classification using state-of-the-art Deep Convolutional Neural Networks (CNNs). Using our algorithm we achieved an 90% correct classification rate, and we also show that using effectively selected motion and appearance features together can produce results which outperform state-of-the-art single image classifiers. We also show that the most significant motion features improve correct classification rates by 7% compared to using appearance features alone.
引用
收藏
页码:12 / 23
页数:12
相关论文
共 50 条
  • [31] Human age classification using appearance features and artificial neural network
    Jagtap, Jayant
    Kokare, Manesh
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2016, 8 (3-4) : 179 - 201
  • [32] Automatic Classification of Bird Species From Their Sounds Using Two-Dimensional Cepstral Coefficients
    Lee, Chang-Hsing
    Han, Chin-Chuan
    Chuang, Ching-Chien
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1541 - 1550
  • [33] Multi-label bird species classification from audio recordings using attention framework
    Noumida, A.
    Rajan, Rajeev
    APPLIED ACOUSTICS, 2022, 197
  • [34] Soft biometric classification using local appearance periocular region features
    Lyle, Jamie R.
    Miller, Philip E.
    Pundlik, Shrinivas J.
    Woodard, Damon L.
    PATTERN RECOGNITION, 2012, 45 (11) : 3877 - 3885
  • [35] Multi-label bird species classification from audio recordings using attention framework
    Noumida, A.
    Rajan, Rajeev
    Applied Acoustics, 2022, 197
  • [36] Motion blur concealment of digital video using invariant features
    Ojansivu, Ville
    Heikkila, Janne
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2006, 4179 : 35 - 45
  • [37] Video Fire Smoke Detection Using Motion and Color Features
    Yu Chunyu
    Fang Jun
    Wang Jinjun
    Zhang Yongming
    FIRE TECHNOLOGY, 2010, 46 (03) : 651 - 663
  • [38] Video Fire Smoke Detection Using Motion and Color Features
    Yu Chunyu
    Fang Jun
    Wang Jinjun
    Zhang Yongming
    Fire Technology, 2010, 46 : 651 - 663
  • [39] Fusing Appearance Features and Correlation Features for Face Video Retrieval
    Jing, Chenchen
    Dong, Zhen
    Pei, Mingtao
    Jia, Yunde
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 150 - 160
  • [40] Appearance-and-Relation Networks for Video Classification
    Wang, Limin
    Li, Wei
    Li, Wen
    Van Gool, Luc
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1430 - 1439