New paradigm of learnable computer vision algorithms based on the representational MDL principle

Cited by: 9
Authors
Potapov, Alexey S. [1]
Malyshev, Igor A. [1]
Puysha, Alexander E. [1]
Averkin, Anton N. [2]
Affiliations
[1] Vavilov State Opt Inst, St Petersburg, Russia
[2] Univ Informat Technol Mech & Opt, St Petersburg, Russia
Keywords
image; representation; learning; segmentation; feature; MDL; information-theoretic; FEATURES;
DOI
10.1117/12.849532
Chinese Library Classification (CLC) number
V [Aviation, Aerospace];
Discipline classification code
08 ; 0825 ;
Abstract
Learning is one of the most crucial components of computer vision systems, since it increases their generality, flexibility, and robustness. At present, image analysis algorithms adopt particular machine learning methods, which results in rather superficial learning. We present a new paradigm for constructing essentially learnable image analysis algorithms. Learning is interpreted as optimization of image representations. The notion of representation is formalized within an information-theoretic framework. The optimization criterion is derived from the well-known minimum description length (MDL) principle. Adaptation of the MDL principle to computer vision has been receiving increasing attention; however, the principle has so far been applied in a heuristic way. We derive the representational MDL (RMDL) principle, which fills the gap between the theoretical MDL principle and its practical applications. The RMDL principle gives criteria both for selecting the optimal model of a single image within a given representation and for selecting the optimal representation for an image sample. Thus, it can be used to optimize computer vision systems that function within a specific environment. The adequacy of the RMDL principle was validated on segmentation-based representations applied to different object domains. A method for learning local features as representation optimization was also developed. This method outperformed some popular methods with predefined representations, such as SURF. Thus, the paradigm can be considered promising.
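The abstract does not spell out the RMDL criterion itself, so the following is only a minimal sketch of the generic two-part MDL idea it builds on: choose the model that minimizes model bits plus data bits. It picks the number of piecewise-constant segments for a 1D signal. The function names, the Gaussian residual code, and the per-parameter and per-boundary bit costs are all illustrative assumptions, not the authors' formulation.

```python
import math


def best_rss_by_segments(signal, max_k):
    """Minimal residual sum of squares for each segment count 1..max_k.

    Uses dynamic programming over all contiguous splits of the signal
    into piecewise-constant segments (each segment modeled by its mean).
    """
    n = len(signal)
    prefix, prefix_sq = [0.0], [0.0]
    for x in signal:
        prefix.append(prefix[-1] + x)
        prefix_sq.append(prefix_sq[-1] + x * x)

    def rss(i, j):
        # Residual sum of squares of signal[i:j] around its own mean.
        total = prefix[j] - prefix[i]
        return max(prefix_sq[j] - prefix_sq[i] - total * total / (j - i), 0.0)

    inf = float("inf")
    # dp[k][j]: minimal RSS for the first j samples split into k segments.
    dp = [[inf] * (n + 1) for _ in range(max_k + 1)]
    dp[0][0] = 0.0
    for k in range(1, max_k + 1):
        for j in range(k, n + 1):
            dp[k][j] = min(dp[k - 1][i] + rss(i, j) for i in range(k - 1, j))
    return {k: dp[k][n] for k in range(1, max_k + 1)}


def description_length(rss, n, k, min_variance=1e-6):
    """Two-part code length in bits (assumed coding scheme, for illustration).

    Data part: Gaussian code for the residuals.
    Model part: 0.5*log2(n) bits per real parameter (k means + 1 variance)
    and log2(n) bits per segment boundary (k - 1 boundaries).
    """
    variance = max(rss / n, min_variance)
    data_bits = 0.5 * n * math.log2(2 * math.pi * math.e * variance)
    model_bits = (k + 1) * 0.5 * math.log2(n) + (k - 1) * math.log2(n)
    return model_bits + data_bits


def select_num_segments(signal, max_k):
    """Return the segment count with the shortest total description length."""
    n = len(signal)
    lengths = {
        k: description_length(r, n, k)
        for k, r in best_rss_by_segments(signal, max_k).items()
    }
    best_k = min(lengths, key=lengths.get)
    return best_k, lengths


if __name__ == "__main__":
    # Three flat levels with a small deterministic perturbation.
    levels = [0.0] * 20 + [5.0] * 20 + [1.0] * 20
    signal = [lvl + 0.1 * math.sin(i) for i, lvl in enumerate(levels)]
    best_k, lengths = select_num_segments(signal, max_k=6)
    print("selected segments:", best_k)
    for k in sorted(lengths):
        print(k, round(lengths[k], 1))
```

Adding a segment always lowers the residual cost, but it is accepted only when the saving in data bits exceeds the extra model bits. This is the trade-off that MDL-based model selection formalizes. Selecting a representation for a whole image sample, as the abstract describes, would additionally require summing such lengths over the sample, which this sketch does not attempt.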
Pages: 11