Learning Abstract Representations Through Lossy Compression of Multimodal Signals

被引:3
|
作者
Wilmot, Charles [1 ]
Baldassarre, Gianluca [2 ]
Triesch, Jochen [1 ]
机构
[1] Frankfurt Inst Adv Studies, Dept Neurosci, D-60438 Frankfurt, Germany
[2] CNR, Inst Cognit Sci & Technol, I-00185 Rome, Italy
关键词
Abstraction; autoencoder; intrinsic motivation; lossy compression; multimodality; open-ended learning; COMPONENT ANALYSIS;
D O I
10.1109/TCDS.2021.3108478
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key competence for open-ended learning is the formation of increasingly abstract representations useful for driving complex behavior. Abstract representations ignore specific details and facilitate generalization. Here, we consider the learning of abstract representations in a multimodal setting with two or more input modalities. We treat the problem as a lossy compression problem and show that generic lossy compression of multimodal sensory input naturally extracts abstract representations that tend to strip away modalitiy specific details and preferentially retain information that is shared across the different modalities. Specifically, we propose an architecture that is able to extract information common to different modalities based on the compression abilities of generic autoencoder neural networks. We test the architecture with two tasks that allow: 1) the precise manipulation of the amount of information contained in and shared across different modalities and 2) testing the method on a simulated robot with visual and proprioceptive inputs. Our results show the validity of the proposed approach and demonstrate the applicability to embodied agents.
引用
收藏
页码:348 / 360
页数:13
相关论文
共 50 条
  • [1] Lossy Compression Techniques for EEG Signals
    Phuong Thi Dao
    Li, Xue Jun
    Hung Ngoc Do
    [J]. 2015 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2015, : 154 - 159
  • [2] Theorising Learning in Science Through Integrating Multimodal Representations
    Vaughan Prain
    Russell Tytler
    [J]. Research in Science Education, 2022, 52 : 805 - 817
  • [3] Theorising Learning in Science Through Integrating Multimodal Representations
    Prain, Vaughan
    Tytler, Russell
    [J]. RESEARCH IN SCIENCE EDUCATION, 2022, 52 (03) : 805 - 817
  • [4] Influence of lossy compression on eye movement signals
    Juhola, M
    Tossavainen, T
    Aalto, H
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2004, 34 (03) : 221 - 239
  • [5] Lossy compression of auditory brainstem response signals
    Tossavainen, T
    Juhola, M
    Grönfors, T
    [J]. MEDICAL INFOBAHN FOR EUROPE, PROCEEDINGS, 2000, 77 : 1250 - 1255
  • [6] A New Lossy Compression Algorithm for Ultrasound Signals
    Freitas, Miguel de A.
    Jimenez, Marcelo R.
    Benincaza, Henrique
    von der Weid, Jean P.
    [J]. 2008 IEEE ULTRASONICS SYMPOSIUM, VOLS 1-4 AND APPENDIX, 2008, : 1885 - 1888
  • [7] Lossy compression of EEG signals using SPIHT
    Higgins, G.
    McGinley, B.
    Walsh, N.
    Glavin, M.
    Jones, E.
    [J]. ELECTRONICS LETTERS, 2011, 47 (18) : 1017 - U1548
  • [8] Lossy Image Compression in a Preclinical Multimodal Imaging Study
    Cunha, Francisco F.
    Blueml, Valentin
    Zopf, Lydia M.
    Walter, Andreas
    Wagner, Michael
    Weninger, Wolfgang J.
    Thomaz, Lucas A.
    Tavora, Luis M. N.
    Cruz, Luis da Silva A.
    Faria, Sergio M. M.
    [J]. JOURNAL OF DIGITAL IMAGING, 2023, 36 (04) : 1826 - 1850
  • [9] L2 vocabulary learning through multimodal representations
    Zarei, Gholam Reza
    Khazaie, Saeed
    [J]. 3RD WORLD CONFERENCE ON EDUCATIONAL SCIENCES - 2011, 2011, 15 : 369 - 375
  • [10] Lossy Image Compression in a Preclinical Multimodal Imaging Study
    Francisco F. Cunha
    Valentin Blüml
    Lydia M. Zopf
    Andreas Walter
    Michael Wagner
    Wolfgang J. Weninger
    Lucas A. Thomaz
    Luís M. N. Tavora
    Luis A. da Silva Cruz
    Sergio M. M. Faria
    [J]. Journal of Digital Imaging, 2023, 36 : 1826 - 1850