Information encoding by deep neural networks: what can we learn?

Cited by: 3
Authors
ten Bosch, L. [1 ,2 ]
Boves, L. [1 ]
Affiliations
[1] Radboud Univ Nijmegen, Nijmegen, Netherlands
[2] Max Planck Inst Psycholinguist, Nijmegen, Netherlands
Keywords
deep neural networks; conventional knowledge; information encoding; structure discovery;
DOI
10.21437/Interspeech.2018-1896
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The recent advent of deep learning techniques in speech technology, and in particular in automatic speech recognition, has yielded substantial performance improvements. This suggests that deep neural networks (DNNs) are able to capture structure in speech data that older methods for acoustic modeling, such as Gaussian Mixture Models and shallow neural networks, fail to uncover. In image recognition it is possible to link representations on the first couple of layers in DNNs to structural properties of images, and to representations on early layers in the visual cortex. This raises the question whether it is possible to accomplish a similar feat with representations on DNN layers when processing speech input. In this paper we present three different experiments in which we attempt to untangle how DNNs encode speech signals, and to relate these representations to phonetic knowledge, with the aim of advancing conventional phonetic concepts and of choosing the topology of a DNN more efficiently. Two experiments investigate representations formed by auto-encoders. A third experiment investigates representations on convolutional layers that treat speech spectrograms as if they were images. The results lay the basis for future experiments with recursive networks.
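The abstract mentions auto-encoders and convolutional layers that treat speech spectrograms as images. As an illustration only, and not the authors' implementation, the following minimal PyTorch sketch shows a small convolutional auto-encoder over a log-mel spectrogram whose bottleneck activations could be extracted and related to phonetic categories; all layer sizes, names, and the dummy input shape are assumptions chosen for the example.

# Illustrative sketch (assumed architecture, not from the paper): a convolutional
# auto-encoder that treats a (mel bands x frames) spectrogram as a 1-channel image
# and exposes its bottleneck representation for analysis.
import torch
import torch.nn as nn

class SpecAutoEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        # Encoder: two strided convolutions downsample the spectrogram "image".
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
        )
        # Decoder: mirrored transposed convolutions reconstruct the input.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, kernel_size=3, stride=2, padding=1, output_padding=1),
            nn.ReLU(),
            nn.ConvTranspose2d(16, 1, kernel_size=3, stride=2, padding=1, output_padding=1),
        )

    def forward(self, x):
        z = self.encoder(x)            # bottleneck activations to probe
        return self.decoder(z), z

model = SpecAutoEncoder()
spec = torch.randn(1, 1, 64, 128)      # dummy batch: 64 mel bands x 128 frames
recon, code = model(spec)
loss = nn.functional.mse_loss(recon, spec)  # reconstruction objective
print(code.shape)                       # intermediate representation for inspection

The captured bottleneck tensor could, for example, be correlated with frame-level phone labels to probe what structure the hidden layers encode, in the spirit of the experiments described above.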
Pages: 1457-1461
Page count: 5