Intonation contour similarity: f0 representations and distance measures compared to human perception in two languages

被引:2
|
作者
Kaland, Constantijn [1 ]
机构
[1] Univ Cologne, Inst Linguist, Cologne, Germany
来源
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2023年 / 154卷 / 01期
关键词
WORD STRESS; PAPUAN; PSYTOOLKIT;
D O I
10.1121/10.0019850
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, cluster analysis on f0 contours has become a popular method in phonetic research. Cluster analysis provides an automated way of categorising f0 contours, which gives new insights into (phonological) categories of intonation that vary across languages. As cluster analysis can be performed in many different ways, it is important to understand the extent to which these analyses can capture human perception of f0. This study focuses on the way in which f0 contours and differences among them are represented numerically, i.e., a crucial methodological choice preceding cluster analysis. These representations are then compared to the way in which f0 contour differences are perceived by human listeners from two different languages. To this end, four time-series contour representations (equivalent rectangular bandwidth, standardisation, octave-median rescaling, first derivative) and three distance measures [Euclidean distance (L2 norm), Pearson correlation, and dynamic time warping) were tested. The perceived differences were obtained from listeners of German and Papuan Malay, two typologically different languages. Results show that computed contour differences reflect human perception moderately, with dynamic time warping applied to the first derivative of the contour performing best, and showing minimal differences between the languages.
引用
收藏
页码:95 / 107
页数:13
相关论文
empty
未找到相关数据