Article

Towards an Acoustic-Semantic Space of Extreme Metal Vocal Styles (en)

* Presenting author
Day / Time: 19.03.2024, 15:20-16:00
Typ: Poster
Information: Die Poster werden durchgängig von Dienstag bis Donnerstag im Glasgang ausgestellt, sortiert nach thematischen Zusammenhängen in der hier angegebenen Posterinsel (siehe Sitzungstitel). Das Posterforum zur angegebenen Zeit bietet die Möglichkeit, mit den Autor*innen in Diskussion zu kommen.
Abstract: Extreme vocal techniques typical for metal, e.g. growling or screaming, are characterized by low harmonicity and high roughness and associated with expressive dimensions like "aggressiveness" [Tsai et al., 2010]. Based on audio features, classification into broad style categories is possible [Kalbag & Lerch, 2022]. Which audio features are associated with the perception of emerging expressive techniques/stylistic devices that go beyond previously known categories remains open.Short phrases were extracted from 105 metal vocal tracks, 10 pilot-rated by subjects for pairwise similarity (45 comparisons). The resulting similarity matrix serves as basis for a perceptual similarity space computed using multidimensional scaling (MDS). In another pilot experiment, free verbal associations are collected for all 105 excerpts.Preliminary analyses reveal a three-dimensional similarity space whose first major axis represents the contrast between harmonic vs. more inharmonic/rough singing (Harmonic-to-Noise Ratio: r=0.837, p=0.005; Spectral Complexity: r=-0.959, p<0.001). While the second perceptual dimension shows no correlations with extracted sound features, the third dimension relates to the position of the higher formants (e.g., F2: r=-0.855, p=0.003). When projecting the verbal associations into the acoustic space, descriptions such as "demonic," "scratchy," "powerful," or "angelic" are located at specific positions.