Automatic Classification of Strain in the Singing Voice Using Machine Learning

Liu, Yuanyuan; Mittapalle, Kiran Reddy; Yagnavajjula, Madhu Keerthana; Räsänen, Okko; Alku, Paavo; Ikävalko, Tero; Hakanpää, Tua; Öyry, Aleksi; Laukkanen, Anne-Maria

Automatic Classification of Strain in the Singing Voice Using Machine Learning

dc.contributor.author	Liu, Yuanyuan
dc.contributor.author	Mittapalle, Kiran Reddy
dc.contributor.author	Yagnavajjula, Madhu Keerthana
dc.contributor.author	Räsänen, Okko
dc.contributor.author	Alku, Paavo
dc.contributor.author	Ikävalko, Tero
dc.contributor.author	Hakanpää, Tua
dc.contributor.author	Öyry, Aleksi
dc.contributor.author	Laukkanen, Anne-Maria
dc.contributor.organization	fi=opettajankoulutuslaitos (Rauma)\|en=Department of Teacher Education (Rauma)\|
dc.contributor.organization-code	1.2.246.10.2458963.20.99310884848
dc.converis.publication-id	491612849
dc.converis.url	https://research.utu.fi/converis/portal/Publication/491612849
dc.date.accessioned	2025-08-27T23:47:50Z
dc.date.available	2025-08-27T23:47:50Z
dc.description.abstract	<p><b>Objectives</b><br>Classifying strain in the singing voice can help protect professional singers from vocal overuse and support singing training. This study investigates whether machine learning can automatically classify singing voices into two levels of perceived strain. The singing samples represent two genres: classical and contemporary commercial music (CCM).<br><b>Methods</b><br>A total of 324 singing voice samples from 15 professional normophonic singers (nine female, six male) were analyzed. Nine singers were classical, and six were CCM singers. The samples consisted of syllable strings produced at three to six pitches and three loudness levels. Based on expert auditory-perceptual ratings, the samples were categorized into two strain levels: normal-mild and moderate-severe. Three acoustic feature sets (mel-frequency cepstral coefficients (MFCCs), the extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS), and wavelet scattering features) were compared using two classifier models [support vector machine (SVM) and multilayer perceptron (MLP)]. Feature selection was performed using recursive feature elimination, and the Mann-Whitney U test was used to assess the discriminative power of the selected features.<br><b>Results</b><br>The highest classification accuracy of 86.1% was achieved using a subset of wavelet scattering features with the MLP classifier. A comparison between individual features showed that the first MFCC coefficient, representing spectral tilt, exhibited the greatest between-class separation.<br><b>Conclusion</b><br>This study demonstrates that machine learning models utilizing selected acoustic features can classify perceptual strain of singing voices automatically with high accuracy. These preliminary findings highlight the potential for larger studies involving more diverse singer groups across different genres.<br></p>
dc.identifier.eissn	1873-4588
dc.identifier.jour-issn	0892-1997
dc.identifier.olddbid	204638
dc.identifier.oldhandle	10024/187665
dc.identifier.uri	https://www.utupub.fi/handle/11111/53206
dc.identifier.url	https://www.jvoice.org/article/S0892-1997(25)00134-1/fulltext
dc.identifier.urn	URN:NBN:fi-fe2025082790510
dc.language.iso	en
dc.okm.affiliatedauthor	Hakanpää, Tua
dc.okm.discipline	112 Statistics and probability	en_GB
dc.okm.internationalcopublication	not an international co-publication
dc.okm.internationality	International publication
dc.okm.type	A1 ScientificArticle
dc.publisher	Elsevier BV
dc.publisher.country	United States	en_GB
dc.publisher.country	Yhdysvallat (USA)	fi_FI
dc.publisher.country-code	US
dc.relation.doi	10.1016/j.jvoice.2025.03.040
dc.relation.ispartofjournal	Journal of Voice
dc.source.identifier	https://www.utupub.fi/handle/10024/187665
dc.title	Automatic Classification of Strain in the Singing Voice Using Machine Learning
dc.year.issued	2025

Tiedostot

Näytetään 1 - 1 / 1

Name:: PIIS0892199725001341.pdf
Size:: 923.63 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet