Comparing Data Augmentation Methods for Synthesizer Parameter Estimation

Pasanen, Lassi

Comparing Data Augmentation Methods for Synthesizer Parameter Estimation

dc.contributor.author	Pasanen, Lassi
dc.contributor.department	fi=Tietotekniikan laitos\|en=Department of Computing\|
dc.contributor.faculty	fi=Teknillinen tiedekunta\|en=Faculty of Technology\|
dc.contributor.studysubject	fi=Tietojenkäsittelytieteet\|en=Computer Science\|
dc.date.accessioned	2026-06-15T19:32:21Z
dc.date.issued	2026-06-02
dc.description.abstract	Synthesizer parameter estimation is a machine learning task where we train a model to estimate synthesizer parameters for recreating a given target sound. This problem can be approached by randomly generating synthesizer sounds to be used as training data. This might cause the model to estimate parameters well for the synthesizer generated sounds, but not for other sounds, like instrument sounds recorded with a microphone. Data augmentation methods can help overcome this problem. This thesis compares two different data augmentation methods in the context of synthesizer parameter estimation. The first is augmentation applied during the process of synthesizing the sound by adding noise to the pitch and amplitude envelopes of the synthesizer. The second augmentation method applies masking to the spectrogram of the sound which is the input for the neural network. We compare applying no augmentation, only envelope augmentation, only spectrogram augmentation, and both augmentations. Evaluation is based on recorded instrument sounds by comparing the spectrograms of the predicted sound and the target sound. The results show slightly better performance when using only spectrogram augmentation, but the difference is subtle and it is difficult to say how significant it is. We do not get clear answers to the questions about how the augmentation methods affect the model’s performance, suggesting that further research with a different evaluation metric is needed.
dc.format.extent	64
dc.identifier.uri	https://www.utupub.fi/handle/11111/61990
dc.identifier.urn	URN:NBN:fi-fe2026061569984
dc.language.iso	eng
dc.rights	fi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.\|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.\|
dc.rights.accessrights	suljettu
dc.subject	synthesizer parameter estimation
dc.subject	data augmentation
dc.subject	synthetic data
dc.subject	machine learning
dc.title	Comparing Data Augmentation Methods for Synthesizer Parameter Estimation
dc.type.ontasot	fi=Pro gradu -tutkielma\|en=Master's thesis\|

Tiedostot

Näytetään 1 - 1 / 1

Name:: Pasanen_Lassi_opinnayte.pdf
Size:: 1.64 MB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Pro gradu -tutkielmat ja diplomityöt sekä syventävien opintojen opinnäytetyöt (rajattu näkyvyys)