Comparing Data Augmentation Methods for Synthesizer Parameter Estimation
| dc.contributor.author | Pasanen, Lassi | |
| dc.contributor.department | fi=Tietotekniikan laitos|en=Department of Computing| | |
| dc.contributor.faculty | fi=Teknillinen tiedekunta|en=Faculty of Technology| | |
| dc.contributor.studysubject | fi=Tietojenkäsittelytieteet|en=Computer Science| | |
| dc.date.accessioned | 2026-06-15T19:32:21Z | |
| dc.date.issued | 2026-06-02 | |
| dc.description.abstract | Synthesizer parameter estimation is a machine learning task where we train a model to estimate synthesizer parameters for recreating a given target sound. This problem can be approached by randomly generating synthesizer sounds to be used as training data. This might cause the model to estimate parameters well for the synthesizer generated sounds, but not for other sounds, like instrument sounds recorded with a microphone. Data augmentation methods can help overcome this problem. This thesis compares two different data augmentation methods in the context of synthesizer parameter estimation. The first is augmentation applied during the process of synthesizing the sound by adding noise to the pitch and amplitude envelopes of the synthesizer. The second augmentation method applies masking to the spectrogram of the sound which is the input for the neural network. We compare applying no augmentation, only envelope augmentation, only spectrogram augmentation, and both augmentations. Evaluation is based on recorded instrument sounds by comparing the spectrograms of the predicted sound and the target sound. The results show slightly better performance when using only spectrogram augmentation, but the difference is subtle and it is difficult to say how significant it is. We do not get clear answers to the questions about how the augmentation methods affect the model’s performance, suggesting that further research with a different evaluation metric is needed. | |
| dc.format.extent | 64 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/61990 | |
| dc.identifier.urn | URN:NBN:fi-fe2026061569984 | |
| dc.language.iso | eng | |
| dc.rights | fi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.| | |
| dc.rights.accessrights | suljettu | |
| dc.subject | synthesizer parameter estimation | |
| dc.subject | data augmentation | |
| dc.subject | synthetic data | |
| dc.subject | machine learning | |
| dc.title | Comparing Data Augmentation Methods for Synthesizer Parameter Estimation | |
| dc.type.ontasot | fi=Pro gradu -tutkielma|en=Master's thesis| |
Tiedostot
1 - 1 / 1