Experimenting with GPT OSS 20B and LoRA Fine-tuning to build Cooking Recipes GPT

dc.contributor.authorPohto, Tony
dc.contributor.departmentfi=Tietotekniikan laitos|en=Department of Computing|
dc.contributor.facultyfi=Teknillinen tiedekunta|en=Faculty of Technology|
dc.contributor.studysubjectfi=Tietojenkäsittelytieteet|en=Computer Science|
dc.date.available2026-03-31T21:05:12Z
dc.date.issued2026-03-03
dc.description.abstractThe evolution of Large Language Models has been fast. Even some recent smaller and open-weight models have proved to be capable of various tasks. One of them is GPT OSS 20B which turns out to be capable of producing versatile, quality cooking recipes out-of-the box. There exists also specialized cooking and recipes related datasets which could be used with these LLMs to fine-tune them. But experiments suggest that such fine-tuning is often not needed. GPT OSS 20B has gone through already extensive post-training optimization and tuning. Yet, the fine-tuning techniques have also been developing and become more accessible to more developers and institutions. LoRA as technique, is very useful and lightweight and still worth to try. In this thesis, LoRA is used to fine tune GPT-OSS 20B with cooking related conversational dataset. Another dataset consisting of recipes, is used to evaluate the understanding of the LLM of the cooking domain. The results show that GPT OSS 20B didn’t really benefit from such light fine-tuning but instead shines already on it’s own. One restriction was hardware which lead to using small amount of data, only affecting the style of the LLM. Using larger, high quality and versatile datasets is one thing which could be tested and studied in future research.
dc.format.extent54
dc.identifier.olddbid214797
dc.identifier.oldhandle10024/197809
dc.identifier.urihttps://www.utupub.fi/handle/11111/58440
dc.identifier.urnURN:NBN:fi-fe2026033124461
dc.language.isoeng
dc.rightsfi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.|
dc.rights.accessrightsavoin
dc.source.identifierhttps://www.utupub.fi/handle/10024/197809
dc.subjectLLM, GPT, Transformer, Fine-Tuning, Transfer Learning, LoRA, PEFT, RAG, Prompt Engineering, GPT OSS, CoT, MoE
dc.titleExperimenting with GPT OSS 20B and LoRA Fine-tuning to build Cooking Recipes GPT
dc.type.ontasotfi=Pro gradu -tutkielma|en=Master's thesis|

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Pohto_Tony_opinnayte.pdf
Size:
865.76 KB
Format:
Adobe Portable Document Format