Android Malware Detection using LLM

dc.contributor.authorKarunarathna Rajapakshe Mudiyanselage, Madura
dc.contributor.departmentfi=Tietotekniikan laitos|en=Department of Computing|
dc.contributor.facultyfi=Teknillinen tiedekunta|en=Faculty of Technology|
dc.contributor.studysubjectfi=Tietotekniikka|en=Information and Communication Technology|
dc.date.accessioned2025-08-08T21:05:07Z
dc.date.available2025-08-08T21:05:07Z
dc.date.issued2025-07-31
dc.description.abstractAndroid’s widespread adoption has made it a prime target for mobile malware, posing significant threats to user privacy, financial security and device security. Traditional malware detection approaches, such as signature-based and heuristic algorithms, frequently fail to keep up with the dynamic nature of Android malware. To solve this issue, this thesis provides a static analysis-based malware detection system that uses fine-tuned transformer models, notably BERT, to categorize Android apps. The system extracts permission ermissions from AndroidManifest.xml and API call sequences from smali code, which are then treated as textual features suitable for language model input. Three different BERT-based classifiers were trained: one with permissions, one with API calls, and one with a combined feature set. The final categorization decision is determined using an ensemble majority-voting approach. Experimental results from the CIC-AndMal2017 dataset indicate that the combined-feature model outperformed both single-feature models and traditional baselines, with an accuracy of 92% and an F1-score of 0.915. The system was deployed as a real-time detection service with a FastAPI backend and a React-based web frontend, allowing for easy malware investigation. This research illustrates the feasibility of using large language models for static malware detection and provides a scalable framework for incorporating future dynamic or hybrid analysis methods.
dc.format.extent84
dc.identifier.olddbid199706
dc.identifier.oldhandle10024/182734
dc.identifier.urihttps://www.utupub.fi/handle/11111/10854
dc.identifier.urnURN:NBN:fi-fe2025080881601
dc.language.isoeng
dc.rightsfi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.|
dc.rights.accessrightsavoin
dc.source.identifierhttps://www.utupub.fi/handle/10024/182734
dc.subjectAndroid Malware, APK, LLM, Malware Analysis
dc.titleAndroid Malware Detection using LLM
dc.type.ontasotfi=Pro gradu -tutkielma|en=Master's thesis|

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Madura_Master_s_Thesis_Final_2304031.pdf
Size:
4.75 MB
Format:
Adobe Portable Document Format