Classifying Web Exploits with Topic Modeling

dc.contributor.authorJukka Ruohonen
dc.contributor.organizationfi=ohjelmistotekniikka|en=Software Engineering|
dc.contributor.organization-code2610302
dc.converis.publication-id26902121
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/26902121
dc.date.accessioned2026-01-21T12:27:45Z
dc.date.available2026-01-21T12:27:45Z
dc.description.abstract<p>This short empirical paper investigates how well topic modeling and database meta-data characteristics can classify web and other proof-of-concept (PoC) exploits for publicly disclosed software vulnerabilities. By using a dataset comprised of over 36 thousand PoC exploits, near a 0.9 accuracy rate is obtained in the empirical experiment. Text mining and topic modeling are a significant boost factor behind this classification performance. In addition to these empirical results, the paper contributes to the research tradition of enhancing software vulnerability information with text mining, providing also a few scholarly observations about the potential for semi-automatic classification of exploits in the existing tracking infrastructures.<br /></p>
dc.format.pagerange93
dc.format.pagerange97
dc.identifier.eisbn978-1-5386-1051-0
dc.identifier.isbn978-1-5386-2207-0
dc.identifier.issn1529-4188
dc.identifier.olddbid212519
dc.identifier.oldhandle10024/195537
dc.identifier.urihttps://www.utupub.fi/handle/11111/52334
dc.identifier.urlhttp://ieeexplore.ieee.org/document/8049693/
dc.identifier.urnURN:NBN:fi-fe2021042717252
dc.language.isoen
dc.okm.affiliatedauthorRuohonen, Jukka
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.conferenceInternational Workshop on Database and Expert Systems Applications
dc.relation.doi10.1109/DEXA.2017.35
dc.source.identifierhttps://www.utupub.fi/handle/10024/195537
dc.titleClassifying Web Exploits with Topic Modeling
dc.title.bookProceedings of the 28th International Workshop on Database and Expert Systems Applications (DEXA), 2017
dc.year.issued2017

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
vulntext.pdf
Size:
83.48 KB
Format:
Adobe Portable Document Format
Description:
Post-print (final draft)