Classifying Web Exploits with Topic Modeling
| dc.contributor.author | Jukka Ruohonen | |
| dc.contributor.organization | fi=ohjelmistotekniikka|en=Software Engineering| | |
| dc.contributor.organization-code | 2610302 | |
| dc.converis.publication-id | 26902121 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/26902121 | |
| dc.date.accessioned | 2026-01-21T12:27:45Z | |
| dc.date.available | 2026-01-21T12:27:45Z | |
| dc.description.abstract | <p>This short empirical paper investigates how well topic modeling and database meta-data characteristics can classify web and other proof-of-concept (PoC) exploits for publicly disclosed software vulnerabilities. By using a dataset comprised of over 36 thousand PoC exploits, near a 0.9 accuracy rate is obtained in the empirical experiment. Text mining and topic modeling are a significant boost factor behind this classification performance. In addition to these empirical results, the paper contributes to the research tradition of enhancing software vulnerability information with text mining, providing also a few scholarly observations about the potential for semi-automatic classification of exploits in the existing tracking infrastructures.<br /></p> | |
| dc.format.pagerange | 93 | |
| dc.format.pagerange | 97 | |
| dc.identifier.eisbn | 978-1-5386-1051-0 | |
| dc.identifier.isbn | 978-1-5386-2207-0 | |
| dc.identifier.issn | 1529-4188 | |
| dc.identifier.olddbid | 212519 | |
| dc.identifier.oldhandle | 10024/195537 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/52334 | |
| dc.identifier.url | http://ieeexplore.ieee.org/document/8049693/ | |
| dc.identifier.urn | URN:NBN:fi-fe2021042717252 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Ruohonen, Jukka | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.publisher.country | United States | en_GB |
| dc.publisher.country | Yhdysvallat (USA) | fi_FI |
| dc.publisher.country-code | US | |
| dc.relation.conference | International Workshop on Database and Expert Systems Applications | |
| dc.relation.doi | 10.1109/DEXA.2017.35 | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/195537 | |
| dc.title | Classifying Web Exploits with Topic Modeling | |
| dc.title.book | Proceedings of the 28th International Workshop on Database and Expert Systems Applications (DEXA), 2017 | |
| dc.year.issued | 2017 |
Tiedostot
1 - 1 / 1
Ladataan...
- Name:
- vulntext.pdf
- Size:
- 83.48 KB
- Format:
- Adobe Portable Document Format
- Description:
- Post-print (final draft)