Validating and Extracting Information from National Identification Numbers in R: The Case of Finland and Sweden

dc.contributor.authorKantanen, Pyry
dc.contributor.authorBülow, Erik
dc.contributor.authorLahtinen, Aleksi
dc.contributor.authorMagnusson, Måns
dc.contributor.authorPaananen, Jussi
dc.contributor.authorLahti, Leo
dc.contributor.organizationfi=data-analytiikka|en=Data-analytiikka|
dc.contributor.organization-code1.2.246.10.2458963.20.68940835793
dc.converis.publication-id505865827
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/505865827
dc.date.accessioned2026-01-21T14:59:12Z
dc.date.available2026-01-21T14:59:12Z
dc.description.abstract<p>National identification numbers (NIN) and similar identification code systems are widely used for uniquely identifying individuals and organizations in Finland, Sweden, and many other countries. To increase the general understanding of such techniques of identification, openly available methods and tools for NIN analysis and validation are needed. The hetu and sweidnumbr R packages provide functions for extracting embedded information, checking the validity, and generating random but valid numbers in the context of Finnish and Swedish NINs and other identification codes. In this article, we demonstrate these functions from both packages and provide theoretical context and motivation on the importance of the subject matter. Our work contributes to the growing toolkit of standardized methods for computational social science research, epidemiology, demographic studies, and other register-based inquiries.<br></p>
dc.format.pagerange14
dc.format.pagerange4
dc.identifier.eissn2073-4859
dc.identifier.olddbid213954
dc.identifier.oldhandle10024/196972
dc.identifier.urihttps://www.utupub.fi/handle/11111/56183
dc.identifier.urlhttps://doi.org/10.32614/rj-2024-023
dc.identifier.urnURN:NBN:fi-fe202601216320
dc.language.isoen
dc.okm.affiliatedauthorKantanen, Pyry
dc.okm.affiliatedauthorLahtinen, Aleksi
dc.okm.affiliatedauthorLahti, Leo
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherThe R Foundation
dc.publisher.countryAustriaen_GB
dc.publisher.countryItävaltafi_FI
dc.publisher.country-codeAT
dc.relation.doi10.32614/rj-2024-023
dc.relation.ispartofjournalThe R journal
dc.relation.issue3
dc.relation.volume16
dc.source.identifierhttps://www.utupub.fi/handle/10024/196972
dc.titleValidating and Extracting Information from National Identification Numbers in R: The Case of Finland and Sweden
dc.year.issued2024

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
kantanen_etal_2024.pdf
Size:
225.96 KB
Format:
Adobe Portable Document Format