Extracting Geographical References from Finnish Literature: Fully Automated Processing of Plain-Text Corpora
Kiiskinen Harri; Nivala Asko; Westerlund Jasmine; Saarelainen Juhana
Extracting Geographical References from Finnish Literature: Fully Automated Processing of Plain-Text Corpora
Kiiskinen Harri
Nivala Asko
Westerlund Jasmine
Saarelainen Juhana
Julkaisun pysyvä osoite on:
https://urn.fi/URN:NBN:fi-fe2025082789879
https://urn.fi/URN:NBN:fi-fe2025082789879
Tiivistelmä
In the Atlas of Finnish Literature 1870-1940 project, we extract geo- graphical information from a Finnish-language corpus of literary texts published between 1870 and 1940. The texts are transformed from plain texts to TEI/XML, and further processed with named entity recognition and linking tools. The results are presented in a web-based environment. This article describes the technical structure of the analysis chain, the tools used and the metaprocesses used to manage the research dataset.
Kokoelmat
- Rinnakkaistallenteet [27094]