Exploring Register Variation in Turkish Web Corpus

dc.contributor.authorErten Selcen
dc.contributor.organizationfi=kieli- ja käännöstieteiden laitos|en=School of Languages and Translation Studies|
dc.contributor.organization-code1.2.246.10.2458963.20.56461112866
dc.converis.publication-id380878163
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/380878163
dc.date.accessioned2025-08-28T01:38:57Z
dc.date.available2025-08-28T01:38:57Z
dc.description.abstract<p><br></p><p>In linguistics, web registers are language varieties occurring on the web such as news reports and editorials. Most of the previous web register research has been done for Indo-European languages. Additionally, previous research has mainly focused on the restricted corpora with pre-determined registers. This article describes Turkish web registers on the web. The data is Turkish web register corpus which consists of 2601 web texts. A taxonomy was adapted to register label these texts. The manual annotations of the texts were done with the adapted taxonomy, and the registers were defined accordingly. Text dispersion keyword analysis was used to generate the keywords of the registers and examine the basic linguistic characteristics of them. The results display the web registers existing for Turkish, and the linguistic characteristics associated with the news report and editorial registers. Keywords: Turkish web registers, manual annotation, text dispersion keyword analysis.<br></p>
dc.identifier.isbn978-3-937241-95-1
dc.identifier.olddbid207839
dc.identifier.oldhandle10024/190866
dc.identifier.urihttps://www.utupub.fi/handle/11111/57283
dc.identifier.urlhttps://doi.org/10.14618/1z5k-pb25
dc.identifier.urnURN:NBN:fi-fe2025082791784
dc.language.isoen
dc.okm.affiliatedauthorErten Johansson, Selcen
dc.okm.discipline518 Media and communicationsen_GB
dc.okm.discipline518 Media- ja viestintätieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryGermanyen_GB
dc.publisher.countrySaksafi_FI
dc.publisher.country-codeDE
dc.publisher.placeLeibniz-Institut für Deutsche Sprache, Mannheim
dc.relation.conferenceInternational Conference on CMC and Social Media Corpora for the Humanities
dc.relation.doi10.14618/1z5k-pb25
dc.source.identifierhttps://www.utupub.fi/handle/10024/190866
dc.titleExploring Register Variation in Turkish Web Corpus
dc.title.bookProceedings of the 10th International Conference on CMC and Social Media Corpora for the Humanities
dc.year.issued2023

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Erten_Exploring register variation in Turkish web corpus.pdf
Size:
398.35 KB
Format:
Adobe Portable Document Format