Exploring Register Variation in Turkish Web Corpus
| dc.contributor.author | Erten Selcen | |
| dc.contributor.organization | fi=kieli- ja käännöstieteiden laitos|en=School of Languages and Translation Studies| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.56461112866 | |
| dc.converis.publication-id | 380878163 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/380878163 | |
| dc.date.accessioned | 2025-08-28T01:38:57Z | |
| dc.date.available | 2025-08-28T01:38:57Z | |
| dc.description.abstract | In linguistics, web registers are language varieties that occur on the web, such as news reports and editorials. Most previous web register research has been done on Indo-European languages. In addition, previous research has mainly focused on restricted corpora with predetermined registers. This article describes Turkish web registers. The data is a Turkish web register corpus consisting of 2601 web texts. A taxonomy was adapted for labeling these texts by register. The texts were annotated manually with the adapted taxonomy, and the registers were defined accordingly. Text dispersion keyword analysis was used to generate keywords for the registers and to examine their basic linguistic characteristics. The results show the web registers that exist for Turkish and the linguistic characteristics associated with the news report and editorial registers. Keywords: Turkish web registers, manual annotation, text dispersion keyword analysis. | |
| dc.identifier.isbn | 978-3-937241-95-1 | |
| dc.identifier.olddbid | 207839 | |
| dc.identifier.oldhandle | 10024/190866 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/57283 | |
| dc.identifier.url | https://doi.org/10.14618/1z5k-pb25 | |
| dc.identifier.urn | URN:NBN:fi-fe2025082791784 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Erten Johansson, Selcen | |
| dc.okm.discipline | 518 Media and communications | en_GB |
| dc.okm.discipline | 518 Media- ja viestintätieteet | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.publisher.country | Germany | en_GB |
| dc.publisher.country | Saksa | fi_FI |
| dc.publisher.country-code | DE | |
| dc.publisher.place | Leibniz-Institut für Deutsche Sprache, Mannheim | |
| dc.relation.conference | International Conference on CMC and Social Media Corpora for the Humanities | |
| dc.relation.doi | 10.14618/1z5k-pb25 | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/190866 | |
| dc.title | Exploring Register Variation in Turkish Web Corpus | |
| dc.title.book | Proceedings of the 10th International Conference on CMC and Social Media Corpora for the Humanities | |
| dc.year.issued | 2023 |
Files
- Name: Erten_Exploring register variation in Turkish web corpus.pdf
- Size: 398.35 KB
- Format: Adobe Portable Document Format