Dependency profiles as a tool for big data analysis of linguistic constructions: A case study of emoticons
| dc.contributor.author | Laippala V. | |
| dc.contributor.author | Kyröläinen A. | |
| dc.contributor.author | Kanerva J. | |
| dc.contributor.author | Luotolahti J. | |
| dc.contributor.author | Ginter F. | |
| dc.contributor.organization | fi=digitaalinen kielentutkimus, espanja, italia, kiina, ranska, saksa|en=Digital Language Studies, Chinese, French, German, Italian, Spanish| | |
| dc.contributor.organization | fi=kotimaiset kielet ja niiden sukukielet|en=Finnish, Finno-Ugric and Scandinavian languages| | |
| dc.contributor.organization | fi=tietojenkäsittelytiede|en=Computer Science| | |
| dc.contributor.organization | fi=tietotekniikan laitos|en=Department of Computing| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.36764574459 | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.85312822902 | |
| dc.contributor.organization-code | 2606803 | |
| dc.converis.publication-id | 27582771 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/27582771 | |
| dc.date.accessioned | 2025-08-28T02:40:18Z | |
| dc.date.available | 2025-08-28T02:40:18Z | |
| dc.description.abstract | <p>This study presents a methodological toolbox for big data analysis of linguistic constructions by introducing dependency profiles, i.e., co-occurrences of linguistic elements with syntax information. These were operationalized by reconstructing sentences as delexicalized syntactic biarcs, subtrees of dependency analyses. As a case study, we utilize these dependency profiles to explore usage patterns associated with emoticons, the graphic representations of facial expressions. These are said to be characteristic of Computer-Mediated Communication, but typically studied only in restricted corpora. To analyze the 3.7-billion token Finnish Internet Parsebank we use as data, we apply clustering and support vector machines. The results show that emoticons are associated with three typical usage patterns: stream of the writer’s consciousness, narrative constructions and elements guiding the interaction and expressing the writer’s reactions by means of interjections and discourse particles. Additionally, the more frequent emoticons, such as :), are used differently than the less frequent ones, such as ^_^.<br /></p> | |
| dc.format.pagerange | 127 | |
| dc.format.pagerange | 153 | |
| dc.identifier.eissn | 1736-8987 | |
| dc.identifier.jour-issn | 1736-8987 | |
| dc.identifier.olddbid | 209493 | |
| dc.identifier.oldhandle | 10024/192520 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/46262 | |
| dc.identifier.urn | URN:NBN:fi-fe2021042717516 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Laippala, Veronika | |
| dc.okm.affiliatedauthor | Kyröläinen, Aki | |
| dc.okm.affiliatedauthor | Kanerva, Jenna | |
| dc.okm.affiliatedauthor | Dataimport, Suomen kieli ja suom-ugrilainen kielent | |
| dc.okm.affiliatedauthor | Ginter, Filip | |
| dc.okm.discipline | 6121 Languages | en_GB |
| dc.okm.discipline | 6121 Kielitieteet | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A1 ScientificArticle | |
| dc.publisher | University of Tartu Press | |
| dc.publisher.country | Estonia | en_GB |
| dc.publisher.country | Viro | fi_FI |
| dc.publisher.country-code | EE | |
| dc.relation.doi | 10.12697/jeful.2017.8.2.05 | |
| dc.relation.ispartofjournal | Eesti ja soome-ugri keeleteaduse ajakiri | |
| dc.relation.issue | 2 | |
| dc.relation.volume | 8 | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/192520 | |
| dc.title | Dependency profiles as a tool for big data analysis of linguistic constructions: A case study of emoticons | |
| dc.year.issued | 2017 |
Tiedostot
1 - 1 / 1