Donate Speech: Collecting and Sharing a Large-Scale Speech Database for Social Sciences, Humanities and Artificial Intelligence Research and Innovation
| dc.contributor.author | Lindén Krister | |
| dc.contributor.author | Jauhiainen Tommi | |
| dc.contributor.author | Lennes Mietta | |
| dc.contributor.author | Kurimo Mikko | |
| dc.contributor.author | Rossi Aleksi | |
| dc.contributor.author | Kurki Tommi | |
| dc.contributor.author | Pitkänen Olli. | |
| dc.contributor.organization | fi=kotimaiset kielet ja niiden sukukielet|en=Finnish, Finno-Ugric and Scandinavian languages| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.59108485091 | |
| dc.converis.publication-id | 176591290 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/176591290 | |
| dc.date.accessioned | 2025-08-28T00:31:31Z | |
| dc.date.available | 2025-08-28T00:31:31Z | |
| dc.description.abstract | <p> <span>The Donate Speech campaign aimed to collect 10,000 hours of ordinary, </span><span>casual Finnish speech to be used for studying language as well as for develop</span><span>-</span><span>ing technology and services that can be readily used in the languages spoken in </span><span>Finland. In this project, particular attention has been devoted to allowing for both </span><span>academic and commercial use of the material. Even though this ambitious target </span><span>currently seems likely to evade us, the Donate Speech campaign has managed </span><span>to amass an extensive resource of more than 4,000 hours of Finnish colloquial </span><span>speech comprising more than 220,000 speech recordings by more than 25,000 </span><span>speakers from all over Finland in just a few months.</span><span></span><br></p><p><span>Keywords:</span><span> speech resources, colloquial speech, large-scale data collection, aca</span><span>-</span><span>demic and commercial use</span> <br></p> | |
| dc.format.pagerange | 481 | |
| dc.format.pagerange | 510 | |
| dc.identifier.eisbn | 978-3-11-076737-7 | |
| dc.identifier.isbn | 978-3-11-076734-6 | |
| dc.identifier.olddbid | 205871 | |
| dc.identifier.oldhandle | 10024/188898 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/35559 | |
| dc.identifier.url | https://doi.org/10.1515/9783110767377-019 | |
| dc.identifier.urn | URN:NBN:fi-fe2022102462987 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Kurki, Tommi | |
| dc.okm.discipline | 6121 Languages | en_GB |
| dc.okm.discipline | 6121 Kielitieteet | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A3 Book | |
| dc.publisher | De Gruyter | |
| dc.publisher.country | Germany | en_GB |
| dc.publisher.country | United States | en_GB |
| dc.publisher.country | Saksa | fi_FI |
| dc.publisher.country | Yhdysvallat (USA) | fi_FI |
| dc.publisher.country-code | DE | |
| dc.publisher.country-code | US | |
| dc.publisher.isbn | 978-3-11; 978-3-484; 978-3-597; 978-3-598; 978-3-7940; 978-3-11-025877-6 | |
| dc.publisher.place | Berlin & Boston | |
| dc.relation.doi | 10.1515/9783110767377-019 | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/188898 | |
| dc.title | Donate Speech: Collecting and Sharing a Large-Scale Speech Database for Social Sciences, Humanities and Artificial Intelligence Research and Innovation | |
| dc.title.book | CLARIN: The Infrastructure for Language Resources | |
| dc.year.issued | 2022 |
Tiedostot
1 - 1 / 1
Ladataan...
- Name:
- 10.1515_9783110767377-019.pdf
- Size:
- 1.89 MB
- Format:
- Adobe Portable Document Format