Neural network hate deletion: Developing a machine learning model to eliminate hate from online comments

dc.contributor.authorJoni Salminen
dc.contributor.authorJuhani Luotolahti
dc.contributor.authorHind Almerekhi
dc.contributor.authorBernard J. Jansen
dc.contributor.authorSoon-gyo Jung
dc.contributor.organizationfi=markkinointi|en=Marketing|
dc.contributor.organizationfi=tietojenkäsittelytiede|en=Computer Science|
dc.contributor.organization-code1.2.246.10.2458963.20.50826905346
dc.contributor.organization-code2606803
dc.converis.publication-id36541856
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/36541856
dc.date.accessioned2022-10-27T12:19:41Z
dc.date.available2022-10-27T12:19:41Z
dc.description.abstract<p>We propose a method for modifying hateful online comments to non-hateful comments without losing the understandability and original meaning of the comments. To accomplish this, we retrieve and classify 301,153 hateful and 1,041,490 non-hateful comments from Facebook and YouTube channels of a large international media organization that is a target of considerable online hate. We supplement this dataset by 10,000 Reddit comments manually labeled for hatefulness. Using these two datasets, we train a neural network to distinguish linguistic patterns. The model we develop, Neural Network Hate Deletion (NNHD), computes how hateful the sentences of a social media comment are and if they are above a given threshold, it deletes them using a language dependency tree. We evaluate the results by comparing crowd workers’ perceptions of hatefulness and understandability before and after transformation and find that our method reduces hatefulness without resulting in a significant loss of understandability. In some cases, removing hateful elements improves understandability by reducing the linguistic complexity of the comment. In addition, we find that NNHD can satisfactorily retain the original meaning on average but is not perfect in this regard. In terms of practical implications, NNHD could be used in social media platforms to suggest more neutral use of language to agitated online users.</p>
dc.format.pagerange25
dc.format.pagerange39
dc.identifier.eisbn978-3-030-01437-7
dc.identifier.isbn978-3-030-01436-0
dc.identifier.issn0302-9743
dc.identifier.jour-issn0302-9743
dc.identifier.olddbid174764
dc.identifier.oldhandle10024/157858
dc.identifier.urihttps://www.utupub.fi/handle/11111/34823
dc.identifier.urlhttps://link.springer.com/chapter/10.1007/978-3-030-01437-7_3
dc.identifier.urnURN:NBN:fi-fe2021042720117
dc.language.isoen
dc.okm.affiliatedauthorSalminen, Joni
dc.okm.affiliatedauthorLuotolahti, Matti
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.relation.conferenceInternational Conference on Internet Science
dc.relation.doi10.1007/978-3-030-01437-7_3
dc.relation.ispartofjournalLecture Notes in Computer Science
dc.relation.ispartofseriesLecture Notes in Computer Science
dc.relation.volume11193
dc.source.identifierhttps://www.utupub.fi/handle/10024/157858
dc.titleNeural network hate deletion: Developing a machine learning model to eliminate hate from online comments
dc.title.bookInternet Science: 5th International Conference, INSCI 2018, St. Petersburg, Russia, October 24–26, 2018, Proceedings
dc.year.issued2018

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Neural network hate deletion.pdf
Size:
697.12 KB
Format:
Adobe Portable Document Format