Neural network hate deletion: Developing a machine learning model to eliminate hate from online comments

Joni Salminen; Juhani Luotolahti; Hind Almerekhi; Bernard J. Jansen; Soon-gyo Jung

Neural network hate deletion: Developing a machine learning model to eliminate hate from online comments

dc.contributor.author	Joni Salminen
dc.contributor.author	Juhani Luotolahti
dc.contributor.author	Hind Almerekhi
dc.contributor.author	Bernard J. Jansen
dc.contributor.author	Soon-gyo Jung
dc.contributor.organization	fi=tietojenkäsittelytiede\|en=Computer Science\|
dc.contributor.organization-code	1.2.246.10.2458963.20.50826905346
dc.contributor.organization-code	2606803
dc.converis.publication-id	36541856
dc.converis.url	https://research.utu.fi/converis/portal/Publication/36541856
dc.date.accessioned	2022-10-27T12:19:41Z
dc.date.available	2022-10-27T12:19:41Z
dc.description.abstract	<p>We propose a method for modifying hateful online comments to non-hateful comments without losing the understandability and original meaning of the comments. To accomplish this, we retrieve and classify 301,153 hateful and 1,041,490 non-hateful comments from Facebook and YouTube channels of a large international media organization that is a target of considerable online hate. We supplement this dataset by 10,000 Reddit comments manually labeled for hatefulness. Using these two datasets, we train a neural network to distinguish linguistic patterns. The model we develop, Neural Network Hate Deletion (NNHD), computes how hateful the sentences of a social media comment are and if they are above a given threshold, it deletes them using a language dependency tree. We evaluate the results by comparing crowd workers’ perceptions of hatefulness and understandability before and after transformation and find that our method reduces hatefulness without resulting in a significant loss of understandability. In some cases, removing hateful elements improves understandability by reducing the linguistic complexity of the comment. In addition, we find that NNHD can satisfactorily retain the original meaning on average but is not perfect in this regard. In terms of practical implications, NNHD could be used in social media platforms to suggest more neutral use of language to agitated online users.</p>
dc.format.pagerange	39
dc.identifier.eisbn	978-3-030-01437-7
dc.identifier.isbn	978-3-030-01436-0
dc.identifier.issn	0302-9743
dc.identifier.jour-issn	0302-9743
dc.identifier.olddbid	174764
dc.identifier.oldhandle	10024/157858
dc.identifier.uri	https://www.utupub.fi/handle/11111/34823
dc.identifier.url	https://link.springer.com/chapter/10.1007/978-3-030-01437-7_3
dc.identifier.urn	URN:NBN:fi-fe2021042720117
dc.language.iso	en
dc.okm.affiliatedauthor	Salminen, Joni
dc.okm.affiliatedauthor	Luotolahti, Matti
dc.okm.discipline	113 Computer and information sciences	en_GB
dc.okm.discipline	113 Tietojenkäsittely ja informaatiotieteet	fi_FI
dc.okm.internationalcopublication	international co-publication
dc.okm.internationality	International publication
dc.okm.type	A4 Conference Article
dc.relation.conference	International Conference on Internet Science
dc.relation.doi	10.1007/978-3-030-01437-7_3
dc.relation.ispartofjournal	Lecture Notes in Computer Science
dc.relation.ispartofseries	Lecture Notes in Computer Science
dc.relation.volume	11193
dc.source.identifier	https://www.utupub.fi/handle/10024/157858
dc.title	Neural network hate deletion: Developing a machine learning model to eliminate hate from online comments
dc.title.book	Internet Science: 5th International Conference, INSCI 2018, St. Petersburg, Russia, October 24–26, 2018, Proceedings
dc.year.issued	2018

Tiedostot

Näytetään 1 - 1 / 1

Name:: Neural network hate deletion.pdf
Size:: 697.12 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet