Developing an online hate classifier for multiple social media platforms

dc.contributor.authorJoni Salminen
dc.contributor.authorMaximilian Hopf
dc.contributor.authorShammur A. Chowdhury
dc.contributor.authorSoon-gyo Jung
dc.contributor.authorHind Almerekhi
dc.contributor.authorBernard Jansen
dc.contributor.organizationfi=markkinointi|en=Marketing|
dc.contributor.organization-code1.2.246.10.2458963.20.50826905346
dc.converis.publication-id45063523
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/45063523
dc.date.accessioned2022-10-28T12:20:11Z
dc.date.available2022-10-28T12:20:11Z
dc.description.abstract<p>The proliferation of social media enables people to express their opinions widely online. However, at the same time, this has resulted in the emergence of conflict and hate, making online environments uninviting for users. Although researchers have found that hate is a problem across multiple platforms, there is a lack of models for online hate detection using multi-platform data. To address this research gap, we collect a total of 197,566 comments from four platforms: YouTube, Reddit, Wikipedia, and Twitter, with 80% of the comments labeled as non-hateful and the remaining 20% labeled as hateful. We then experiment with several classification algorithms (Logistic Regression, Naïve Bayes, Support Vector Machines, XGBoost, and Neural Networks) and feature representations (Bag-of-Words, TF-IDF, Word2Vec, BERT, and their combination). While all the models significantly outperform the keyword-based baseline classifier, XGBoost using all features performs the best (F1 = 0.92). Feature importance analysis indicates that BERT features are the most impactful for the predictions. Findings support the generalizability of the best model, as the platform-specific results from Twitter and Wikipedia are comparable to their respective source papers. We make our code publicly available for application in real software systems as well as for further development by online hate researchers.</p>
dc.identifier.eissn2192-1962
dc.identifier.jour-issn2192-1962
dc.identifier.olddbid175919
dc.identifier.oldhandle10024/159013
dc.identifier.urihttps://www.utupub.fi/handle/11111/30086
dc.identifier.urnURN:NBN:fi-fe2021042824119
dc.language.isoen
dc.okm.affiliatedauthorSalminen, Joni
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherSpringer
dc.publisher.countryGermanyen_GB
dc.publisher.countrySaksafi_FI
dc.publisher.country-codeDE
dc.relation.articlenumber1
dc.relation.doi10.1186/s13673-019-0205-6
dc.relation.ispartofjournalHuman-Centric Computing and Information Sciences
dc.relation.issue1
dc.relation.volume10
dc.source.identifierhttps://www.utupub.fi/handle/10024/159013
dc.titleDeveloping an online hate classifier for multiple social media platforms
dc.year.issued2020

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
s13673-019-0205-6.pdf
Size:
1.87 MB
Format:
Adobe Portable Document Format
Description:
Publisher's version