Combining support vector machines and LSTM networks for chemical-protein relation extraction

dc.contributor.authorFarrokh​ Mehryary
dc.contributor.author​​ Jari​ Björne​
dc.contributor.author​ Tapio​ Salakoski​
dc.contributor.author​ Filip​ Ginter​
dc.contributor.organizationfi=kieli- ja puheteknologia|en=Language and Speech Technology|
dc.contributor.organization-code1.2.246.10.2458963.20.47465613983
dc.converis.publication-id28819195
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/28819195
dc.date.accessioned2022-10-28T13:32:38Z
dc.date.available2022-10-28T13:32:38Z
dc.description.abstract<p>We present the results of our participation in the BioCreative VI: Text mining chemical-protein interactions (CHEMPROT) track. The goal of this task is to promote the development and evaluation of systems capable of extracting relations between chemical compounds/drug and genes/proteins from biomedical literature. We participate with two systems: (1) an SVM system which relies on a rich set of features extracted from the parse graph and (2) an ensemble of neural networks that utilize LSTM networks and generate features along the shortest path of dependencies. We also combine the predictions from the two systems with the goal of increasing performance. On the development set, our system combination approach outperforms the two individual systems, achieving an F-score of 61.09 (according to the official evaluation metric). On the test set, our SVM system achieves the highest result for our submissions with​ ​an​ ​ F-score​ ​ of​ ​ 60.99.</p>
dc.format.pagerange175
dc.format.pagerange179
dc.identifier.isbn978-84-948397-0-2
dc.identifier.olddbid182814
dc.identifier.oldhandle10024/165908
dc.identifier.urihttps://www.utupub.fi/handle/11111/40214
dc.identifier.urlhttp://www.biocreative.org/resources/publications/bcvi-proceedings/
dc.identifier.urnURN:NBN:fi-fe2021042718181
dc.language.isoen
dc.okm.affiliatedauthorMehryary, Farrokh
dc.okm.affiliatedauthorBjörne, Jari
dc.okm.affiliatedauthorSalakoski, Tapio
dc.okm.affiliatedauthorGinter, Filip
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.conferenceBioCreative
dc.source.identifierhttps://www.utupub.fi/handle/10024/165908
dc.titleCombining support vector machines and LSTM networks for chemical-protein relation extraction
dc.title.bookProceedings of the BioCreative VI Workshop
dc.year.issued2018

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
BC6_track5_9.pdf
Size:
220.5 KB
Format:
Adobe Portable Document Format
Description:
Publisher's PDF