
Wide-scope biomedical named entity recognition and normalization with CRFs, fuzzy matching and character level modeling

Niko Miekka; Filip Ginter; Tapio Salakoski; Suwisa Kaewphan; Kai Hakala


Oxford University Press
doi:10.1093/database/bay096
URI
https://academic.oup.com/database/article/doi/10.1093/database/bay096/5101499
The permanent address of this publication is:
https://urn.fi/URN:NBN:fi-fe2021042719760
Abstract

We present a system for automatically identifying a multitude of
biomedical entities from the literature. This work is based on our
previous efforts in the BioCreative VI: Interactive Bio-ID Assignment
shared task in which our system demonstrated state-of-the-art
performance with the highest achieved results in named entity
recognition. In this paper we describe the original conditional random
field-based system used in the shared task as well as experiments
conducted since, including better hyperparameter tuning and character
level modeling, which led to further performance improvements. For
normalizing the mentions into unique identifiers we use fuzzy character n-gram
matching. The normalization approach has also been improved with a
better abbreviation resolution method and stricter guideline compliance,
resulting in vastly improved results for various entity types. All tools
and models used for both named entity recognition and normalization are
publicly available under an open license.
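
To make the normalization idea concrete, the following is a minimal sketch of fuzzy character n-gram matching, not the authors' implementation. It assumes character trigrams, cosine similarity over n-gram counts, and a fixed acceptance threshold. The function names, the threshold value, and the toy lexicon with its placeholder identifiers are all hypothetical.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text, n=3):
    """Return a bag of character n-grams for a lowercased, space-padded string."""
    pad = " " * (n - 1)
    padded = f"{pad}{text.lower()}{pad}"
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = sqrt(sum(c * c for c in a.values())) * sqrt(sum(c * c for c in b.values()))
    return dot / norm if norm else 0.0


def normalize(mention, lexicon, threshold=0.5):
    """Return the identifier of the most similar lexicon name, or None.

    The threshold is an assumed cut-off below which a mention is left unlinked.
    """
    query = char_ngrams(mention)
    best_id, best_score = None, 0.0
    for name, identifier in lexicon.items():
        score = cosine(query, char_ngrams(name))
        if score > best_score:
            best_id, best_score = identifier, score
    return best_id if best_score >= threshold else None


# Hypothetical toy lexicon; the identifiers are placeholders, not real database IDs.
lexicon = {
    "tumor necrosis factor": "ID:0001",
    "interleukin 6": "ID:0002",
    "epidermal growth factor receptor": "ID:0003",
}

print(normalize("tumour necrosis factor", lexicon))  # spelling variant of the first entry
print(normalize("xyz", lexicon))                     # shares no trigram with any entry
```

Because the comparison works on overlapping character fragments rather than whole tokens, spelling variants and small typos still share most of their n-grams with the lexicon entry, while unrelated strings share few or none and fall below the threshold.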


Turku University Library | University of Turku
julkaisut@utu.fi