An Eye for AI: A Multimodal Bottleneck Transformer Approach for Predicting Individual Eye Movements : Towards Foundation Models for Human Factors & Neuroscience

dc.contributor.authorDolmans, Tenzing
dc.contributor.departmentfi=Kliininen laitos|en=Department of Clinical Medicine|
dc.contributor.facultyfi=Lääketieteellinen tiedekunta|en=Faculty of Medicine|
dc.contributor.studysubjectfi=Kliiniset neurotieteet|en=Clinical Neurosciences|
dc.date.accessioned2023-08-28T11:02:24Z
dc.date.available2023-08-28T11:02:24Z
dc.date.issued2023-06-19
dc.description.abstractHuman perception has been a subject of study for centuries. Various eye tracking methods in many study designs have shed light on individual differences in perception and visual navigation. However, accurately identifying individuals based on gaze behaviour remains a challenge. Artificial intelligence (AI) based methods have led to large successes in domains such as vision and language; they are also making their introduction in human factors & neuroscience (HFN). Leveraging AI for HFN requires quantities of data several orders of magnitude larger than the field is used to organising; there exists a clear discrepancy in the standardisation of data publication. In this work, we work towards foundation models (FM) for HFN by highlighting important data insights from AI. A multimodal bottleneck transformer is proposed, a model architecture that can effectively and efficiently represent and work with the varying modalities encountered in HFN. Results indicate that classification of individuals and prediction of gaze is possible, given more training data.
dc.format.extent63
dc.identifier.olddbid192595
dc.identifier.oldhandle10024/175667
dc.identifier.urihttps://www.utupub.fi/handle/11111/18285
dc.identifier.urnURN:NBN:fi-fe2023072691718
dc.language.isoeng
dc.rightsfi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.|
dc.rights.accessrightsavoin
dc.source.identifierhttps://www.utupub.fi/handle/10024/175667
dc.subjecteye tracking, deep learning, standardisation, transformers, multimodal AI
dc.titleAn Eye for AI: A Multimodal Bottleneck Transformer Approach for Predicting Individual Eye Movements : Towards Foundation Models for Human Factors & Neuroscience
dc.type.ontasotfi=Pro gradu -tutkielma|en=Master's thesis|

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
MasterThesis_DolmansTenzing.pdf
Size:
1.06 MB
Format:
Adobe Portable Document Format