An Eye for AI: A Multimodal Bottleneck Transformer Approach for Predicting Individual Eye Movements : Towards Foundation Models for Human Factors & Neuroscience

Dolmans, Tenzing

An Eye for AI: A Multimodal Bottleneck Transformer Approach for Predicting Individual Eye Movements : Towards Foundation Models for Human Factors & Neuroscience

dc.contributor.author	Dolmans, Tenzing
dc.contributor.department	fi=Kliininen laitos\|en=Department of Clinical Medicine\|
dc.contributor.faculty	fi=Lääketieteellinen tiedekunta\|en=Faculty of Medicine\|
dc.contributor.studysubject	fi=Kliiniset neurotieteet\|en=Clinical Neurosciences\|
dc.date.accessioned	2023-08-28T11:02:24Z
dc.date.available	2023-08-28T11:02:24Z
dc.date.issued	2023-06-19
dc.description.abstract	Human perception has been a subject of study for centuries. Various eye tracking methods in many study designs have shed light on individual differences in perception and visual navigation. However, accurately identifying individuals based on gaze behaviour remains a challenge. Artificial intelligence (AI) based methods have led to large successes in domains such as vision and language; they are also making their introduction in human factors & neuroscience (HFN). Leveraging AI for HFN requires quantities of data several orders of magnitude larger than the field is used to organising; there exists a clear discrepancy in the standardisation of data publication. In this work, we work towards foundation models (FM) for HFN by highlighting important data insights from AI. A multimodal bottleneck transformer is proposed, a model architecture that can effectively and efficiently represent and work with the varying modalities encountered in HFN. Results indicate that classification of individuals and prediction of gaze is possible, given more training data.
dc.format.extent	63
dc.identifier.olddbid	192595
dc.identifier.oldhandle	10024/175667
dc.identifier.uri	https://www.utupub.fi/handle/11111/18285
dc.identifier.urn	URN:NBN:fi-fe2023072691718
dc.language.iso	eng
dc.rights	fi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.\|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.\|
dc.rights.accessrights	avoin
dc.source.identifier	https://www.utupub.fi/handle/10024/175667
dc.subject	eye tracking, deep learning, standardisation, transformers, multimodal AI
dc.title	An Eye for AI: A Multimodal Bottleneck Transformer Approach for Predicting Individual Eye Movements : Towards Foundation Models for Human Factors & Neuroscience
dc.type.ontasot	fi=Pro gradu -tutkielma\|en=Master's thesis\|

Tiedostot

Näytetään 1 - 1 / 1

Name:: MasterThesis_DolmansTenzing.pdf
Size:: 1.06 MB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Pro gradu -tutkielmat ja diplomityöt sekä syventävien opintojen opinnäytetyöt (kokotekstit)