A Deep Dive into Multi-Head Attention and Multi-Aspect Embedding

dc.contributor.authorTeimouri, Maryam
dc.contributor.authorKanerva, Jenna
dc.contributor.authorGinter, Filip
dc.contributor.organizationfi=data-analytiikka|en=Data-analytiikka|
dc.contributor.organization-code1.2.246.10.2458963.20.68940835793
dc.converis.publication-id500281244
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/500281244
dc.date.accessioned2026-01-21T14:54:00Z
dc.date.available2026-01-21T14:54:00Z
dc.description.abstract<p>Multi-vector embedding models play an increasingly important role in retrievalaugmented generation, yet their internal behaviour lacks comprehensive analysis. We conduct a systematic, head-level study of the 32-head Semantic Feature Representation (SFR) encoder with the FineWeb corpus containing 10 billion tokens. For a set of 4,000 web documents, we pair head-specific embeddings with GPT-4o topic annotations and analyse the results using t-SNE visualisations, heat maps, and a 32-way logistic probe. The analysis shows that (i) clear semantic separation between heads emerges only at an intermediate layer, (ii) some heads align with specific topics while others capture broader corpus features, and (iii) naive pooling of head outputs can blur these distinctions, leading to frequent topic mismatches. The study offers practical guidance on where to extract embeddings, which heads may be pruned, and how to aggregate them to support more transparent and controllable retrieval pipelines.<br></p>
dc.format.pagerange1263
dc.format.pagerange1270
dc.identifier.eisbn978-954-452-098-4
dc.identifier.olddbid213849
dc.identifier.oldhandle10024/196867
dc.identifier.urihttps://www.utupub.fi/handle/11111/56014
dc.identifier.urlhttps://doi.org/10.26615/978-954-452-098-4-146
dc.identifier.urnURN:NBN:fi-fe202601217088
dc.language.isoen
dc.okm.affiliatedauthorTeimouribadelehdareh, Maryam
dc.okm.affiliatedauthorKanerva, Jenna
dc.okm.affiliatedauthorGinter, Filip
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryBulgariaen_GB
dc.publisher.countryBulgariafi_FI
dc.publisher.country-codeBG
dc.relation.conferenceInternational Conference on Recent Advances in Natural Language Processing
dc.relation.doi10.26615/978-954-452-098-4-146
dc.source.identifierhttps://www.utupub.fi/handle/10024/196867
dc.titleA Deep Dive into Multi-Head Attention and Multi-Aspect Embedding
dc.title.bookProceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI era
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
teimouri_kanerva_ginter_2025.pdf
Size:
6.99 MB
Format:
Adobe Portable Document Format