A Resource-Efficient Codebook-Driven Semantic Structuring Pipeline for Human-AI Dialogue in Ambient Intelligent Systems
| dc.contributor.author | Adeseye, Aisvarya | |
| dc.contributor.author | Isoaho, Jouni | |
| dc.contributor.author | Virtanen, Seppo | |
| dc.contributor.author | Mohammad, Tahir | |
| dc.contributor.organization | fi=kyberturvallisuusteknologia|en=Cyber Security Engineering| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.28753843706 | |
| dc.converis.publication-id | 526472743 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/526472743 | |
| dc.date.accessioned | 2026-06-10T20:12:16Z | |
| dc.description.abstract | <p>Human–AI dialogue in ambient intelligent systems is increasingly relying on large language models (LLMs). When questions are generated dynamically to enable personalized and context-aware interactions, variations in phrasing and topical focus exist between conversations. Without structured organization, which is often extremely resource-intensive, conversational data remains fragmented and cannot be reliably used for systematic analysis or reporting. This study proposes a semantic structuring pipeline to map LLM-generated questions to shared codes, sub-themes, and themes using a predefined codebook. This multi-stage pipeline applies semantic screening, factor-based scoring, mathematical aggregation, and validation checks, supported by locally deployed LLMs and manual confirmation. The pipeline was evaluated on 6,030 question–response pairs collected from dynamic interviews across three research objectives. The framework achieved an overall mapping accuracy of 97% while reducing hallucinated semantic matches to 1.2% through layered validation. The results indicate that the framework effectively reduces hallucinated matches and improves mapping accuracy while remaining computationally efficient for private local deployment.<br></p> | |
| dc.format.pagerange | 559 | |
| dc.format.pagerange | 552 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/61692 | |
| dc.identifier.url | https://doi.org/10.1016/j.procs.2026.04.070 | |
| dc.identifier.urn | URN:NBN:fi-fe2026061066543 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Adeseye, Aisvarya | |
| dc.okm.affiliatedauthor | Isoaho, Jouni | |
| dc.okm.affiliatedauthor | Virtanen, Seppo | |
| dc.okm.affiliatedauthor | Mohammad, Tahir | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.discipline | 213 Electronic, automation and communications engineering, electronics | en_GB |
| dc.okm.discipline | 213 Sähkö-, automaatio- ja tietoliikennetekniikka, elektroniikka | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.publisher.country | Netherlands | en_GB |
| dc.publisher.country | Alankomaat | fi_FI |
| dc.publisher.country-code | NL | |
| dc.relation.conference | International Conference on Ambient Systems, Networks and Technologies Networks | |
| dc.relation.doi | 10.1016/j.procs.2026.04.070 | |
| dc.relation.ispartofjournal | Procedia Computer Science | |
| dc.relation.volume | 280 | |
| dc.title | A Resource-Efficient Codebook-Driven Semantic Structuring Pipeline for Human-AI Dialogue in Ambient Intelligent Systems | |
| dc.title.book | The 17th International Conference on Ambient Systems, Networks and Technologies Networks (ANT)/ the 9th International Conference on Emerging Data and Industry 4.0 (EDI40) | |
| dc.year.issued | 2026 |
Tiedostot
1 - 1 / 1
Ladataan...
- Name:
- 1-s2.0-S1877050926010835-main.pdf
- Size:
- 461.89 KB
- Format:
- Adobe Portable Document Format