A novel multi-level integrated roofline model approach for performance characterization
| dc.contributor.author | Tuomas Koskela | |
| dc.contributor.author | Zakhar Matveev | |
| dc.contributor.author | Charlene Yang | |
| dc.contributor.author | Adetokunbo Adedoyin | |
| dc.contributor.author | Roman Belenov | |
| dc.contributor.author | Philippe Thierry | |
| dc.contributor.author | Zhengji Zhao | |
| dc.contributor.author | Rahulkumar Gayatri | |
| dc.contributor.author | Hongzhang Shan | |
| dc.contributor.author | Leonid Oliker | |
| dc.contributor.author | Jack Deslippe | |
| dc.contributor.author | Ron Green | |
| dc.contributor.author | Samuel Williams | |
| dc.contributor.organization | Space Research Laboratory | |
| dc.contributor.organization-code | 2606702 | |
| dc.converis.publication-id | 32081756 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/32081756 | |
| dc.date.accessioned | 2022-10-27T11:58:59Z | |
| dc.date.available | 2022-10-27T11:58:59Z | |
| dc.description.abstract | With energy-efficient architectures, including accelerators and many-core processors, gaining traction, application developers face the challenge of optimizing their applications for multiple hardware features including many-core parallelism, wide processing vector-units and on-chip high-bandwidth memory. In this paper, we discuss the development and utilization of a new application performance tool based on an extension of the classical roofline-model for simultaneously profiling multiple levels in the cache-memory hierarchy. This tool presents a powerful visual aid for the developer and can be used to frame the many-dimensional optimization problem in a tractable way. We show case studies of real scientific applications that have gained insights from the Integrated Roofline Model. | |
| dc.format.pagerange | 226-245 | |
| dc.identifier.eisbn | 978-3-319-92040-5 | |
| dc.identifier.isbn | 978-3-319-92039-9 | |
| dc.identifier.issn | 0302-9743 | |
| dc.identifier.jour-issn | 0302-9743 | |
| dc.identifier.olddbid | 173288 | |
| dc.identifier.oldhandle | 10024/156382 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/31313 | |
| dc.identifier.urn | URN:NBN:fi-fe2021042719358 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Koskela, Tuomas | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.internationalcopublication | international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.publisher.country | United States | en_GB |
| dc.publisher.country | Yhdysvallat (USA) | fi_FI |
| dc.publisher.country-code | US | |
| dc.relation.conference | International Conference on High Performance Computing | |
| dc.relation.doi | 10.1007/978-3-319-92040-5_12 | |
| dc.relation.ispartofjournal | Lecture Notes in Computer Science | |
| dc.relation.ispartofseries | Lecture Notes in Computer Science | |
| dc.relation.volume | 10876 | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/156382 | |
| dc.title | A novel multi-level integrated roofline model approach for performance characterization | |
| dc.title.book | High Performance Computing | |
| dc.year.issued | 2018 |
Files
- Name: multi-level-integrated (2).pdf
- Size: 988.27 KB
- Format: Adobe Portable Document Format
- Description: Final draft