Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/81361
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Tan, Wen Jun | en |
dc.contributor.author | Tang, Wai Teng | en |
dc.contributor.author | Goh, Rick Siow Mong | en |
dc.contributor.author | Turner, Stephen John | en |
dc.contributor.author | Wong, Weng-Fai | en |
dc.date.accessioned | 2015-12-23T08:52:13Z | en |
dc.date.accessioned | 2019-12-06T14:29:15Z | - |
dc.date.available | 2015-12-23T08:52:13Z | en |
dc.date.available | 2019-12-06T14:29:15Z | - |
dc.date.issued | 2015 | en |
dc.identifier.citation | Tan, W. J., Tang, W. T., Goh, R. S. M., Turner, S. J., & Wong, W.-F. (2015). A Code Generation Framework for Targeting Optimized Library Calls for Multiple Platforms. IEEE Transactions on Parallel and Distributed Systems, 26(7), 1789-1799. | en |
dc.identifier.issn | 1045-9219 | en |
dc.identifier.uri | https://hdl.handle.net/10356/81361 | - |
dc.description.abstract | Directive-based programming approaches such as OpenMP and OpenACC have gained popularity due to their ease of programming. These programming models typically involve adding compiler directives to code sections such as loops in order to parallelize them for execution on multicore CPUs or GPUs. However, one problem with this approach is that existing compilers generate code directly from the annotated sections and do not make use of hardware-specific architectural features. As a result, the generated code is unable to fully exploit the capabilities of the underlying hardware. Alternatively, we propose a code generation framework in which linear algebraic operations in the annotated codes are recognized, extracted and mapped to optimized vendor-provided platform-specific library calls. We demonstrate that such an approach can result in better performance in the generated code compared to those which are generated by existing compilers. This is substantiated by experimental results on multicore CPUs and GPUs. | en |
dc.description.sponsorship | ASTAR (Agency for Sci., Tech. and Research, S’pore) | en |
dc.format.extent | 12 p. | en |
dc.language.iso | en | en |
dc.relation.ispartofseries | IEEE Transactions on Parallel and Distributed Systems | en |
dc.rights | © 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TPDS.2014.2329494]. | en |
dc.subject | OpenMP | en |
dc.subject | OpenACC | en |
dc.subject | Multicore CPU | en |
dc.subject | GPU | en |
dc.subject | Code generation | en |
dc.title | A Code Generation Framework for Targeting Optimized Library Calls for Multiple Platforms | en |
dc.type | Journal Article | en |
dc.contributor.school | School of Computer Engineering | en |
dc.identifier.doi | 10.1109/TPDS.2014.2329494 | en |
dc.description.version | Accepted version | en |
item.grantfulltext | open | - |
item.fulltext | With Fulltext | - |
Appears in Collections: | SCSE Journal Articles |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
A Code Generation Framework for Targeting Optimized Library Calls for Multiple Platforms.pdf | 596.39 kB | Adobe PDF | ![]() View/Open |
SCOPUSTM
Citations
20
11
Updated on Nov 30, 2023
Web of ScienceTM
Citations
20
9
Updated on Oct 27, 2023
Page view(s)
312
Updated on Dec 7, 2023
Download(s) 20
264
Updated on Dec 7, 2023
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.