Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/81361
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTan, Wen Junen
dc.contributor.authorTang, Wai Tengen
dc.contributor.authorGoh, Rick Siow Mongen
dc.contributor.authorTurner, Stephen Johnen
dc.contributor.authorWong, Weng-Faien
dc.date.accessioned2015-12-23T08:52:13Zen
dc.date.accessioned2019-12-06T14:29:15Z-
dc.date.available2015-12-23T08:52:13Zen
dc.date.available2019-12-06T14:29:15Z-
dc.date.issued2015en
dc.identifier.citationTan, W. J., Tang, W. T., Goh, R. S. M., Turner, S. J., & Wong, W.-F. (2015). A Code Generation Framework for Targeting Optimized Library Calls for Multiple Platforms. IEEE Transactions on Parallel and Distributed Systems, 26(7), 1789-1799.en
dc.identifier.issn1045-9219en
dc.identifier.urihttps://hdl.handle.net/10356/81361-
dc.description.abstractDirective-based programming approaches such as OpenMP and OpenACC have gained popularity due to their ease of programming. These programming models typically involve adding compiler directives to code sections such as loops in order to parallelize them for execution on multicore CPUs or GPUs. However, one problem with this approach is that existing compilers generate code directly from the annotated sections and do not make use of hardware-specific architectural features. As a result, the generated code is unable to fully exploit the capabilities of the underlying hardware. Alternatively, we propose a code generation framework in which linear algebraic operations in the annotated codes are recognized, extracted and mapped to optimized vendor-provided platform-specific library calls. We demonstrate that such an approach can result in better performance in the generated code compared to those which are generated by existing compilers. This is substantiated by experimental results on multicore CPUs and GPUs.en
dc.description.sponsorshipASTAR (Agency for Sci., Tech. and Research, S’pore)en
dc.format.extent12 p.en
dc.language.isoenen
dc.relation.ispartofseriesIEEE Transactions on Parallel and Distributed Systemsen
dc.rights© 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TPDS.2014.2329494].en
dc.subjectOpenMPen
dc.subjectOpenACCen
dc.subjectMulticore CPUen
dc.subjectGPUen
dc.subjectCode generationen
dc.titleA Code Generation Framework for Targeting Optimized Library Calls for Multiple Platformsen
dc.typeJournal Articleen
dc.contributor.schoolSchool of Computer Engineeringen
dc.identifier.doi10.1109/TPDS.2014.2329494en
dc.description.versionAccepted versionen
item.grantfulltextopen-
item.fulltextWith Fulltext-
Appears in Collections:SCSE Journal Articles
Files in This Item:
File Description SizeFormat 
A Code Generation Framework for Targeting Optimized Library Calls for Multiple Platforms.pdf596.39 kBAdobe PDFThumbnail
View/Open

SCOPUSTM   
Citations 20

11
Updated on Nov 30, 2023

Web of ScienceTM
Citations 20

9
Updated on Oct 27, 2023

Page view(s)

312
Updated on Dec 7, 2023

Download(s) 20

264
Updated on Dec 7, 2023

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.