Vis enkel innførsel

dc.contributor.authorAhmed, Usman
dc.contributor.authorLin, Jerry Chun-Wei
dc.contributor.authorSrivastava, Gautam
dc.date.accessioned2023-03-10T12:49:04Z
dc.date.available2023-03-10T12:49:04Z
dc.date.created2022-05-20T15:10:58Z
dc.date.issued2022
dc.identifier.citationSustainable Computing: Informatics and Systems. 2022, 35 .en_US
dc.identifier.issn2210-5379
dc.identifier.urihttps://hdl.handle.net/11250/3057709
dc.description.abstractMassive data parallelism can be achieved by using general-purpose graphics processing units (GPGPU) with the help of the OpenCL framework. When smaller data with higher GPU memory is executed, it results in a low resource utilization ratio and energy inefficiencies. Up until now, there is no existing model to share GPU for further execution. In addition, if the kernel pair requires the same computation resource, then kernel merging also results in a significant increase in execution time. Therefore, optimal device selection, as well as kernel merging, can significantly speed up the execution performance for a batch of jobs. This paper proposes a kernel merging method that leads to high GPU occupancy. As a result, it reduces execution time and increases GPU utilization. Additionally, a machine learning (ML)-based GPU sharing mechanism is presented to select pairs of kernels in OpenCL frameworks. The model first selects suitable architecture for the jobs and then merges GPU kernels for better resource utilization. From all the GPU candidates, the optimal pair of the kernel concerning data size is selected. The experimental results show that the developed model can achieve 0.91 F1-measure for device selection and 0.98 for the scheduling scheme of kernel merging.en_US
dc.language.isoengen_US
dc.publisherElsevieren_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleA ML-based resource utilization OpenCL GPU-kernel fusion modelen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.rights.holder© 2022 The Authors.en_US
dc.source.pagenumber0en_US
dc.source.volume35en_US
dc.source.journalSustainable Computing: Informatics and Systemsen_US
dc.identifier.doi10.1016/j.suscom.2022.100683
dc.identifier.cristin2026051
dc.source.articlenumber100683en_US
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal