A GPU performance estimation model based on micro-benchmarks and black-box kernel profiling

doi:10.12681/eadd/41390

Home

Browse

Discipline

Date

Author

Country

Language

Degree Grantor

About

Theses Submission

FAQ

Helpdesk

Open Data

Abstract

Over the last decade GPUs have been established in the High Performance Computing sector as compute accelerators. The primary characteristics that justify this modern trend are the exceptionally high compute throughput and the remarkable power efficiency of GPUs. However, GPU performance is highly sensitive to many factors, e.g. the type of memory access patterns, branch divergence, the degree of parallelism and potential latencies. Consequently, the execution time of a kernel on a GPU is a difficult to predict measure. Unless the kernel is latency bound, a rough estimate of the execution time on a particular GPU could be provided by applying the roofline model, which is used to map the program’s operation intensity to the peak expected performance on a particular processor.Though this approach is straightforward, it cannot not provide accurate prediction results. In this thesis, after validating the roofline principle on GPUs by employing a micro-benchmark, an analytical throughput oriented performance model is proposed. In particular, this improves on the roofline model following a quantitative approach and a completely automated GPU performance prediction technique is presented. In this respect, the proposed model utilizes micro-benchmarking and profiling in a “black-box” fashion as no inspection of source/binary code is required. The proposed model combines GPU and kernel parameters in order to characterize the performance limiting factor and to predict the execution time on target hardware, by taking into account the efficiency of beneficial computational instructions. In addition, the “quadrant-split” visual representation is proposed, which captures the characteristics of multiple processors in relation to a particular kernel.The experimental evaluation combines test executions on stencil computations (red/black SOR, LMSOR), matrix multiplication (SGEMM) and a total of 28 kernels of the Rodinia benchmark suite, all applied on six CUDA GPUs. The observed absolute error in predictions was 27.66% in the average case. Special cases of mispredicted results were investigated and justified. Moreover, the aforementioned micro-benchmark was used as a subject for performance prediction and the exhibited results were very accurate. Furthermore, the performance model was also examined in a cross vendor configuration by applying the prediction method on the same kernel source codes through the HIP programming environment supported on the AMD ROCm platform. Prediction errors were comparable to CUDA experiments despite the significant architectural differences evident between different vendor GPUs.

	Read Online
	Download full text in PDF format (2.75 MB) (Available only to registered users) I declare that I have read and unconditionally agree and accept the Terms of Use of the National Archive of Ph.D. Theses, as well as the

All items in National Archive of Phd theses are protected by copyright.

DOI	10.12681/eadd/41390
Handle URL	http://hdl.handle.net/10442/hedi/41390
ND	41390
Alternative title	Ένα μοντέλο εκτίμησης απόδοσης επεξεργαστή γραφικών (GPU) βασισμένο σε μετροπρογράμματα και καταγραφή μετρικών με προσέγγιση «μαύρο-κουτί»
Author	Konstantinidis, Elias (Father's name: Nikolaos)
Date	2017
Degree Grantor	National and Kapodistrian University of Athens
Committee members	Κοτρώνης Ιωάννης Μανωλάκος Ηλίας Κοζύρης Νεκτάριος Μισυρλής Νικόλαος Γκιζόπουλος Δημήτριος Σούντρης Δημήτριος Τζαφέρης Φίλιππος
Discipline	Natural Sciences ➨ Computer and Information Sciences
Keywords	Performance model; Graphics Processing Unit; Roofline model
Country	Greece
Language	English
Description	152 σ., im., tbls., fig., ch.
Rights and terms of use	Το έργο παρέχεται υπό τους όρους της δημόσιας άδειας του νομικού προσώπου Creative Commons Corporation: Attribution - ShareAlike 3.0 (CC-BY_SA)

Usage statistics

VIEWS

Concern the unique Ph.D. Thesis' views for the period 07/2018 - 07/2023.
Source: Google Analytics.

ONLINE READER

Concern the online reader's opening for the period 07/2018 - 07/2023.
Source: Google Analytics.

DOWNLOADS

Concern all downloads of this Ph.D. Thesis' digital file.
Source: National Archive of Ph.D. Theses.

USERS

Concern all registered users of National Archive of Ph.D. Theses who have interacted with this Ph.D. Thesis. Mostly, it concerns downloads.
Source: National Archive of Ph.D. Theses.

Related items (based on users' visits)

Physical and numerical modelling of irregular wave propagation in coastal waters

Αριθμητική προσομοίωση της τρισδιάστατης τυρβώδους ροής θραυομένων κυμάτων στην παράκτια ζώνη απόσβεσης

Ανάπτυξη στατιστικών μεθόδων υποβιβασμού κλίμακας με εφαρμογές σε περιβαλλοντολογικά προβλήματα

Numerical solution of incompressible flow equations over irregular geometry, with application to coastal hydrodynamics

Ασφάλεια πληροφοριακών συστημάτων: μαθηματικές αναλύσεις

Συμβολή της υποστηριζόμενης από GPS και GIS διαστημικής τηλεπισκόπησης στη μορφοτεκτονική έρευνα της Κεντρικής Μακεδονίας

ΜΕΛΕΤΗ ΤΩΝ ΓΗΙΝΩΝ ΠΑΛΙΡΡΟΙΩΝ ΑΠΟ ΚΑΤΑΓΡΑΦΕΣ ΒΑΡΥΤΗΤΑΣ. ΕΦΑΡΜΟΓΗ ΣΤΗΝ ΠΕΡΙΟΧΗ ΤΩΝ ΑΘΗΝΩΝ

Στοχαστική προσομοίωση θαλάσσσιων κυματισμών στα αβαθή ύδατα

Μελέτη της σύγχρονης ιζηματογένεσης στον κόλπο (εσωτερική υφαλοκρηπίδα) της Αλεξανδρούπολης

ΜΑΘΗΜΑΤΙΚΗ ΠΕΡΙΓΡΑΦΗ ΤΗΣ ΣΥΜΒΟΛΗΣ ΤΩΝ ΤΕΤΡΑΠΟΛΙΚΩΝ ΡΟΠΩΝ ΣΤΙΣ ΗΛΕΚΤΡΟΕΛΑΣΤΙΚΕΣ ΑΛΛΗΛΕΠΙΔΡΑΣΕΙΣ ΤΩΝ ΣΥΝΕΧΩΝ ΜΕΣΩΝ:ΔΙΑΝΥΣΜΑΤΙΚΗ ΚΑΙ ΜΕΤΑΒΟΛΙΚΗ ΠΡΟΣΕΓΓΙΣΗ

"A GPU performance estimation model based on micro-benchmarks and black-box kernel profiling"
	Please, type what you see in the image!
I declare that I have read and unconditionally agree and accept the Terms of Use of the National Archive of Ph.D. Theses, as well as the. Iï¿½m aware that this thesis is licensed under the Creative Commons Αναφορά Δημιουργού Παρόμοια Διανομή 3.0 Ελλάδα