Lectures

GPU Architectures and Programming
DozentIn:Prof. Dr. Sohan Lal
Veranstaltungstyp:Vorlesung (Lehre)
Beschreibung:In this module, you will study the architecture and programming of GPUs. Please find below a brief outline of the lectures: - Review of computer architecture basics - measuring performance, benchmarks, five-stage RISC pipeline, caches - GPU basics - the evolution of GPU computing, a high-level overview of a GPU architecture - GPU programming with CUDA - program structure, CUDA threads organization, warp/thread-block scheduling - GPU (micro) architecture - streaming multiprocessors, single instruction multiple threads (SIMT) core design, tensor cores for deep learning, RT cores for ray tracing, mixed-precision support - GPU memory hierarchy - banked register file and operand collectors, shared memory, GPU caches (differences w.r.t. CPU caches), global memory - Branch and memory divergence - branch handling, stack-based reconvergence, memory coalescing, coalescer design - Barriers and synchronization - Temporal and spatial locality exploitation challenges in GPU caches - Global memory- high throughput requirements, GDDR/HBM, memory bandwidth optimization techniques - GPU research issues - performance bottlenecks, GPU power modeling, high-power consumption/energy efficiency, GPU security - Application case study - deep learning - Cycle-accurate simulators for GPUs In addition to lectures, a semester-long problem-based project will augment the learning in the lectures. Several topics related to GPUs will be proposed. You are required to choose a topic and work on it. It is possible to work in groups. There will be (bi-) weekly meetings to discuss progress and problems. In addition to the semester-long project, there will be assignments to teach CUDA programming. Course Evaluation: Oral examination Duration: 30 minutes
Ort:nicht angegeben
Semester:SoSe 24
Zeiten:Do. 11:15 - 12:45 (wöchentlich)
Erster Termin:Donnerstag, 04.04.2024 11:15 - 12:45
Veranstaltungsnummer:lv3039_s24
ECTS-Kreditpunkte:6
Weitere Informationen:
Heimatinstitut: Institut für Massively Parallel Systems (E-EXK5)
In Stud.IP angemeldete Teilnehmer: 81
Anzahl der Postings im Stud.IP-Forum: 2
Anzahl der Dokumente im Stud.IP-Downloadbereich: 1
Anmelden in Stud.IP