{"version":"1.0","provider_name":"UCT HPC","provider_url":"https:\/\/ucthpc.uct.ac.za","author_name":"Andrew Lewis","author_url":"https:\/\/ucthpc.uct.ac.za\/index.php\/author\/andrew-lewis\/","title":"GPGPU course - UCT HPC","type":"rich","width":600,"height":338,"html":"<blockquote class=\"wp-embedded-content\" data-secret=\"KO3y2GWB2C\"><a href=\"https:\/\/ucthpc.uct.ac.za\/index.php\/2013\/05\/06\/gpgpu-course\/\">GPGPU course<\/a><\/blockquote><iframe sandbox=\"allow-scripts\" security=\"restricted\" src=\"https:\/\/ucthpc.uct.ac.za\/index.php\/2013\/05\/06\/gpgpu-course\/embed\/#?secret=KO3y2GWB2C\" width=\"600\" height=\"338\" title=\"&#8220;GPGPU course&#8221; &#8212; UCT HPC\" data-secret=\"KO3y2GWB2C\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\" class=\"wp-embedded-content\"><\/iframe><script type=\"text\/javascript\">\n\/*! This file is auto-generated *\/\n!function(c,d){\"use strict\";var e=!1,o=!1;if(d.querySelector)if(c.addEventListener)e=!0;if(c.wp=c.wp||{},c.wp.receiveEmbedMessage);else if(c.wp.receiveEmbedMessage=function(e){var t=e.data;if(!t);else if(!(t.secret||t.message||t.value));else if(\/[^a-zA-Z0-9]\/.test(t.secret));else{for(var r,s,a,i=d.querySelectorAll('iframe[data-secret=\"'+t.secret+'\"]'),n=d.querySelectorAll('blockquote[data-secret=\"'+t.secret+'\"]'),o=new RegExp(\"^https?:$\",\"i\"),l=0;l<n.length;l++)n[l].style.display=\"none\";for(l=0;l<i.length;l++)if(r=i[l],e.source!==r.contentWindow);else{if(r.removeAttribute(\"style\"),\"height\"===t.message){if(1e3<(s=parseInt(t.value,10)))s=1e3;else if(~~s<200)s=200;r.height=s}if(\"link\"===t.message)if(s=d.createElement(\"a\"),a=d.createElement(\"a\"),s.href=r.getAttribute(\"src\"),a.href=t.value,!o.test(a.protocol));else if(a.host===s.host)if(d.activeElement===r)c.top.location.href=t.value}}},e)c.addEventListener(\"message\",c.wp.receiveEmbedMessage,!1),d.addEventListener(\"DOMContentLoaded\",t,!1),c.addEventListener(\"load\",t,!1);function t(){if(o);else{o=!0;for(var e,t,r,s=-1!==navigator.appVersion.indexOf(\"MSIE 10\"),a=!!navigator.userAgent.match(\/Trident.*rv:11\\.\/),i=d.querySelectorAll(\"iframe.wp-embedded-content\"),n=0;n<i.length;n++){if(!(r=(t=i[n]).getAttribute(\"data-secret\")))r=Math.random().toString(36).substr(2,10),t.src+=\"#?secret=\"+r,t.setAttribute(\"data-secret\",r);if(s||a)(e=t.cloneNode(!0)).removeAttribute(\"security\"),t.parentNode.replaceChild(e,t);t.contentWindow.postMessage({message:\"ready\",secret:r},\"*\")}}}}(window,document);\n<\/script>\n","description":"Last  week Andrew Lewis attended an honours course in GPGPU programming. &nbsp;The  course was hosted and organised by UCT's computer science department and funded by the&nbsp;SKA.  &nbsp;We were very privileged to have as our lecturers John Stone who is the  Associate Director of the CUDA Center of Excellence at University of  Illinois and Associate Professor Manuel Ujald&oacute;n from the University of  Malaga. &nbsp;Both are also NVIDIA CUDA Fellows. The  course covered introductory programming and architecture of CUDA cards,&nbsp;new GPU technologies such as dynamic parallelism&nbsp;as well as more  advanced tricks and techniques for saving both time as well as  electrical power, the latter being of great importance when writing code  for portable devices as well as considering power consumption of  extremely large data centres. As  we reach the limits of our manufacturing technology where physical  track sizes on the chips can be measured in 10's of atoms our ability to  achieve significant speed-ups decrease (in 2006 it was estimated that  CPU speeds in 2013 would be around 7GHz). &nbsp;GPGPU technology is a way  around this barrier provided there are programmers capable of taking  advantage of these computing paradigms. Both  lecturers gave extremely interesting glimpses into the future of GPU  computing; the upcoming release of&nbsp;Maxwell, the successor to the  NVIDIA Kepler card  and the future&nbsp;Volta&nbsp;card, all of which ofer  more capabilities at lower power. &nbsp;The integration of HMC memory into  GPU chips will also surmount the issue of memory transfer latency. &nbsp;It  is estimated that the stacked DRAM in the Volta card will have transfer  rates in the order of 1TB\/s.","thumbnail_url":"https:\/\/ucthpc.uct.ac.za\/wp-content\/uploads\/2015\/07\/nvidia_kepler_tesla_k10.jpg"}