Concurrency is pervasive and perplexing, particularly on graphics processing units (GPUs). Current specifications of languages and hardware are inconclusive; thus programmers often rely on folklore assumptions when writing software. To remedy this state of affairs, we conducted a large empirical study of the concurrent behaviour of deployed GPUs. Armed with litmus tests (i.e. short concurrent programs), we questioned the assumptions in programming guides and vendor documentation about the guarantees provided by hardware. We developed a tool to generate thousands of litmus tests and run them under stressful workloads. We observed a litany of previously elusive weak behaviours, and exposed folklore beliefs about GPU programming---often suppor...
The tutorial at CONCUR will provide a practical overview of work undertaken over the last six years ...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
<p>Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPUs are ...
Concurrency is pervasive and perplexing, particularly on graphics processing units (GPUs). Current s...
Graphics Processing Units (GPUs) are highly parallel shared memory microprocessors, and as such, the...
thesisGraphics Processing Units (GPUs) are highly parallel shared memory microprocessors, and as suc...
I have read the thesis of Tyler Sorensen in its final form and have found that (1) its format, citat...
The tutorial at CONCUR will provide a practical overview of work undertaken over the last six years ...
Each new generation of GPUs vastly increases the resources avail-able to GPGPU programs. GPU program...
Each new generation of GPUs vastly increases the resources available to GPGPU programs. GPU programm...
The tremendous computing power GPUs are capable of makes of them the epicenter of an unprecedented a...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
The Graphics Processing Unit (GPU) has become a mainstream computing platform for a wide range of ap...
Even though graphics processors (GPUs) are becoming increasingly popular for general purpose computi...
Nondeterminism is a key challenge in developing multithreaded applications. Even with the same input...
The tutorial at CONCUR will provide a practical overview of work undertaken over the last six years ...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
<p>Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPUs are ...
Concurrency is pervasive and perplexing, particularly on graphics processing units (GPUs). Current s...
Graphics Processing Units (GPUs) are highly parallel shared memory microprocessors, and as such, the...
thesisGraphics Processing Units (GPUs) are highly parallel shared memory microprocessors, and as suc...
I have read the thesis of Tyler Sorensen in its final form and have found that (1) its format, citat...
The tutorial at CONCUR will provide a practical overview of work undertaken over the last six years ...
Each new generation of GPUs vastly increases the resources avail-able to GPGPU programs. GPU program...
Each new generation of GPUs vastly increases the resources available to GPGPU programs. GPU programm...
The tremendous computing power GPUs are capable of makes of them the epicenter of an unprecedented a...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
The Graphics Processing Unit (GPU) has become a mainstream computing platform for a wide range of ap...
Even though graphics processors (GPUs) are becoming increasingly popular for general purpose computi...
Nondeterminism is a key challenge in developing multithreaded applications. Even with the same input...
The tutorial at CONCUR will provide a practical overview of work undertaken over the last six years ...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
<p>Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPUs are ...