Abstract: Software parallelism is a key factor in the performance of parallel systems. In this paper we discuss a parallel-instruction vector space model for workload representation and comparison. We compare this model with the parallelism-matrix technique, which is based on the Frobenius matrix norm. The parallelism-matrix technique compares two workloads based on identical parallel instructions only, whereas the vector space model compares them based on all parallel instructions. We show that the parallel-instruction vector space method outperforms the parallelism-matrix method in time and space, as well as in accuracy. Further, we show that this model provides a useful framework for the design and analysis of benchmarks. This will be demonstra...
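The contrast drawn in the abstract, matching on identical parallel instructions only versus using all parallel instructions, can be illustrated with a minimal sketch. The abstract does not give the paper's definitions, so everything below is an assumption made for illustration: a parallel instruction is encoded as a hypothetical pair (integer ops, floating-point ops) issued together, the matrix comparison is taken to be the Frobenius norm of the difference between normalized frequency tables, and the vector space comparison is taken to be the Euclidean distance between workload centroids. The function names are invented here and are not the authors'.

```python
import math
from collections import Counter

# Assumption: a parallel instruction is a tuple of per-type operation counts,
# here (integer ops, floating-point ops) issued in the same cycle.
# A workload is a list of such tuples. This encoding is hypothetical.

def frobenius_distance(workload_a, workload_b):
    """Frobenius norm of the difference between two normalized frequency
    tables. Only identical parallel instructions land in the same cell."""
    freq_a, freq_b = Counter(workload_a), Counter(workload_b)
    cells = set(freq_a) | set(freq_b)
    return math.sqrt(sum(
        (freq_a[c] / len(workload_a) - freq_b[c] / len(workload_b)) ** 2
        for c in cells
    ))

def centroid(workload):
    """Mean parallel-instruction vector of a workload."""
    n = len(workload)
    dims = len(workload[0])
    return tuple(sum(pi[d] for pi in workload) / n for d in range(dims))

def vector_distance(workload_a, workload_b):
    """Euclidean distance between workload centroids. Every parallel
    instruction contributes, whether or not it occurs in both workloads."""
    ca, cb = centroid(workload_a), centroid(workload_b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(ca, cb)))

# Three toy workloads: b is close to a, c is far from a.
a = [(2, 1)] * 4
b = [(2, 2)] * 4
c = [(0, 4)] * 4

print(frobenius_distance(a, b), frobenius_distance(a, c))
print(vector_distance(a, b), vector_distance(a, c))
```

In this toy case the frequency-table comparison assigns the same distance to both pairs, because no parallel instruction is shared in either pair, while the centroid comparison separates the near workload from the far one. This shows only the kind of difference the abstract describes; it does not reproduce the paper's time, space, or accuracy results.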
Scientific programs are typically characterized as floating-point intensive loop-dominated tasks wit...
This paper presents a comparative and qualitative survey of techniques for evaluating parallel syste...
Having a representative workload of the target domain of a microprocessor is extremely important th...
Abstract: A characterization study of analyzing dynamic instruction traces to characterize program par...
A practical methodology for evaluating and comparing the performance of distributed memory Multiple ...
Abstract: Performance evaluation is a significant step in the study of scheduling algorithms in large...
The analysis of workload traces from real production parallel machines can aid a wide variety of par...
(parallel computers and algorithms too). In this sense the paper is devoted to a complex performance...
Abstract: A parallel program should be evaluated to determine its efficiency, accuracy and benefits...
grantor: University of Toronto. Understanding the characteristics of parallel workloads aids...
137 p. Thesis (Ph.D.), University of Illinois at Urbana-Champaign, 1999. This research addresses the i...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...