Sparse linear algebra algorithms typically perform poorly on superscalar, general-purpose processors due to irregular data access patterns and indexing overhead. These algorithms are important to a number of scientific c computing domains including power system simulation, which motivates this work. A variety of algorithms and techniques exist to exploit CPU features, but it has been shown that special purpose hardware support can dramatically outperform these methods. However, the development cost and scaling limitations of a custom hardware solution limit widespread use. This work presents an analysis of hardware and software performance during sparse LU decomposition in order to better understand trade-o s and to suggest the most promisi...
Matrix decomposition plays an increasingly significant role in many scientific and engineering appli...
We consider three problems in this thesis. First, we want to construct a nearly workload-optimal h...
The focus of this work lies on implementational improvements and, in particular, node-level performa...
Sparse Lower-Upper (LU) Triangular Decomposition is important to many di erent applications, includi...
The Data Pump Architecture (DPA) is a novel non-von-Neumann computer architecture emphasizing effici...
Subspace learning is an essential approach for learning a low dimensional representation of a high ...
An issue of great concern as it relates to global warming is power consumption and efficient use of ...
Graphs are a common representation in many problem domains, including engineering, finance, medicine...
The economic performance of most processes and certainly their safety and operability depend to a l...
This thesis research provides several contributions to computer efficient methodology for estimation...
Sparse Matrix-Matrix multiplication (SpMM) is a fundamental operation over irregular data, which is ...
Scalable design of large applications requires domain-specific, high-level abstraction. Classically,...
Analyzing the electrical energy load and pricing data is important to make informed decisions about ...
Functional verification is used to confirm that the logic of a design meets its specification. The m...
This book contains the abstracts of the presentations at the conference Parallel Computing 2011, 30 ...
Matrix decomposition plays an increasingly significant role in many scientific and engineering appli...
We consider three problems in this thesis. First, we want to construct a nearly workload-optimal h...
The focus of this work lies on implementational improvements and, in particular, node-level performa...
Sparse Lower-Upper (LU) Triangular Decomposition is important to many di erent applications, includi...
The Data Pump Architecture (DPA) is a novel non-von-Neumann computer architecture emphasizing effici...
Subspace learning is an essential approach for learning a low dimensional representation of a high ...
An issue of great concern as it relates to global warming is power consumption and efficient use of ...
Graphs are a common representation in many problem domains, including engineering, finance, medicine...
The economic performance of most processes and certainly their safety and operability depend to a l...
This thesis research provides several contributions to computer efficient methodology for estimation...
Sparse Matrix-Matrix multiplication (SpMM) is a fundamental operation over irregular data, which is ...
Scalable design of large applications requires domain-specific, high-level abstraction. Classically,...
Analyzing the electrical energy load and pricing data is important to make informed decisions about ...
Functional verification is used to confirm that the logic of a design meets its specification. The m...
This book contains the abstracts of the presentations at the conference Parallel Computing 2011, 30 ...
Matrix decomposition plays an increasingly significant role in many scientific and engineering appli...
We consider three problems in this thesis. First, we want to construct a nearly workload-optimal h...
The focus of this work lies on implementational improvements and, in particular, node-level performa...