Graph algorithms have gained popularity and are utilized in high performance and mobile computing paradigms. Input dependence due to input graph changes leads to performance variations in such algorithms. The impact of input dependence for graph algorithms is not well studied in the context of approximate computing. This thesis conducts such analysis by applying loop perforation, which is a general approximation mechanism that transforms the program loops to drop a subset of their total iterations. The analysis identifies the need to adapt the inner and outer loop perforation as a function of input graph characteristics, such as the density or size of the graph. A predictive model is proposed to learn the near-optimal loop perforation rates...
This thesis explores the time optimal implementation of computational graphs on a finite register ma...
A long-standing assumption common in algorithm design is that any part of the input is accessible at...
This thesis proposes a reconfigurable computing approach for supporting parallel processing in large...
Graph algorithms have gained popularity and are utilized in high performance and mobile computing pa...
This thesis discusses the application of optimizations to machine learning algorithms. In particular...
In this thesis we investigate the relation between the structure of input graphs and the performance...
Mathematicians and computational scientists are often limited in their ability to model complex phen...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Sequential graph algorithms are implemented through ordered execution of tasks to achieve high work ...
Many applications provide inherent resilience to some amount of error and can potentially trade accu...
Efficiently processing large graphs is challenging, since parallel graph algorithms suffer from poor...
Parallel graph processing is central to analytical computer science applications, and GPUs have prov...
Irregular algorithms such as graph algorithms, sorting, and sparse matrix multiplication, present nu...
Abstract—Loops are the main source of parallelism in many applications. This paper solves the open p...
This thesis explores the time optimal implementation of computational graphs on a finite register ma...
A long-standing assumption common in algorithm design is that any part of the input is accessible at...
This thesis proposes a reconfigurable computing approach for supporting parallel processing in large...
Graph algorithms have gained popularity and are utilized in high performance and mobile computing pa...
This thesis discusses the application of optimizations to machine learning algorithms. In particular...
In this thesis we investigate the relation between the structure of input graphs and the performance...
Mathematicians and computational scientists are often limited in their ability to model complex phen...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Sequential graph algorithms are implemented through ordered execution of tasks to achieve high work ...
Many applications provide inherent resilience to some amount of error and can potentially trade accu...
Efficiently processing large graphs is challenging, since parallel graph algorithms suffer from poor...
Parallel graph processing is central to analytical computer science applications, and GPUs have prov...
Irregular algorithms such as graph algorithms, sorting, and sparse matrix multiplication, present nu...
Abstract—Loops are the main source of parallelism in many applications. This paper solves the open p...
This thesis explores the time optimal implementation of computational graphs on a finite register ma...
A long-standing assumption common in algorithm design is that any part of the input is accessible at...
This thesis proposes a reconfigurable computing approach for supporting parallel processing in large...