In high-performance computing systems, parallel applications request a large number of resources for long time periods. In this scenario, a resource that fails during application runtime causes every application using it to fail. The probability of application failure is tied to the inherent reliability of the resources used by the application. Our investigation of high-performance computing systems operating in the field has revealed a significant difference in the measured operational reliability of individual computing nodes. By adding awareness of the individual system nodes' reliability to the scheduler along with the predicted reliability needs of parallel applications, reliable resources can be matched with the mo...
This dissertation develops analytical models to provide insight into various design issues associate...
While microprocessors have doubled their speed every 18 months, performance improvement of memory sy...
The goal of this research is to advance the state of the art in bulk power system reliabil...
The defense sector is undergoing a phase of rapid technological advancement, in the pursuit of its g...
High performance computing clusters provide an efficient and cost effective solution to tackle large...
The effectiveness of computer system resource management has always been determined by two major fac...
Pipelining the functional units and memory interface of processors can result in shorter cycle times...
In this thesis, we analyze various factors that affect quality of service (QoS) communication in hig...
Power-performance efficiency has become a central focus that is challenging in heterogeneous process...
This thesis makes progress towards the fundamental understanding of heterogeneous and dynamic in...
We propose a novel organization for multi-dimensional data based on the concept of macro-voxels. This...
The intent of this Thesis is to study the potential of distributed resources to increase the efficac...
Pattern matching is at the core of many computational problems, e.g., search engine, data mining, ne...
In recent times, advances in scientific research related to electric vehicles led to generation o...
Large optimization problems are frequently solved for power systems operation and analysis of electr...