Extreme-scale computing poses a number of challenges to application performance. Developers need to study appli-cation behavior by collecting detailed information with the help of tracing toolsets to determine shortcomings. But not only applications are“scalability challenged”, current tracing toolsets also fall short of exascale requirements for low back-ground overhead since trace collection for each execution en-tity is becoming infeasible. One effective solution is to clus-ter processes with the same behavior into groups. Instead of collecting performance information from each individual node, this information can be collected from just a set of representative nodes. This work contributes a fast, scalable, signature-based clustering alg...
The proliferation of the web presents an unsolved problem of automatically analyzing billions of pag...
Analyzing parallel programs has become increasingly difficult due to the immense amount of informati...
Clustering algorithms are an important tool for data mining and data analysis purposes. Clustering a...
Today most complex scientific applications requires a large number of calculations to solve a partic...
Event tracing provides the detailed data needed to under-stand the dynamics of interactions among ap...
With larger and larger systems being constantly deployed, trace-based performance analysis of paral...
Applications on todays massively parallel supercom-puters rely on performance analysis tools to guid...
A considerably fraction of science discovery is nowadays relying on computer simulations. High Per...
This paper studies the utility of using data analytics and machine learning techniques for identifyi...
Thesis (Ph.D.)--University of Washington, 2015-12Clustering algorithms provide a way to analyze and ...
As access to supercomputing resources is becoming more and more commonplace, performance analysis to...
Often parallel scientific applications are instrumented and traces are collected and analyzed to ide...
Process mining techniques have been used to analyze event logs from information systems in order to ...
The proliferation of the web presents an unsolved problem of automatically analyzing billions of pag...
Performance measurement and analysis of parallel applications is often challenging, despite many exc...
The proliferation of the web presents an unsolved problem of automatically analyzing billions of pag...
Analyzing parallel programs has become increasingly difficult due to the immense amount of informati...
Clustering algorithms are an important tool for data mining and data analysis purposes. Clustering a...
Today most complex scientific applications requires a large number of calculations to solve a partic...
Event tracing provides the detailed data needed to under-stand the dynamics of interactions among ap...
With larger and larger systems being constantly deployed, trace-based performance analysis of paral...
Applications on todays massively parallel supercom-puters rely on performance analysis tools to guid...
A considerably fraction of science discovery is nowadays relying on computer simulations. High Per...
This paper studies the utility of using data analytics and machine learning techniques for identifyi...
Thesis (Ph.D.)--University of Washington, 2015-12Clustering algorithms provide a way to analyze and ...
As access to supercomputing resources is becoming more and more commonplace, performance analysis to...
Often parallel scientific applications are instrumented and traces are collected and analyzed to ide...
Process mining techniques have been used to analyze event logs from information systems in order to ...
The proliferation of the web presents an unsolved problem of automatically analyzing billions of pag...
Performance measurement and analysis of parallel applications is often challenging, despite many exc...
The proliferation of the web presents an unsolved problem of automatically analyzing billions of pag...
Analyzing parallel programs has become increasingly difficult due to the immense amount of informati...
Clustering algorithms are an important tool for data mining and data analysis purposes. Clustering a...