This is a post-peer-review, pre-copyedit version of an article published in Lecture Notes in Computer Science. The final authenticated version is available online at: https://doi.org/10.1007/978-3-540-74466-5_27
[Abstract] Understanding program behavior is at the foundation of program optimization. Techniques for automatic recognition of program constructs (from now on, computational kernels) characterize the behavior of program statements, providing compilers with valuable information to guide code optimization. Our goal is to develop automatic techniques that summarize the behavior of full-scale real applications by building a high-level representation that hides the complexity of implementation details. The first step towards this goal...
The widespread use of multicore processors is not a consequence of significant advances in parallel ...
We propose a compiler analysis pass for programs expressed in the Single Program, Multiple Data (SPM...
This is a post-peer-review, pre-copyedit version of an article published in ACM Transactions on Prog...
This working paper was submitted as a Ph.D. thesis proposal. Program recognition is a program underst...
For decades, computer scientists have sought guidance on how to evolve architectures, languages, and...
[Abstract] Summary form only given. The automatic parallelization of loops that contain complex comp...
The automatic parallelization of loops that contain complex computations is still a challenge for cu...
This report defines the Kernel programming language and documents its design. Kernel is a statically...
High performance computing systems are characterized by a high level of complexity both on their har...
Application performance models are important for both software and hardware development. They can be...
Automatic parallelization techniques for finding loop-based parallelism fail to find efficient paral...
This paper presents a new approach for the detection of coarse-grain parallelism in loop nests that ...
Vectorization support in hardware continues to expand and grow as well we still continue on supersca...
Kernel methods are widely used to address a variety of learning tasks including classification, regr...