Many modern (mobile) systems involve memory intensive computations. External memory accesses are costly when it comes to the execution time and energy consumption of a program. To overcome this, we usually apply tiling to improve data locality and data reuse in internal memories. In the research reported in this paper we add the possibility to recompute data rather than storing temporary results, and demonstrate that this can have a positive effect on the overall application performance. To achieve this we represented recomputation in the Polyhedral model by extending Polly. We experimentally verified the effectiveness of recomputation on a pair of Convolutional Neural Network layers, when applying loop tiling, loop fusion, and recompute
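The trade-off between storing and recomputing temporary results can be illustrated with a minimal sketch. It is not the paper's Polly-based implementation: it assumes two chained 1-D convolution layers on plain Python lists, and the function names (`conv1d`, `two_layers_stored`, `two_layers_recompute`) and the `tile` parameter are hypothetical, chosen only for illustration.

```python
def conv1d(x, w):
    """Valid 1-D convolution (correlation form) of list x with kernel w."""
    k = len(w)
    return [sum(x[i + j] * w[j] for j in range(k))
            for i in range(len(x) - k + 1)]


def two_layers_stored(x, w1, w2):
    """Baseline: layer 1 writes its whole output to a temporary array."""
    tmp = conv1d(x, w1)  # full intermediate result kept in memory
    return conv1d(tmp, w2), len(tmp)


def two_layers_recompute(x, w1, w2, tile=4):
    """Fused and tiled: every output tile recomputes the layer-1 values
    it needs, so no full intermediate array is ever stored."""
    k1, k2 = len(w1), len(w2)
    n_out = len(x) - k1 - k2 + 2
    out = [0] * n_out
    layer1_evals = 0  # counts how often a layer-1 value is computed
    for t0 in range(0, n_out, tile):
        t1 = min(t0 + tile, n_out)
        # Outputs t0..t1-1 read layer-1 indices t0..t1+k2-2 (tile plus halo).
        local = [sum(x[i + j] * w1[j] for j in range(k1))
                 for i in range(t0, t1 + k2 - 1)]
        layer1_evals += len(local)
        for o in range(t0, t1):
            out[o] = sum(local[o - t0 + j] * w2[j] for j in range(k2))
    return out, layer1_evals


x = list(range(12))
w1, w2 = [1, 2, 1], [1, 0, -1]
stored_out, stored_size = two_layers_stored(x, w1, w2)
fused_out, fused_evals = two_layers_recompute(x, w1, w2, tile=4)
```

In this toy setting both variants produce the same output. The stored variant keeps 10 intermediate values, while the recompute variant evaluates layer 1 12 times (the halo values shared by neighbouring tiles are computed twice) but holds only 6 of them at any time. Whether that extra arithmetic pays off depends on the relative cost of external memory accesses, which is the question the paper investigates.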
A few weeks ago, we were glad to announce the first release of Apollo, the Aut...
The polyhedral model is a high-level intermediate representation for loop nest...
Abstract. The polyhedral model is a powerful framework for automatic optimization and parallelizatio...
High-level loop optimizations are necessary to achieve good performance over a ...
On modern architectures, a missed optimization can translate into performance degradations reaching ...
The polyhedral model is a powerful framework for automatic optimization and pa...
There may be a huge gap between the statements outlined by programmers in a pr...
Loop-nests in most scientific applications perform repetitive operations on array(s) and account for...
Computers become increasingly complex. Current and future systems feature configurable hardware, mul...
The polyhedral model for loop parallelization has proved to be an effective tool for advanced optim...
High-level program optimizations, such as loop transformations, are critical f...
Contains fulltext : 197929.pdf (publisher's version ) (Open Access)IMPACT 2018: Ei...
In this paper, we propose Rec2Poly, a framework which detects automatically if...
Multi-core processors are now in widespread use in almost all areas of computing: desktops, laptops ...