Effective models for fusion of loop nests continue to remain a challenge in both general-purpose and domain-specific language (DSL) compilers. The difficulty often arises from the combinatorial explosion of grouping choices and their interaction with parallelism and locality. This paper presents a new fusion algorithm for high-performance domain-specific compilers for image processing pipelines. The fusion algorithm is driven by dynamic programming and explores spaces of fusion possibilities not covered by previous approaches, and is driven by a cost function more concrete and precise in capturing optimization criteria than prior approaches. The fusion model is particularly tailored to the transformation and optimization sequence applied by...
International audienceWe present a new compilation strategy, implemented at a small cost, to optimiz...
International audienceToday's multi-core era places significant demands on an optimizing compiler, w...
This paper introduces TIRAMISU, a polyhedral framework designed to generate high performance code fo...
Effective models for fusion of loop nests continue to remain a challenge in both general-purpose and...
This paper presents the design and implementation of PolyMage, a domain-specific language and compil...
\u3cp\u3eEfficient code generation for image processing applications continues to pose a challenge i...
On modern architectures, a missed optimization can translate into performance degradations reaching ...
Image processing applications typically involve data-oriented kernels with limited control divergenc...
Even though computer graphics applications are widely used, they remain challenging to implement and...
Many parallel algorithms and library routines are available for performing computer vision and image...
Today's multi-core era places significant demands on an optimizing compiler, which must parallelize ...
International audienceWe present an API-based compilation strategy to optimize image applications, d...
National audienceImage Processing, Mathematical Morphology Base Operators, The MPPA-256 Chip, Exampl...
Many image processing tasks are naturally expressed as a pipeline of small computational kernels kno...
Modern compilers offer more and more capabilities to automatically parallelize code-regions if these...
International audienceWe present a new compilation strategy, implemented at a small cost, to optimiz...
International audienceToday's multi-core era places significant demands on an optimizing compiler, w...
This paper introduces TIRAMISU, a polyhedral framework designed to generate high performance code fo...
Effective models for fusion of loop nests continue to remain a challenge in both general-purpose and...
This paper presents the design and implementation of PolyMage, a domain-specific language and compil...
\u3cp\u3eEfficient code generation for image processing applications continues to pose a challenge i...
On modern architectures, a missed optimization can translate into performance degradations reaching ...
Image processing applications typically involve data-oriented kernels with limited control divergenc...
Even though computer graphics applications are widely used, they remain challenging to implement and...
Many parallel algorithms and library routines are available for performing computer vision and image...
Today's multi-core era places significant demands on an optimizing compiler, which must parallelize ...
International audienceWe present an API-based compilation strategy to optimize image applications, d...
National audienceImage Processing, Mathematical Morphology Base Operators, The MPPA-256 Chip, Exampl...
Many image processing tasks are naturally expressed as a pipeline of small computational kernels kno...
Modern compilers offer more and more capabilities to automatically parallelize code-regions if these...
International audienceWe present a new compilation strategy, implemented at a small cost, to optimiz...
International audienceToday's multi-core era places significant demands on an optimizing compiler, w...
This paper introduces TIRAMISU, a polyhedral framework designed to generate high performance code fo...