Modern microprocessors can achieve high performance on linear algebra kernels, but this currently requires extensive machine-specific hand tuning. We have developed a methodology whereby near-peak performance on a wide range of systems can be achieved automatically for such routines. First, by analyzing current machines and C compilers, we've developed guidelines for writing Portable, High-Performance, ANSI C (PHiPAC, pronounced "fee-pack"). Second, rather than code by hand, we produce parameterized code generators. Third, we write search scripts that find the best parameters for a given system. We report on a BLAS GEMM-compatible multi-level cache-blocked matrix multiply generator which produces code that achieves around 90% of peak on the...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
Abstract. This paper presents a study of performance optimization of dense matrix multiplication on ...
In the last decade floating-point matrix multiplication on FPGAs has been studied extensively and ef...
During the last half-decade, a number of research efforts have centered around developing software f...
Achieving peak performance from the computational kernels that dominate application performance oft...
This thesis describes novel techniques and test implementations for optimizing numerically intensive...
This report has been developed over the work done in the deliverable [Nava94]. There it was shown tha...
Abstract. Traditional parallel programming methodologies for improving performance assume cache-bas...
Abstract—Scientific programmers often turn to vendor-tuned Basic Linear Algebra Subprograms (BLAS) t...
The performance of a parallel matrix-matrix-multiplication routine with the same functionality as DG...
Technology scaling trends have enabled the exponential growth of computing power. However, the perfo...
In this project I optimized the Dense Matrix-Matrix multiplication calculation by tiling the matrice...
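The tiling strategy mentioned in this project abstract is a standard cache-blocking technique: the matrices are processed in small blocks so the working set of the inner loops fits in cache. A minimal sketch is below; the function name, the block size, and the pure-Python representation are illustrative assumptions, not details from the project itself.

```python
def matmul_tiled(A, B, block=32):
    """Cache-blocked (tiled) multiply of two n-by-n matrices given as
    lists of lists. Blocking improves cache reuse; the numerical result
    is identical to the naive triple loop."""
    n = len(A)
    C = [[0.0] * n for _ in range(n)]
    for ii in range(0, n, block):          # block rows of A / C
        for kk in range(0, n, block):      # block columns of A / rows of B
            for jj in range(0, n, block):  # block columns of B / C
                # Multiply one block triple; min() handles edge blocks.
                for i in range(ii, min(ii + block, n)):
                    for k in range(kk, min(kk + block, n)):
                        a_ik = A[i][k]
                        for j in range(jj, min(jj + block, n)):
                            C[i][j] += a_ik * B[k][j]
    return C
```

In an optimized C or Fortran implementation the block size would be tuned to the cache hierarchy (which is exactly what autotuning systems such as PHiPAC search for); here it is just a fixed parameter.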
In order to utilize the tremendous computing power of graphics hardware and to automatically adapt t...
In this document, we describe two strategies of distribution of computations that can be used to imp...