AbstractWe propose a new storage scheme (word packing) for matrices with elements in Z2 that enables improved performance. This scheme is based on utilizing the full register length of modern microprocessors to perform multiple Z2 operations in parallel. We analyze several operations over word packed matrices and compare them with their conventional equivalents
For matrix multiplication on hypercube multiprocessors with the product matrix accumulated in place ...
Combinatorial scientific computing plays an important enabling role in computational science, partic...
We describe an efficient implementation of a hierarchy of algorithms for multiplication of dense mat...
AbstractWe propose a new storage scheme (word packing) for matrices with elements in Z2 that enables...
a r t i c l e i n f o a b s t r a c t Dedicated to Professor Gad M. Landau, on the occasion of his 6...
International audienceBini–Capovani–Lotti–Romani approximate formula (or border rank) for matrix mul...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
As nowadays Machine Learning (ML) techniques are generating huge data collections, the problem of h...
In the packed string matching problem, each machine word accomodates alpha characters, thus an n-cha...
International audienceWe propose to store several integers modulo a small prime into a single machin...
In this letter, a functional Z2-FET DRAM memory matrix is experimentally demonstrated for the first ...
The authors describe a new extension to ScaLAPACK for computing with symmetric (Hermitian) matrices ...
We describe a new extension to ScaLAPACK [2] for computing with symmetric (Hermitian) matrices store...
Abstract: Suppose the bits of a computer word are partitioned into d disjoint sets, each of which is...
International audienceOver the past few years, multicore systems have become more and more powerful ...
For matrix multiplication on hypercube multiprocessors with the product matrix accumulated in place ...
Combinatorial scientific computing plays an important enabling role in computational science, partic...
We describe an efficient implementation of a hierarchy of algorithms for multiplication of dense mat...
AbstractWe propose a new storage scheme (word packing) for matrices with elements in Z2 that enables...
a r t i c l e i n f o a b s t r a c t Dedicated to Professor Gad M. Landau, on the occasion of his 6...
International audienceBini–Capovani–Lotti–Romani approximate formula (or border rank) for matrix mul...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
As nowadays Machine Learning (ML) techniques are generating huge data collections, the problem of h...
In the packed string matching problem, each machine word accomodates alpha characters, thus an n-cha...
International audienceWe propose to store several integers modulo a small prime into a single machin...
In this letter, a functional Z2-FET DRAM memory matrix is experimentally demonstrated for the first ...
The authors describe a new extension to ScaLAPACK for computing with symmetric (Hermitian) matrices ...
We describe a new extension to ScaLAPACK [2] for computing with symmetric (Hermitian) matrices store...
Abstract: Suppose the bits of a computer word are partitioned into d disjoint sets, each of which is...
International audienceOver the past few years, multicore systems have become more and more powerful ...
For matrix multiplication on hypercube multiprocessors with the product matrix accumulated in place ...
Combinatorial scientific computing plays an important enabling role in computational science, partic...
We describe an efficient implementation of a hierarchy of algorithms for multiplication of dense mat...