Matrix multiplication is an essential part of many applications, such as linear algebra, image processing and machine learning. One platform used in such applications is TensorFlow, which is a machine learning library whose structure is based on dataflow programming paradigm. In this work, a method for multiplication of medium-density matrices on multicore CPUs using TensorFlow platform is proposed. This method, called tbt_matmul, utilizes TensorFlow built-in methods tf.matmul and tf.sparse_matmul. By partitioning each input matrix into four smaller sub-matrices, called tiles, and applying an appropriate multiplication method to each pair depending on their density, the proposed method outperforms the built-in methods for matrices of medium...
NVIDIA Tensor Core is a mixed-precision matrix-matrix multiplication and addition computing unit, wh...
Since data sizes of analytical applications are continuously growing, many data scientists are switc...
Matrix multiplication is a core building block for numerous scientific computing and, more recently,...
Matrix multiplication is an essential part of many applications, such as linear algebra, image proce...
140 pagesTensor algebra lives at the heart of big data applications. Where classical machine learnin...
This work is comprised of two different projects in numerical linear algebra. The first project is a...
Computational intensive applications such as pattern recognition, and natural language processing, a...
This Master Thesis examines if a matrix multiplication program that combines the two efficiency stra...
Popular Machine Learning (ML) and High Performance Computing (HPC) workloads contribute to a signifi...
In this paper, a new methodology for computing the Dense Matrix Vector Multiplication, for both embe...
Sparse computations are ubiquitous in computational codes, with the sparse matrix-vector (SpMV) mult...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
A number of parallel formulations of dense matrix multiplication algorithm have been developed. For ...
AbstractIn this article, we present a fast algorithm for matrix multiplication optimized for recent ...
To respond to the intense computational load of deep neural networks, a plethora of domain-specific ...
NVIDIA Tensor Core is a mixed-precision matrix-matrix multiplication and addition computing unit, wh...
Since data sizes of analytical applications are continuously growing, many data scientists are switc...
Matrix multiplication is a core building block for numerous scientific computing and, more recently,...
Matrix multiplication is an essential part of many applications, such as linear algebra, image proce...
140 pagesTensor algebra lives at the heart of big data applications. Where classical machine learnin...
This work is comprised of two different projects in numerical linear algebra. The first project is a...
Computational intensive applications such as pattern recognition, and natural language processing, a...
This Master Thesis examines if a matrix multiplication program that combines the two efficiency stra...
Popular Machine Learning (ML) and High Performance Computing (HPC) workloads contribute to a signifi...
In this paper, a new methodology for computing the Dense Matrix Vector Multiplication, for both embe...
Sparse computations are ubiquitous in computational codes, with the sparse matrix-vector (SpMV) mult...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
A number of parallel formulations of dense matrix multiplication algorithm have been developed. For ...
AbstractIn this article, we present a fast algorithm for matrix multiplication optimized for recent ...
To respond to the intense computational load of deep neural networks, a plethora of domain-specific ...
NVIDIA Tensor Core is a mixed-precision matrix-matrix multiplication and addition computing unit, wh...
Since data sizes of analytical applications are continuously growing, many data scientists are switc...
Matrix multiplication is a core building block for numerous scientific computing and, more recently,...