The remarkable positive impact of Deep Neural Networks on many Artificial Intelligence (AI) tasks has led to the development of various high performance algorithms as well as specialized processors and accelerators. In this paper we address this scenario by demonstrating that the principles underlying the modern realization of the general matrix multiplication (GEMM) in conventional processor architectures, are also valid to achieve high performance for the type of operations that arise in deep learning (DL) on an exotic accelerator such as the AI Engine (AIE) tile embedded in Xilinx Versal platforms. In particular, our experimental results with a prototype implementation of the GEMM kernel, on a Xilinx Versal VCK190, delivers performance c...
Ponència presentada a 2020 IEEE 32nd International Symposium on Computer Architecture and High Perfo...
General Matrix Multiplication or GEMM kernels take centre place in high performance computing and ma...
The positive societal impacts of artificial intelligence (AI) through the field of deep learning hav...
We revisit a blocked formulation of the direct convolution algorithm that mimics modern realizations...
[EN] We introduce a high performance, multi-threaded realization of the gemm kernel for the ARMv8.2 ...
Efficient implementation of deep neural networks (DNNs) on CPU-based systems is critical owing to th...
The generic matrix multiply (GEMM) function is the core element of high-performance linear algebra l...
A number of recent researches focus on designing accelerators for popular deep learning algorithms. ...
Machine learning has risen to prominence in recent years thanks to advancements in computer technolo...
The recent “Cambrian explosion” of Deep Learning (DL) algorithms in concert with the end of Moore’s ...
Recently, renewed attention to Artificial Intelligence has emerged thanks to algorithms called Deep ...
The parallel nature of FPGA makes it a promising candidate to accelerate machine learning tasks. The...
AbstractThis paper presents results of our study on double-precision general matrix-matrix multiplic...
Edge computing brings artificial intelligence algorithms and graphics processing units closer to dat...
Analog in-memory computing (AIMC) cores offers significant performance and energy benefits for neura...
Ponència presentada a 2020 IEEE 32nd International Symposium on Computer Architecture and High Perfo...
General Matrix Multiplication or GEMM kernels take centre place in high performance computing and ma...
The positive societal impacts of artificial intelligence (AI) through the field of deep learning hav...
We revisit a blocked formulation of the direct convolution algorithm that mimics modern realizations...
[EN] We introduce a high performance, multi-threaded realization of the gemm kernel for the ARMv8.2 ...
Efficient implementation of deep neural networks (DNNs) on CPU-based systems is critical owing to th...
The generic matrix multiply (GEMM) function is the core element of high-performance linear algebra l...
A number of recent researches focus on designing accelerators for popular deep learning algorithms. ...
Machine learning has risen to prominence in recent years thanks to advancements in computer technolo...
The recent “Cambrian explosion” of Deep Learning (DL) algorithms in concert with the end of Moore’s ...
Recently, renewed attention to Artificial Intelligence has emerged thanks to algorithms called Deep ...
The parallel nature of FPGA makes it a promising candidate to accelerate machine learning tasks. The...
AbstractThis paper presents results of our study on double-precision general matrix-matrix multiplic...
Edge computing brings artificial intelligence algorithms and graphics processing units closer to dat...
Analog in-memory computing (AIMC) cores offers significant performance and energy benefits for neura...
Ponència presentada a 2020 IEEE 32nd International Symposium on Computer Architecture and High Perfo...
General Matrix Multiplication or GEMM kernels take centre place in high performance computing and ma...
The positive societal impacts of artificial intelligence (AI) through the field of deep learning hav...