[EN] In the world of algorithm acceleration and the implementation of deep neural networks' recall phase, OpenCL based solutions have a clear tendency to produce perfectly adapted kernels in graphic processor unit (GPU) architectures. However, they fail to obtain the same results when applied to field-programmable gate array (FPGA) based architectures. This situation, along with an enormous advance in new GPU architectures, makes it unfeasible to defend an acceleration solution based on FPGA, even in terms of energy efficiency. Our goal in this paper is to demonstrate that multikernel structures can be written based on classic systolic arrays in OpenCL, trying to extract the most advanced features of FPGAs without having to resort to tradit...
In recent years, with the development of computer science, deep learning is held as competent enough...
The importance of security infrastructures for high-throughput networks has rapidly grown as a resul...
New chips for machine learning applications appear, they are tuned for a specific topology, being ef...
The problem of automatically generating hardware modules from high level application representations...
Many embedded applications have to cope with real-time data streams, e.g. video, audio, network, sen...
In our study, we present the results of the implementation of SHA-512 algorithm in FPGA. The disting...
Deep learning has significantly advanced the state of the art in artificial intelligence, gaining w...
This document presents an evaluation of OpenCL as a mechanism to exploit FPGA resources. To evaluate...
In recent years, deep convolutional neural networks (ConvNet) have shown their popularity in various...
International audienceThe work presented deals with the evaluation of F-PGAs resurgence for hardware...
Due to their hardware architecture, Field Programmable Gate Arrays (FPGAs) are optimally suited for ...
Deep learning has significantly advanced the state of the art in artificial intelligence, gaining wi...
FPGA-based accelerators have recently evolved as strong competitors to the traditional GPU-based acc...
The design cycle for complex special-purpose computing systems is extremely costly and time-consumin...
[EN] In the optimization of deep neural networks (DNNs) via evolutionary algorithms (EAs) and the im...
In recent years, with the development of computer science, deep learning is held as competent enough...
The importance of security infrastructures for high-throughput networks has rapidly grown as a resul...
New chips for machine learning applications appear, they are tuned for a specific topology, being ef...
The problem of automatically generating hardware modules from high level application representations...
Many embedded applications have to cope with real-time data streams, e.g. video, audio, network, sen...
In our study, we present the results of the implementation of SHA-512 algorithm in FPGA. The disting...
Deep learning has significantly advanced the state of the art in artificial intelligence, gaining w...
This document presents an evaluation of OpenCL as a mechanism to exploit FPGA resources. To evaluate...
In recent years, deep convolutional neural networks (ConvNet) have shown their popularity in various...
International audienceThe work presented deals with the evaluation of F-PGAs resurgence for hardware...
Due to their hardware architecture, Field Programmable Gate Arrays (FPGAs) are optimally suited for ...
Deep learning has significantly advanced the state of the art in artificial intelligence, gaining wi...
FPGA-based accelerators have recently evolved as strong competitors to the traditional GPU-based acc...
The design cycle for complex special-purpose computing systems is extremely costly and time-consumin...
[EN] In the optimization of deep neural networks (DNNs) via evolutionary algorithms (EAs) and the im...
In recent years, with the development of computer science, deep learning is held as competent enough...
The importance of security infrastructures for high-throughput networks has rapidly grown as a resul...
New chips for machine learning applications appear, they are tuned for a specific topology, being ef...