Update MKLML version 20170908 that fixes a bug related to data conversions) Add SSD example for bounding box object detection that works for both GPU and MKL backend Add DeepSpeech2 MKL backend optimization that features ~3X improvement Update aeon to 1.0.0 including new version of manifest (doc/source/loading_data.rst#aeon-dataloader) Add CHWD Support for Batch Normalization in mkl backend Modify ResNet-50 model's last layer to match the original ResNet-50 model paper Enable Seq2Seq testing and benchmarkin
Python2/Python3 compatibility [#191] Support for Pascal GPUs Persistent RNN kernels [#262] Implemen...
Bugfix: fixed a bug for the plumed interface when the number of atoms exceeds 1024 #339 fixed a fil...
Metal v0.2.0 Diff since v0.1.2 Closed issues: Threadgroup memory breaks on small datatypes (#26) In...
Optimized SSD MKL backend performance (~3X boost version over version) Bumped aeon version to v1.3.0...
Update Data Loader to aeon https://github.com/NervanaSystems/aeon for flexible, multi-threaded data ...
Optimized DeepSpeech2 MKL backend performance (~7X improvement over the CPU backend) Fused convoluti...
Added support for MKL backend (-b mkl) on Linux, which boosts neon CPU performance significantly Add...
Set MKL backend (-b mkl) as the default CPU backend on Linux (use -b cpu to specify original CPU bac...
Further optimized MKL backend performance for SSD inference Updated MKLML to version 20171227 Enable...
Add support for 3D deconvolution Generative Adversarial Networks (GAN) implementation, and MNIST DCG...
Faster RCNN model Sequence to Sequence container and char_rae recurrent autoencoder model Reshape La...
Skip Thought Vectors (http://arxiv.org/abs/1506.06726) example Dilated convolution support Nesterov ...
With the rapid growth of deep learning models and higher expectations for their accuracy and through...
Bug fix: Add dilation to object dict and assign defaults to dil_w = dil_h = 1 [#335, #336] Bug fix: ...
1.3.0 Added MS-SVConv: https://arxiv.org/abs/2103.14533 (thanks @humanpose1) added new data generat...
Python2/Python3 compatibility [#191] Support for Pascal GPUs Persistent RNN kernels [#262] Implemen...
Bugfix: fixed a bug for the plumed interface when the number of atoms exceeds 1024 #339 fixed a fil...
Metal v0.2.0 Diff since v0.1.2 Closed issues: Threadgroup memory breaks on small datatypes (#26) In...
Optimized SSD MKL backend performance (~3X boost version over version) Bumped aeon version to v1.3.0...
Update Data Loader to aeon https://github.com/NervanaSystems/aeon for flexible, multi-threaded data ...
Optimized DeepSpeech2 MKL backend performance (~7X improvement over the CPU backend) Fused convoluti...
Added support for MKL backend (-b mkl) on Linux, which boosts neon CPU performance significantly Add...
Set MKL backend (-b mkl) as the default CPU backend on Linux (use -b cpu to specify original CPU bac...
Further optimized MKL backend performance for SSD inference Updated MKLML to version 20171227 Enable...
Add support for 3D deconvolution Generative Adversarial Networks (GAN) implementation, and MNIST DCG...
Faster RCNN model Sequence to Sequence container and char_rae recurrent autoencoder model Reshape La...
Skip Thought Vectors (http://arxiv.org/abs/1506.06726) example Dilated convolution support Nesterov ...
With the rapid growth of deep learning models and higher expectations for their accuracy and through...
Bug fix: Add dilation to object dict and assign defaults to dil_w = dil_h = 1 [#335, #336] Bug fix: ...
1.3.0 Added MS-SVConv: https://arxiv.org/abs/2103.14533 (thanks @humanpose1) added new data generat...
Python2/Python3 compatibility [#191] Support for Pascal GPUs Persistent RNN kernels [#262] Implemen...
Bugfix: fixed a bug for the plumed interface when the number of atoms exceeds 1024 #339 fixed a fil...
Metal v0.2.0 Diff since v0.1.2 Closed issues: Threadgroup memory breaks on small datatypes (#26) In...