Deep Learning has achieved outstanding results in many fields and led to groundbreaking discoveries. With the steady increase in datasets and model sizes, there has been a recent surge in Machine Learning applications in High-Performance Computing (HPC) to speed up training. Deep Neural Network (DNN) frameworks use distributed training to enable faster time to convergence and alleviate memory capacity limitations when training large models or using high dimension inputs. However, training DNN in HPC infrastructures presents a unique set of challenges: scalability, IO contention, network congestion and fault tolerance. Solving these problems is particularly challenging and unique due to DL applications’ nature and the history of adaptation o...
Recent advances in storage technologies and high performance interconnects have made possible in the...
Recently, high performance processor designs have evolved toward Chip-Multiprocessor (CMP) architect...
Since their existence, computers have been a great asset to mankind, primarily because of their abil...
Deep Learning has achieved outstanding results in many fields and led to groundbreaking discoveries....
Inter-node communication has turned out to be one of the determining factors of the performance on m...
As high performance computing (HPC) systems continue to grow, their fault rate increases. Applicatio...
Deep Learning (DL) is acquiring more and more applications in the industry, but not all professional...
Memory systems are signicant contributors to the overall power requirements, energy consumption, and...
The recent innovations in Machine Learning techniques have led to a large utilization of intelligent...
Mención Internacional en el título de doctorA great number of scientific projects need supercomputin...
Deep learning powers many transformative core technologies including Autonomous Driving, Natural Lan...
The memory system is a significant contributor for most of the current challenges in computer archit...
Convolutional Neural Networks (CNNs) have become the most used and efficient way to identify and cla...
In the last decade we witnessed an immense evolution of the computing infrastructures in terms of p...
Increasingly High-Performance Computing (HPC) applications run on heterogeneous multi-core platforms...
Recent advances in storage technologies and high performance interconnects have made possible in the...
Recently, high performance processor designs have evolved toward Chip-Multiprocessor (CMP) architect...
Since their existence, computers have been a great asset to mankind, primarily because of their abil...
Deep Learning has achieved outstanding results in many fields and led to groundbreaking discoveries....
Inter-node communication has turned out to be one of the determining factors of the performance on m...
As high performance computing (HPC) systems continue to grow, their fault rate increases. Applicatio...
Deep Learning (DL) is acquiring more and more applications in the industry, but not all professional...
Memory systems are signicant contributors to the overall power requirements, energy consumption, and...
The recent innovations in Machine Learning techniques have led to a large utilization of intelligent...
Mención Internacional en el título de doctorA great number of scientific projects need supercomputin...
Deep learning powers many transformative core technologies including Autonomous Driving, Natural Lan...
The memory system is a significant contributor for most of the current challenges in computer archit...
Convolutional Neural Networks (CNNs) have become the most used and efficient way to identify and cla...
In the last decade we witnessed an immense evolution of the computing infrastructures in terms of p...
Increasingly High-Performance Computing (HPC) applications run on heterogeneous multi-core platforms...
Recent advances in storage technologies and high performance interconnects have made possible in the...
Recently, high performance processor designs have evolved toward Chip-Multiprocessor (CMP) architect...
Since their existence, computers have been a great asset to mankind, primarily because of their abil...