We propose Rockmate to control the memory requirements when training PyTorch DNN models. Rockmate is an automatic tool that starts from the model code and generates an equivalent model that uses a predefined amount of memory for activations, at the cost of a few re-computations. Rockmate automatically detects the structure of computational and data dependencies and rewrites the initial model as a sequence of complex blocks. We show that such a structure is widespread and can be found in many models in the literature (Transformer-based models, ResNet, RegNets, ...). This structure allows us to solve the problem in a fast and efficient way, using an adaptation of Checkmate (which is general but too slow on the whole model) at the level of individual blocks...
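For intuition, the following is a minimal PyTorch sketch of the rematerialization trade-off that Rockmate automates, using the standard torch.utils.checkpoint_sequential utility rather than Rockmate itself; the toy model, segment count, and tensor sizes are illustrative assumptions.

    import torch
    import torch.nn as nn
    from torch.utils.checkpoint import checkpoint_sequential

    # Treat the model as a sequence of blocks; activations are kept only at
    # segment boundaries and recomputed inside each segment during backward.
    model = nn.Sequential(
        nn.Linear(1024, 1024), nn.ReLU(),
        nn.Linear(1024, 1024), nn.ReLU(),
        nn.Linear(1024, 1024), nn.ReLU(),
        nn.Linear(1024, 10),
    )
    x = torch.randn(64, 1024, requires_grad=True)

    # Split into two segments: most intermediate activations inside each
    # checkpointed segment are dropped and recomputed in the backward pass,
    # trading extra forward compute for lower peak activation memory.
    out = checkpoint_sequential(model, 2, x, use_reentrant=False)
    out.sum().backward()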
Memory efficiency is crucial in training deep learning networks on resource-restricted devices. Duri...
Transformer models have achieved state-of-the-art performance in various application domains and...
We propose StitchNet, a novel neural network creation paradigm that stitches together fragments (one...
Artificial Intelligence is a field that has received a lot of attention recently. Its success is due...
Rematerialization and offloading are two well-known strategies to save memory ...
Training large transformer models is one of the most important computational challenges of modern AI...
In this work, we propose Retentive Network (RetNet) as a foundation architecture for large language ...
In the context of Deep Learning training, the memory needed to store activations can prevent ...
This paper introduces a new activation checkpointing method that makes it possible to significantly decrease m...
Reservoir Computing Networks (RCNs) belong to a group of machine learning techniques that project th...
Fine-tuning models on edge devices like mobile phones would enable privacy-preserving personalizatio...
Neural network simulations have always been a complex computational challenge because of the req...
The pre-trained model (PTM) is revolutionizing Artificial Intelligence (AI) technology. However, the...
There has been an explosion of interest in designing high-performance Transformers. While Transforme...