TeraHeap: Reducing Memory Pressure in Managed Big Data Frameworks

Kolokasis Iacovos
Evdorou Giannos
Akram Shoaib
Kozanitis Christos
Papagiannis Anastasios
Zakkak S. Foivos
Pratikakis Polyvios
Bilas Angelos

Publication date

January 2023

DOI

Abstract

Big data frameworks, such as Spark and Giraph, suffer from high memory pressure because they allocate massive volumes of long-lived objects on the managed heap. Thus, frameworks temporarily move long-lived objects outside the managed heap (off-heap) on a fast storage device. Unfortunately, this practice results in: (1) high serialization/deserialization (S/D) cost, and (2) high garbage collection (GC) cost when many off-heap objects are moved back to the managed heap for processing. In this paper, we propose HugeHeap, which extends the managed runtime (JVM) to use a second, high-capacity heap over a fast storage device that coexists with the regular heap. HugeHeap provides direct access to objects on the second heap (no S/D). It also reduc...

Extracted data

We use cookies to provide a better user experience.

Data Protection

TeraHeap: Reducing Memory Pressure in Managed Big Data Frameworks

Abstract

Extracted data

TeraHeap: Reducing Memory Pressure in Managed Big Data Frameworks

Abstract

Extracted data

Related items

Related items