This report evaluates whether an interpreted high-level garbage collected language has enough information about its memory behaviour to make better cache decisions than modern general CPU hardware. With a generational garbage collector, depending on promotion algorithm and generation size, around 90% of all objects never leave the first generation. This report is based on the hypothesis that, because of the low promotion rate, accesses to higher generations are sufficiently rare not to benefit from caching. To test this hypothesis, we built an operating system with a Scheme interpreter in kernel mode, where the interpreter controls the cache. Generic x86 PC hardware was used, since it allows fine-grained control of cache decisions. Measurem...
As the development of hardware progresses, computers are expected to solve increasingly complex prob...
As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent,...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
This report evaluates whether an interpreted high-level garbage collected language has enough inform...
In this report, we consider the design of cache memories to support the execution of languages that ...
Due to garbage collection and language features that preclude stack-based allocation, functional pro...
The speed of modern computers can be increased by organizing computations so that memory access patt...
Prestanda har historiskt sett alltid varit av betydelse för nyttjandet av datorer, vilket lett till ...
. An interpreter for a virtual stack machine can spend a significant part of its execution time fetc...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
The cost of accessing main memory is increasing. Machine designers have tried to mitigate the conseq...
Automatic memory management is an important concept in many high order languages. It improves produc...
. To deepen our quantitative understanding of the performance of lazy evaluation, we have studied th...
The cost of accessing main memory is increasing. Machine designers have tried to mitigate the conseq...
As the development of hardware progresses, computers are expected to solve increasingly complex prob...
As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent,...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
This report evaluates whether an interpreted high-level garbage collected language has enough inform...
In this report, we consider the design of cache memories to support the execution of languages that ...
Due to garbage collection and language features that preclude stack-based allocation, functional pro...
The speed of modern computers can be increased by organizing computations so that memory access patt...
Prestanda har historiskt sett alltid varit av betydelse för nyttjandet av datorer, vilket lett till ...
. An interpreter for a virtual stack machine can spend a significant part of its execution time fetc...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
The cost of accessing main memory is increasing. Machine designers have tried to mitigate the conseq...
Automatic memory management is an important concept in many high order languages. It improves produc...
. To deepen our quantitative understanding of the performance of lazy evaluation, we have studied th...
The cost of accessing main memory is increasing. Machine designers have tried to mitigate the conseq...
As the development of hardware progresses, computers are expected to solve increasingly complex prob...
As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent,...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...