This paper describes several methods for improving the scalability of memory disambiguation hardware for future high ILP processors. As the number of in-flight instructions grows with issue width and pipeline depth, the load/store queues (LSQ) threaten to become a bottleneck in both power and latency. By employing lightweight approximate hashing in hardware with structures called Bloom filters many improvements to the LSQ are possible. We propose two types of filtering schemes using Bloom filters: search filtering, which uses hashing to reduce both the number of lookups to the LSQ and the number of entries that must be searched, and state filtering, in which the number of entries kept in the LSQs is reduced by coupling address predictors an...
Multicore processors have emerged as a powerful platform on which to efficiently exploit thread-leve...
The decreasing cost of DRAM has made possible and grown the use of in-memory databases. However, mem...
To maximize the performance of wide-issue superscalar out-of-order microprocessors, the issue stage ...
Because they are based on large content-addressable memories, load-store queues (LSQs) present imple...
A store queue (SQ) is a critical component of the load execution machinery. High ILP processors requ...
One of the main challenges of modern processor designs is the implementation of scalable and efficie...
In most modern processor designs, the HW dedicated to store data and instructions (memory hierarchy)...
The load-store queue (LQ-SQ) of modem superscalar processors is responsible for keeping the order of...
In high-end processors, increasing the number of in-flight instructions can improve performance by o...
A Large instruction window is a key requirement to exploit greater Instruction Level Parallelism in ...
International audienceMemory disambiguation mechanisms, coupled with load/store queues in out-of-ord...
A store queue (SQ) is a critical component of the load execution machinery. High ILP processors requ...
Many network security applications require large virus signature sets to be maintained, retrieved, a...
Journal ArticleModern superscalar processors use wide instruction issue widths and out-of-order exe...
One of the main challenges of modern processor designs is the implementation of scalable and efficie...
Multicore processors have emerged as a powerful platform on which to efficiently exploit thread-leve...
The decreasing cost of DRAM has made possible and grown the use of in-memory databases. However, mem...
To maximize the performance of wide-issue superscalar out-of-order microprocessors, the issue stage ...
Because they are based on large content-addressable memories, load-store queues (LSQs) present imple...
A store queue (SQ) is a critical component of the load execution machinery. High ILP processors requ...
One of the main challenges of modern processor designs is the implementation of scalable and efficie...
In most modern processor designs, the HW dedicated to store data and instructions (memory hierarchy)...
The load-store queue (LQ-SQ) of modem superscalar processors is responsible for keeping the order of...
In high-end processors, increasing the number of in-flight instructions can improve performance by o...
A Large instruction window is a key requirement to exploit greater Instruction Level Parallelism in ...
International audienceMemory disambiguation mechanisms, coupled with load/store queues in out-of-ord...
A store queue (SQ) is a critical component of the load execution machinery. High ILP processors requ...
Many network security applications require large virus signature sets to be maintained, retrieved, a...
Journal ArticleModern superscalar processors use wide instruction issue widths and out-of-order exe...
One of the main challenges of modern processor designs is the implementation of scalable and efficie...
Multicore processors have emerged as a powerful platform on which to efficiently exploit thread-leve...
The decreasing cost of DRAM has made possible and grown the use of in-memory databases. However, mem...
To maximize the performance of wide-issue superscalar out-of-order microprocessors, the issue stage ...