This work presents several techniques for enlarging instruction streams. We use the term stream to refer to a sequence of instructions from the target of a taken branch to the next taken branch, potentially containing multiple basic blocks. The length of instruction streams makes it possible for a fetch engine based on streams to provide high fetch bandwidth, achieving performance comparable to a trace cache. The length of streams also enables the next stream predictor to tolerate the prediction table access latency. Therefore, enlarging instruction streams improves the behavior of a stream-based fetch engine. We provide a comprehensive analysis of dynamic instruction streams, showing that focusing on particular kinds of...
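As a minimal illustration of the stream definition above, a dynamic instruction trace can be segmented into streams at taken branches. The trace format here (a list of `(pc, taken)` pairs) is a hypothetical simplification for the sketch, not the representation used in the work itself:

```python
# Segment a dynamic instruction trace into streams: each stream runs
# from the target of a taken branch up to and including the next
# taken branch, and may span multiple basic blocks.
# Hypothetical trace format: (pc, taken) pairs, where `taken` is True
# when the instruction is a branch that is taken.

def split_into_streams(trace):
    streams = []
    current = []
    for pc, taken in trace:
        current.append(pc)
        if taken:                 # a taken branch ends the current stream
            streams.append(current)
            current = []
    if current:                   # trailing instructions form a final stream
        streams.append(current)
    return streams

# Example: two streams; the branches at 0x1 and 0x9 are taken.
trace = [(0x0, False), (0x1, True), (0x8, False), (0x9, True)]
print(split_into_streams(trace))  # [[0, 1], [8, 9]]
```

Longer streams mean fewer stream boundaries per fetched instruction, which is why enlarging streams raises the fetch bandwidth a stream-based engine can sustain.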
The design of higher performance processors has been following two major trends: increasing the pipe...
L1 instruction-cache misses pose a critical performance bottleneck in commercial server workloads. C...
In the pursuit of instruction-level parallelism, significant demands are placed on a processor's ins...
The stream fetch engine is a high-performance fetch architecture based on the concept of an instruct...
The next stream predictor is an accurate branch predictor that provides stream level sequencing. Eve...
Fetch engine performance is seriously limited by the branch prediction table access latency. This fa...
Fetch performance is a very important factor because it effectively limits the overall processor per...
A sequence of branch instructions in the dynamic instruction stream forms a branch sequence if at mo...
The access latency of branch predictors is a well known problem of fetch engine design. Prediction o...
The continually increasing speed of microprocessors stresses the need for ever faster instruction fe...
Future processors combining out-of-order execution with aggressive speculation techniques will need ...
Achieving high instruction issue rates depends on the ability to dynamically predict branches. We co...
Fetch engine performance is a key topic in superscalar processors, since it limits the instructionle...