Machine learning techniques are applicable to computer system optimization. We show that shared memory multiprocessors can successfully utilize machine learning algorithms for memory access pattern prediction. In particular three different on-line machine learning prediction techniques were tested to learn and predict repetitive memory access patterns for three typical parallel processing applications, the 2-D relaxation algorithm, matrix multiply and Fast Fourier Transform on a shared memory multiprocessor. The predictions were then used by a routing control algorithm to reduce control latency in the interconnection network by configuring the interconnection network to provide needed memory access paths before they were requested. Three tr...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
he Von Neumann bottleneck is a persistent problem in computer architecture, causing stalls and waste...
Scientific and technological advances in the area of integrated circuits have allowed the performanc...
Shared memory multiprocessors require reconfigurable interconnection networks (INs) for scalability...
A neural network based technique is introduced which hides the control latency of reconfigurable int...
A neural network based technique is introduced which hides the control latency of reconfigurable int...
Modern operating systems use main memory as a cache over disk-based storage. The time spent waiting ...
Efficient data supply to the processor is the one of the keys to achieve high performance. However, ...
The efficient mapping of program parallelism to multi-core processors is highly dependent on the und...
The solutions to many problems in computer architecture involve predictions, which are often based o...
Embedded systems need to respect stringent real time constraints. Various hardware components includ...
The vast number of transistors available through modern fabrication technology gives architects an u...
Improving the reliability and performance are of utmost importance for any system. This thesis prese...
Recent research advocates using general message predictors to learn and predict the coherence activi...
Cache memories are commonly implemented through multiple memory banks to improve bandwidth and laten...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
he Von Neumann bottleneck is a persistent problem in computer architecture, causing stalls and waste...
Scientific and technological advances in the area of integrated circuits have allowed the performanc...
Shared memory multiprocessors require reconfigurable interconnection networks (INs) for scalability...
A neural network based technique is introduced which hides the control latency of reconfigurable int...
A neural network based technique is introduced which hides the control latency of reconfigurable int...
Modern operating systems use main memory as a cache over disk-based storage. The time spent waiting ...
Efficient data supply to the processor is the one of the keys to achieve high performance. However, ...
The efficient mapping of program parallelism to multi-core processors is highly dependent on the und...
The solutions to many problems in computer architecture involve predictions, which are often based o...
Embedded systems need to respect stringent real time constraints. Various hardware components includ...
The vast number of transistors available through modern fabrication technology gives architects an u...
Improving the reliability and performance are of utmost importance for any system. This thesis prese...
Recent research advocates using general message predictors to learn and predict the coherence activi...
Cache memories are commonly implemented through multiple memory banks to improve bandwidth and laten...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
he Von Neumann bottleneck is a persistent problem in computer architecture, causing stalls and waste...
Scientific and technological advances in the area of integrated circuits have allowed the performanc...