NUMA (non-uniform memory access) servers are commonly used in high-performance computing and datacenters. Within each server, a processor-interconnect (e.g., Intel QPI, AMD HyperTransport) is used to communicate between the different sockets or nodes. In this work, we explore the impact of the processor-interconnect on overall performance -- in particular, the performance un- fairness caused by processor-interconnect arbitration. It is well known that locally-fair arbitration does not guarantee globally-fair bandwidth sharing as closer nodes receive more bandwidth in a multi-hop network. However, this work demonstrates that the opposite can occur in a commodity NUMA server where remote nodes receive higher bandwidth (and perform better). We...
Due to their excellent price-performance ratio, clusters built from commodity nodes have become broa...
Abstract—With wide application of virtualization technology, tenants are able to access isolated clo...
In a network-on-chip (NoC) based system, the NoC is a shared resource among multiple processor cores...
Present day multi-core processors integrate dozens of small processing cores with an on-chip network...
Fair arbitration in the access to hardware shared resources is fundamental to obtain low worst-case ...
International audienceNowadays, virtualization is a central element in data centers as it allows sha...
In Proc. of the 2014 Makassar International Conference on Electrical Engineering and Informatics (MI...
In scalable multiprocessor architectures, the times required for a processor to access various porti...
Proceedings of: Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 201...
This paper studies application performance on systems with strongly non-uniform remote memory access...
Interconnection networks usually consist of a fabric of interconnected routers, which receive packet...
While virtualization only introduces a negligible overhead on machines with few cores, this is not t...
In this paper, the concept of fairness in computer networks is investigated. We motivate the need of...
This paper studies application performance on systems with strongly non-uniform remote memory access...
In Generalized Processor Sharing (GPS), the goal of the scheduler is to maximize the processor utili...
Due to their excellent price-performance ratio, clusters built from commodity nodes have become broa...
Abstract—With wide application of virtualization technology, tenants are able to access isolated clo...
In a network-on-chip (NoC) based system, the NoC is a shared resource among multiple processor cores...
Present day multi-core processors integrate dozens of small processing cores with an on-chip network...
Fair arbitration in the access to hardware shared resources is fundamental to obtain low worst-case ...
International audienceNowadays, virtualization is a central element in data centers as it allows sha...
In Proc. of the 2014 Makassar International Conference on Electrical Engineering and Informatics (MI...
In scalable multiprocessor architectures, the times required for a processor to access various porti...
Proceedings of: Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 201...
This paper studies application performance on systems with strongly non-uniform remote memory access...
Interconnection networks usually consist of a fabric of interconnected routers, which receive packet...
While virtualization only introduces a negligible overhead on machines with few cores, this is not t...
In this paper, the concept of fairness in computer networks is investigated. We motivate the need of...
This paper studies application performance on systems with strongly non-uniform remote memory access...
In Generalized Processor Sharing (GPS), the goal of the scheduler is to maximize the processor utili...
Due to their excellent price-performance ratio, clusters built from commodity nodes have become broa...
Abstract—With wide application of virtualization technology, tenants are able to access isolated clo...
In a network-on-chip (NoC) based system, the NoC is a shared resource among multiple processor cores...