Distributed key-value stores (KVS) are a well-established approach for cloud data-intensive applications, but they were not designed to consider workloads with data access skew, mainly caused by popular data. In this work, we analyze the problem of replica placement on KVS for workloads with data access skew. We formally define our problem as a multi-objective optimization problem because not only load imbalance cost, but replica maintenance and reconfiguration costs affect system performance as well. To solve the replica placement problem, we present the PopRing replica placement component based on Genetic algorithms to find new replica placements efficiently. Next, we extend PopRing framework with a hyper-parameter optimization component ...
Avoiding latency variability in distributed storage systems is challenging. Even in well-provisioned...
Achieving timely access to data objects is a major challenge in big distributed systems like the Int...
International audienceAvoiding latency variability in distributed storage systems is challenging. Ev...
Replication plays an important role for storage system to improve dataavailability, throughput and r...
Today, users of interactive services such as e-commerce, web search have increasingly high expectati...
This paper addresses the problem of autonomic data placement in replicated key-value stores. The goa...
National audienceDistributed Hash Tables (DHTs) provide a scalable solution for data sharing in P2P ...
Nowadays, replication technique is widely used in datacenter storage systems to prevent data loss. D...
International audienceDistributed Hash Tables (DHTs) provide a scalable solution for data sharing in...
In the cloud storage system, data sets replicas technology can efficiently enhance data availability...
Replication is an essential corner stone fordata storage not only for traditional storage systemsbut...
Peer-assisted cloud storage systems use the unutilizedresources of the clients subscribed to a stora...
In distributed key-value storage systems, Apache Cassandra is known for its scalability and fault to...
The Hadoop Distributed File System (HDFS) is a distributed storage system that stores large volumes ...
Distributed systems offer resources to be accessed geographically for large-scale data requests of d...
Avoiding latency variability in distributed storage systems is challenging. Even in well-provisioned...
Achieving timely access to data objects is a major challenge in big distributed systems like the Int...
International audienceAvoiding latency variability in distributed storage systems is challenging. Ev...
Replication plays an important role for storage system to improve dataavailability, throughput and r...
Today, users of interactive services such as e-commerce, web search have increasingly high expectati...
This paper addresses the problem of autonomic data placement in replicated key-value stores. The goa...
National audienceDistributed Hash Tables (DHTs) provide a scalable solution for data sharing in P2P ...
Nowadays, replication technique is widely used in datacenter storage systems to prevent data loss. D...
International audienceDistributed Hash Tables (DHTs) provide a scalable solution for data sharing in...
In the cloud storage system, data sets replicas technology can efficiently enhance data availability...
Replication is an essential corner stone fordata storage not only for traditional storage systemsbut...
Peer-assisted cloud storage systems use the unutilizedresources of the clients subscribed to a stora...
In distributed key-value storage systems, Apache Cassandra is known for its scalability and fault to...
The Hadoop Distributed File System (HDFS) is a distributed storage system that stores large volumes ...
Distributed systems offer resources to be accessed geographically for large-scale data requests of d...
Avoiding latency variability in distributed storage systems is challenging. Even in well-provisioned...
Achieving timely access to data objects is a major challenge in big distributed systems like the Int...
International audienceAvoiding latency variability in distributed storage systems is challenging. Ev...