International audienceThis paper proposes a new data placement policy to allocate data blocks across storage servers of distribute/parallel file systems, for yielding even block access workload distribution. To this end, we first analyze the history of block access sequence of a specific application, and then introduce a k-partition algorithm to divide data blocks into multiple groups, by referring their access frequency. After that, each group has almost same access workloads, we can thus distribute these block groups onto storage servers of distributed file system, to achieve the goal of uniformly assigning data blocks when running the application. In summary, this newly proposed data placement policy can yield not only an even data distr...
During the last few decades, Data-intensive File Systems (DiFS), such as Google File System (GFS) an...
AbstractThe effectiveness of a distributed system hinges on the manner in which tasks and data are a...
Part 9: StorageInternational audienceMany current distributed file systems use erasure-coding based ...
Nowadays, replication technique is widely used in data centerstorage systems to prevent data loss. D...
The amount of encoded data replication in an erasure-coded clustered storage system has a great impa...
In this paper, a method of balancing both access fre-quency and data amount for a distributed parall...
Distributed file systems have become popular because they allow information to be shared be between ...
Disk drives are the bottleneck in the processing of large amounts of data used in almost all common ...
The current Hadoop block placement policy do not fairly and evenly distributes replicas of blocks wr...
Parallel systems leverage parallel file systems to efficiently perform I/O to shared files. These pa...
Part 5: I/O, File Systems, and Data ManagementInternational audienceThis paper presents a novel mech...
Parallel transmission algorithms CLBA and DAS greatly improved the speed of transferring data files,...
With the rapid development of computation capability, the massive increase in data volume has outmod...
We present a randomized block-level storage virtualization for arbitrary heterogeneous storage syste...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
During the last few decades, Data-intensive File Systems (DiFS), such as Google File System (GFS) an...
AbstractThe effectiveness of a distributed system hinges on the manner in which tasks and data are a...
Part 9: StorageInternational audienceMany current distributed file systems use erasure-coding based ...
Nowadays, replication technique is widely used in data centerstorage systems to prevent data loss. D...
The amount of encoded data replication in an erasure-coded clustered storage system has a great impa...
In this paper, a method of balancing both access fre-quency and data amount for a distributed parall...
Distributed file systems have become popular because they allow information to be shared be between ...
Disk drives are the bottleneck in the processing of large amounts of data used in almost all common ...
The current Hadoop block placement policy do not fairly and evenly distributes replicas of blocks wr...
Parallel systems leverage parallel file systems to efficiently perform I/O to shared files. These pa...
Part 5: I/O, File Systems, and Data ManagementInternational audienceThis paper presents a novel mech...
Parallel transmission algorithms CLBA and DAS greatly improved the speed of transferring data files,...
With the rapid development of computation capability, the massive increase in data volume has outmod...
We present a randomized block-level storage virtualization for arbitrary heterogeneous storage syste...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
During the last few decades, Data-intensive File Systems (DiFS), such as Google File System (GFS) an...
AbstractThe effectiveness of a distributed system hinges on the manner in which tasks and data are a...
Part 9: StorageInternational audienceMany current distributed file systems use erasure-coding based ...