The distributed data management system Rucio manages all data of the ATLAS collaboration across the grid. Automation, such as replication and rebalancing, is an important part of keeping workflow execution times to a minimum. In this paper, a new rebalancing algorithm based on machine learning is proposed. First, it can run independently of the existing rebalancing mechanism and can be modularised. It collects data from other services and learns optimality as it runs in the background. Periodically, this learning agent takes a subset of the global datasets and proposes them for redistribution to reduce waiting times. The user can interact and choose to accept, decline, or override the dataset placement suggestions. The accepted items are shifte...
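The accept, decline, or override step described above can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the names `Proposal`, `Decision`, and `resolve`, as well as the dataset and site labels, are hypothetical and are not taken from the paper or from the Rucio API.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional, Tuple


class Decision(Enum):
    """The three user responses named in the abstract."""
    ACCEPT = "accept"
    DECLINE = "decline"
    OVERRIDE = "override"


@dataclass(frozen=True)
class Proposal:
    """A placement suggestion (hypothetical structure, not Rucio's)."""
    dataset: str      # dataset the agent suggests moving
    source: str       # site that currently holds the replica
    destination: str  # site suggested by the learning agent


def resolve(
    proposal: Proposal,
    decision: Decision,
    override_destination: Optional[str] = None,
) -> Optional[Tuple[str, str, str]]:
    """Return the (dataset, source, destination) move to carry out, or None."""
    if decision is Decision.DECLINE:
        # A declined suggestion results in no data movement.
        return None
    if decision is Decision.OVERRIDE:
        # The user keeps the dataset choice but picks another destination.
        if override_destination is None:
            raise ValueError("an override needs a destination")
        return (proposal.dataset, proposal.source, override_destination)
    # ACCEPT: carry out the suggestion unchanged.
    return (proposal.dataset, proposal.source, proposal.destination)
```

For example, `resolve(Proposal("data.A", "SITE_X", "SITE_Y"), Decision.OVERRIDE, "SITE_Z")` keeps the suggested dataset and source but sends the replica to the user's chosen site instead of the agent's.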
This paper presents a system to predict future data popularity for data-intensive systems, such as A...
National audience. Many companies are using MapReduce applications to process very large amounts of da...
The prosperity of Big Data owes to the advances in distributed computing systems, which make it poss...
The distributed data management system Rucio manages all data of the ATLAS collaboration across the ...
The ATLAS Distributed Data Management system stores more than 220PB of physics data across more than...
This contribution presents a study on the applicability and usefulness of dynamic data placement met...
A core problem affecting distributed data management systems relates to deciding the optimal system...
The increasing volume of physics data is posing a critical challenge to the ATLAS experiment. In ant...
International audience. Many companies are using MapReduce applications to process very large amounts ...
In-memory transactional data grids have proven extremely well suited to cloud-based environments, give...
Thesis (Ph.D.)--University of Washington, 2019. Distributed systems consist of many components that in...
Cloud technologies provide capabilities that can guarantee to the end user high availability, perfor...
This paper proposes a method of migrating workload among geo-distributed data centres that are equip...