Metadata snapshots are a common method for gaining insight into file systems due to their small size and relative ease of acquisition. Since they are static, most researchers have used them for relatively simple analyses such as file size distributions and file ages. We hypothesize that it is possible to gain much richer insights into file system and user behavior by clustering features in metadata snapshots and comparing the entropy within clusters to the entropy within natural partitions such as directory hierarchies. We discuss several different methods for gaining deeper insights into metadata snapshots, and show a small proof of concept using data from Los Alamos National Laboratory. In our initial work, we see evidence that it is...
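The entropy comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record fields (`dir`, `cluster`, `owner`), the choice of file owner as the feature, and the cluster labels themselves are hypothetical, and the labels are assumed to come from some clustering step that is not shown here. The sketch only computes the size-weighted Shannon entropy of one feature within each group, once grouped by directory and once grouped by cluster.

```python
import math
from collections import Counter, defaultdict


def shannon_entropy(values):
    """Shannon entropy (in bits) of the empirical distribution of `values`."""
    counts = Counter(values)
    total = sum(counts.values())
    # p * log2(1/p) summed over the observed values
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def mean_partition_entropy(records, group_key, feature):
    """Size-weighted average entropy of `feature` within each group."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec[group_key]].append(rec[feature])
    total = len(records)
    return sum(len(v) / total * shannon_entropy(v) for v in groups.values())


# Hypothetical snapshot rows; "cluster" is an assumed, precomputed label.
records = [
    {"dir": "/a", "cluster": 0, "owner": "alice"},
    {"dir": "/a", "cluster": 1, "owner": "bob"},
    {"dir": "/b", "cluster": 0, "owner": "alice"},
    {"dir": "/b", "cluster": 1, "owner": "bob"},
]

by_dir = mean_partition_entropy(records, "dir", "owner")
by_cluster = mean_partition_entropy(records, "cluster", "owner")
```

In this toy data every directory mixes both owners while every cluster contains a single owner, so the within-cluster entropy is lower than the within-directory entropy. On a real snapshot the two numbers would be compared in the same way for whichever features are of interest.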
Online archival capabilities like snapshots or checkpoints are fast becoming an essential component ...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
A huge increase in data storage and processing requirements has led to Big Data, for which next gen...
Metadata snapshots are a common method for gaining insight into filesystems due to their sm...
File system consistency frequently involves a choice between raw performance and integrity guarantee...
Efficient namespace metadata management is becoming more important as next-generation file systems a...
An efficient and distributed scheme for file mapping or file lookup is critical in decentralizing me...
Modern high-end computing systems store hundreds of petabytes of data and have billions of files, as...
File system studies are critical to the accurate configuration, design, and continued evolution of s...
One of the most fundamental storage system research tasks is activity tracing. By understanding the ...
With the advent of emerging e-Science applications, today's scientific research increasingly re...
Investigating cybersecurity incidents requires in-depth knowledge from the analyst. Moreover, the wh...