In large-scale data-intensive environments, such as the World-Wide Web or data warehouses, we often make local copies of remote data sources. Because network and computational resources are limited, however, it is often difficult to monitor the sources constantly for changes and to download the changed data items to the copies. In this scenario, our goal is to detect as many changes as possible with the fixed download resources available. In this paper we propose three sampling-based download policies that identify changed data items more effectively. In our sampling-based approach, we first sample a small number of data items from each data source and then download more data items from the sources with more changed samples. We analyze ...
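The two-phase idea described above (sample each source, then spend the remaining budget where the samples changed most) can be sketched as follows. This is a minimal illustration under stated assumptions, not one of the three policies from the paper: the greedy ordering by sample change ratio, the function name `greedy_sampling_download`, and the `has_changed` callback (standing in for one download plus a comparison against the local copy) are all hypothetical.

```python
import random


def greedy_sampling_download(sources, has_changed, sample_size, budget, rng=None):
    """Detect changed items under a fixed download budget (illustrative sketch).

    sources:     dict mapping a source name to a list of item ids
    has_changed: hypothetical callback (source, item) -> bool; each call is
                 assumed to cost exactly one download
    sample_size: number of items to sample from each source in phase 1
    budget:      total number of downloads allowed
    """
    rng = rng or random.Random(0)
    detected = []    # (source, item) pairs found to have changed
    remaining = {}   # items of each source not yet downloaded
    ratio = {}       # fraction of sampled items that had changed
    used = 0

    # Phase 1: download a small random sample from every source.
    for name, items in sources.items():
        k = min(sample_size, len(items), budget - used)
        sample = rng.sample(items, k)
        sampled = set(sample)
        changed = [item for item in sample if has_changed(name, item)]
        used += k
        detected.extend((name, item) for item in changed)
        ratio[name] = len(changed) / k if k else 0.0
        remaining[name] = [item for item in items if item not in sampled]

    # Phase 2: spend what is left on the sources whose samples changed most.
    # The greedy order is an assumption made for this sketch.
    for name in sorted(sources, key=lambda n: ratio[n], reverse=True):
        for item in remaining[name]:
            if used >= budget:
                return detected
            used += 1
            if has_changed(name, item):
                detected.append((name, item))
    return detected
```

With two sources of ten items each, where every item of one source has changed and none of the other, a sample size of 2 and a budget of 12 downloads is enough for this sketch to find all ten changed items, because the whole second phase goes to the source whose samples had changed.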
We propose the incremental least squares density difference (LSDD) change detection method, an increment...
An important problem in data mining is detecting changes in large data sets. Although there are a va...
Statistical process control techniques have been widely used for online process monitoring and diagn...
Perhaps the most flexible synopsis of a database is a uniform random sample of the data; such sample...
Network applications commonly maintain local copies of remote data sources in order to prov...
We propose a sampling infrastructure for gathering information about software from the set of runs e...
Networked critical infrastructures are of national importance. However, such infrastructures are run...
One response to the proliferation of large datasets has been to develop ingenious ways to throw reso...
Given that Internet traffic speed and volume are growing at a rapid pace, monitoring the network in ...
Currently, data auditing is an important means to check the integrity of the data stored on the clou...
Caching in the World Wide Web is based on two critical assumptions: that a significant fraction of r...
Identifying heavy hitters is essential for network monitoring, management, charging, etc. Existin...
As a software system evolves, developers make changes to add new features or fix different kinds of ...
A search engine maintains local copies of different web pages to provide quick...