There is currently a burst of Big Data (BD) processed and stored in huge raw data repositories, commonly called Data Lakes (DL). These BD require new techniques of data integration and schema alignment in order to make the data usable by its consumers and to discover the relationships linking their content. This can be provided by metadata services which discover and describe their content. However, there is currently a lack of a systematic approach for such kind of metadata discovery and management. Thus, we propose a framework for the profiling of informational content stored in the DL, which we call information profiling. The profiles are stored as metadata to support data analysis. We formally define a metadata management process which ...
International audienceThe rise of big data has revolutionized data exploitation practices and led to...
Metadata have always played a key role in favoring the cooperation of heterogeneous data sources. Th...
For more than 30 decades, data warehouses have been considered the only business intelligence storag...
There is currently a burst of Big Data (BD) processed and stored in huge raw data repositories, comm...
There is currently a burst of Big Data (BD) processed and stored in huge raw data repositories, comm...
International audienceData lakes have emerged as an alternative to data warehouses for the storage, ...
In addition to volume and velocity, Big data is also characterized by its variety. Variety in struct...
In addition to volume and velocity, Big data is also characterized by its variety. Variety in struct...
Although big data is being discussed for some years, it still has many research challenges, such as ...
International audienceOver the past decade, the data lake concept has emerged as an alternative to d...
47th International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2...
International audienceIn 2010, the concept of data lake emerged as an alternative to data warehouses...
To prevent data lakes from being invisible and inaccessible to users, an efficient metadata manageme...
The heterogeneity of sources in Big Data systems requires new integration approaches which can handl...
As the challenge of our time, Big Data still has many research hassles, especially the variety of da...
International audienceThe rise of big data has revolutionized data exploitation practices and led to...
Metadata have always played a key role in favoring the cooperation of heterogeneous data sources. Th...
For more than 30 decades, data warehouses have been considered the only business intelligence storag...
There is currently a burst of Big Data (BD) processed and stored in huge raw data repositories, comm...
There is currently a burst of Big Data (BD) processed and stored in huge raw data repositories, comm...
International audienceData lakes have emerged as an alternative to data warehouses for the storage, ...
In addition to volume and velocity, Big data is also characterized by its variety. Variety in struct...
In addition to volume and velocity, Big data is also characterized by its variety. Variety in struct...
Although big data is being discussed for some years, it still has many research challenges, such as ...
International audienceOver the past decade, the data lake concept has emerged as an alternative to d...
47th International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2...
International audienceIn 2010, the concept of data lake emerged as an alternative to data warehouses...
To prevent data lakes from being invisible and inaccessible to users, an efficient metadata manageme...
The heterogeneity of sources in Big Data systems requires new integration approaches which can handl...
As the challenge of our time, Big Data still has many research hassles, especially the variety of da...
International audienceThe rise of big data has revolutionized data exploitation practices and led to...
Metadata have always played a key role in favoring the cooperation of heterogeneous data sources. Th...
For more than 30 decades, data warehouses have been considered the only business intelligence storag...