With the tremendous growth of unstructured data in Business Intelligence, there is a need to incorporate textual data into data warehouses, to provide appropriate multidimensional analysis (OLAP), and to develop new approaches that take the textual content of data into account. This will provide textual measures to users who wish to analyse documents online. In this paper, we propose a new aggregation function for textual data in an OLAP context. To aggregate keywords, our contribution is to use a data mining technique, such as k-means, but with a distance based on the Google similarity distance. Thus our approach considers the semantic similarity of keywords for their aggregation. The performance of our approa...
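As an illustration of the distance mentioned in the abstract, the following is a minimal sketch of the Normalized Google Distance (NGD) together with a single cluster-assignment step. It is not the authors' implementation: the hit counts, the index size `n`, and the helper `assign_to_centres` are hypothetical, and using keywords themselves as cluster centres (a k-medoids-style choice) is an assumption made here because a mean of words is not defined.

```python
import math

def ngd(fx, fy, fxy, n):
    """Normalized Google Distance from page counts.

    fx, fy: number of pages containing each term alone
    fxy:    number of pages containing both terms
    n:      total number of indexed pages (an assumed constant)
    """
    if fx <= 0 or fy <= 0 or fxy <= 0:
        return float("inf")  # terms that never co-occur are maximally distant
    lx, ly, lxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(lx, ly) - lxy) / (math.log(n) - min(lx, ly))

def assign_to_centres(keywords, centres, counts, pair_counts, n):
    """Assign each keyword to its nearest centre keyword under NGD."""
    def dist(a, b):
        if a == b:
            return 0.0
        return ngd(counts[a], counts[b], pair_counts.get(frozenset((a, b)), 0), n)
    return {k: min(centres, key=lambda c: dist(k, c)) for k in keywords}

# Hypothetical counts, for illustration only.
N = 1_000_000
counts = {"olap": 1000, "cube": 2000, "poetry": 1500, "verse": 1200}
pair_counts = {
    frozenset(("olap", "cube")): 800,
    frozenset(("olap", "poetry")): 10,
    frozenset(("verse", "poetry")): 900,
    frozenset(("verse", "cube")): 15,
    frozenset(("cube", "poetry")): 20,
}

clusters = assign_to_centres(["olap", "verse"], ["cube", "poetry"],
                             counts, pair_counts, N)
print(clusters)
```

With these made-up counts, "olap" lands with "cube" and "verse" with "poetry", because terms that co-occur often have a small NGD. A full clustering loop would also re-select the centres after each assignment step.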
The focus of this thesis is comparison of analysis of text-document similarity using clustering algo...
Document clustering techniques have been applied in several areas, with the web as one of the most r...
The spectacular increase in data is due to the appearance of networks and smartphones. Amount 42% ...
Data warehousing and On-Line Analytical Processing (OLAP) are essential elemen...
In the last decade, OnLine Analytical Processing (OLAP) has taken an increasin...
In the last decade, Online Analytical Processing (OLAP) has taken an increasin...
Text mining approaches are commonly used to discover relevant informa...
We present in this paper a system for textual aggregation from scientific docu...
Generally, text mining applications disregard the side-information contained within the text document...
Nowadays, most organizations deal with complex data having different formats a...
For a few years, on-line analysis processing (OLAP) and data mining have known parallel an...
The document similarity metric is a substantial tool applied in areas such as determining topic in ...
The advancements in the fields of mobile computing, grid computing, cloud computing, Internet of Thi...
On-line analytical processing (OLAP) is a common solution that modern enterprises use to...