With the growth of the Web and the wide availability of storage, more and more documents are becoming accessible. As a result, similarity learning faces a scalability problem in both memory use and computation time when the data set is large. This paper proposes a fuzzy triadic similarity measure for computing memberships in the context of document co-clustering. The measure simultaneously computes fuzzy co-similarity matrices between documents and sentences and between sentences and words, each one built on the basis of the others. The proposed model is extended to large data sets through a splitting architecture, which uses a new fuzzy triadic similarity to parallelize both memory use and computation on distributed c...
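The idea that each similarity matrix is built from the others can be illustrated with a small sketch. The code below is not the paper's measure: it is a generic iterative co-similarity scheme, assumed here for illustration only, in which sentence similarities are derived from word similarities, and document and word similarities are in turn derived from sentence similarities. The function names (`co_similarity`, `normalise`), the toy matrices, and the cosine-style normalisation to [0, 1] are all hypothetical choices.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def normalise(sim):
    """Cosine-style scaling (an assumption): entries in [0, 1], unit diagonal."""
    n = len(sim)
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            denom = (sim[i][i] * sim[j][j]) ** 0.5
            out[i][j] = min(1.0, sim[i][j] / denom) if denom > 0 else 0.0
        out[i][i] = 1.0
    return out


def co_similarity(doc_sent, sent_word, iterations=3):
    """Return (doc-doc, sentence-sentence, word-word) similarity matrices.

    doc_sent[i][j]  > 0 if document i contains sentence j.
    sent_word[j][k] > 0 if sentence j contains word k.
    """
    ww = identity(len(sent_word[0]))  # start: each word is similar only to itself
    for _ in range(iterations):
        # Sentences are similar if they contain similar words.
        ss = normalise(matmul(matmul(sent_word, ww), transpose(sent_word)))
        # Documents are similar if they contain similar sentences.
        dd = normalise(matmul(matmul(doc_sent, ss), transpose(doc_sent)))
        # Words are similar if they occur in similar sentences.
        ww = normalise(matmul(matmul(transpose(sent_word), ss), sent_word))
    return dd, ss, ww


# Toy data (hypothetical): 3 documents, 4 sentences, 5 words.
sent_word = [
    [1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0],
]
doc_sent = [
    [1, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 1, 1],
]
dd, ss, ww = co_similarity(doc_sent, sent_word)
```

In this toy example, documents 0 and 1 share no sentence, yet they receive a positive similarity because their sentences share a word, while document 2 uses a disjoint vocabulary and stays at zero similarity to both. The values can be read as fuzzy degrees of similarity, which is only one possible interpretation of the memberships the abstract mentions.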
Contributed 28: Social Networks and Clustering. In the data analysis domain, data ar...
Semantic similarity is the process of identifying relevant data semantically. The traditional way of...
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6980898 This work exploits the use of a fu...
Abstract: Statistical measure of finding Similar Sentences using a novel Fuzzy clustering algorithm fr...
Abstract: Co-clustering has been defined as a way to organize simultaneously subsets of instances an...
The paper advocates the use of a new fuzzy-based clustering algorithm for document categorization. E...
Abstract: Correlation Preserving Indexing is a spectral clustering method which discovers intrinsic s...
Abstract: Clustering is a technique of collecting data into subsets in such a manner that identical ...
The objective of document clustering techniques is to assemble similar documents and segregate dissi...
This paper is accepted as a long paper with an oral presentation by the IEEE international conferenc...
This thesis investigates the performance of fuzzy clustering for dynamically discovering content rel...
Abstract: In this paper, a unified framework for clustering documents based on vocabulary overlap an...
This paper introduces fuzzy clustering algorithms that can partition objects t...
Abstract: Clustering is the problem of discovering “meaningful” groups in given data. The first and...
The constant success of the Internet has made the number of text documents in electronic form increase...