Abstract The rapid proliferation of the World Wide Web has increased the importance and prevalence of text as a medium for dissemination of information. A variety of text mining and management algorithms have been developed in recent years such as clustering, classification, indexing, and similarity search. Almost all these applications use the well-known vector-space model for text representation and analysis. While the vector-space model has proven itself to be an effective and efficient representation for mining purposes, it does not preserve information about the ordering of the words in the representation. In this paper, we will introduce the concept of distance graph representations of text data. Such representations preserve informat...
International audienceIn this paper we revisit this main idea of co-word analysis based on the compu...
Text has been the dominant way of storing data in computer systems and sending information around th...
It is well known that supervised text classification methods need to learn from many labeled exampl...
Almost all text applications use the well known vector-space model for text representation and analy...
International audienceGraphs have been widely used as modeling tools in Natural Language Processing ...
Abstract Text Mining is a research area of retrieving high quality hidden information such as patter...
The main topic of this doctoral dissertation is the extraction of valuable in- formation associate...
Text representation models are the fundamental basis for information retrieval and text mining tasks...
Nowadays semantic information of text is used largely for text classification task instead of bag-of...
In this dissertation we introduce several novel techniques for performing data mining on web documen...
We propose a graph-based representation of text collections where the nodes are textual units such a...
In this chapter we enhance the representation of web documents by utilizing graphs instead of vector...
Abstract—In this paper, we propose an approach to explore large texts by highlighting coherent sub-p...
Over the last few years, a number of ar-eas of natural language processing have begun applying graph...
Text is the most common form of storing information. Hence clustering of text could give us some ve...
International audienceIn this paper we revisit this main idea of co-word analysis based on the compu...
Text has been the dominant way of storing data in computer systems and sending information around th...
It is well known that supervised text classification methods need to learn from many labeled exampl...
Almost all text applications use the well known vector-space model for text representation and analy...
International audienceGraphs have been widely used as modeling tools in Natural Language Processing ...
Abstract Text Mining is a research area of retrieving high quality hidden information such as patter...
The main topic of this doctoral dissertation is the extraction of valuable in- formation associate...
Text representation models are the fundamental basis for information retrieval and text mining tasks...
Nowadays semantic information of text is used largely for text classification task instead of bag-of...
In this dissertation we introduce several novel techniques for performing data mining on web documen...
We propose a graph-based representation of text collections where the nodes are textual units such a...
In this chapter we enhance the representation of web documents by utilizing graphs instead of vector...
Abstract—In this paper, we propose an approach to explore large texts by highlighting coherent sub-p...
Over the last few years, a number of ar-eas of natural language processing have begun applying graph...
Text is the most common form of storing information. Hence clustering of text could give us some ve...
International audienceIn this paper we revisit this main idea of co-word analysis based on the compu...
Text has been the dominant way of storing data in computer systems and sending information around th...
It is well known that supervised text classification methods need to learn from many labeled exampl...