Almost all text applications use the well known vector-space model for text representation and analysis. While the vector-space model has proven itself to be an effective and efficient representation for mining purposes, it does not preserve information about the order-ing of the words in the representation. In this paper, we will intro-duce the concept of distance graph representations of text data. Such representations preserve distance and ordering information between the words, and provide a much richer representation of the underlying text. This approach enables knowledge discovery from text which is not possible with the use of a pure vector-space representation, because it loses much less information about the or-dering of the underl...
Text has been the dominant way of storing data in computer systems and sending information around th...
Text similarity measurement is a fundamental issue in many textual applications such as document clu...
We propose a graph-based representation of text collections where the nodes are textual units such a...
Abstract The rapid proliferation of the World Wide Web has increased the importance and prevalence o...
Abstract Text Mining is a research area of retrieving high quality hidden information such as patter...
Text representation models are the fundamental basis for information retrieval and text mining tasks...
International audienceGraphs have been widely used as modeling tools in Natural Language Processing ...
Nowadays semantic information of text is used largely for text classification task instead of bag-of...
The main topic of this doctoral dissertation is the extraction of valuable in- formation associate...
International audienceIn this article we present a new approach for the classification of structured...
Knowledge graphs are becoming ubiquitous in many scientific and industrial domains, ranging from bio...
A common and standard approach to model text document is bag-of-words. This model is suitable for ca...
Text classification using semantic information is the latest trend of research due to its greater po...
In this chapter we enhance the representation of web documents by utilizing graphs instead of vector...
Text is the most common form of storing information. Hence clustering of text could give us some ve...
Text has been the dominant way of storing data in computer systems and sending information around th...
Text similarity measurement is a fundamental issue in many textual applications such as document clu...
We propose a graph-based representation of text collections where the nodes are textual units such a...
Abstract The rapid proliferation of the World Wide Web has increased the importance and prevalence o...
Abstract Text Mining is a research area of retrieving high quality hidden information such as patter...
Text representation models are the fundamental basis for information retrieval and text mining tasks...
International audienceGraphs have been widely used as modeling tools in Natural Language Processing ...
Nowadays semantic information of text is used largely for text classification task instead of bag-of...
The main topic of this doctoral dissertation is the extraction of valuable in- formation associate...
International audienceIn this article we present a new approach for the classification of structured...
Knowledge graphs are becoming ubiquitous in many scientific and industrial domains, ranging from bio...
A common and standard approach to model text document is bag-of-words. This model is suitable for ca...
Text classification using semantic information is the latest trend of research due to its greater po...
In this chapter we enhance the representation of web documents by utilizing graphs instead of vector...
Text is the most common form of storing information. Hence clustering of text could give us some ve...
Text has been the dominant way of storing data in computer systems and sending information around th...
Text similarity measurement is a fundamental issue in many textual applications such as document clu...
We propose a graph-based representation of text collections where the nodes are textual units such a...