This thesis addresses the tasks of concept disambiguation and clustering. Concept disambiguation is the task of linking common nouns and proper names in a text – henceforth called mentions – to their corresponding concepts in a predefined inventory. Concept clustering is the task of clustering mentions, so that all mentions in one cluster denote the same concept. In this thesis, we investigate concept disambiguation and clustering from a discourse perspective and propose a discourse-aware approach for joint concept disambiguation and clustering in the framework of Markov logic. The contributions of this thesis are fourfold: Joint Concept Disambiguation and Clustering. In previous approaches, concept disambiguation and concept clustering ...
Terms are linguistic signifiers of domain–specific concepts. Semantic similarity between terms refer...
Alaçam Ö, Schüz S, Wegrzyn M, Kißler J, Zarrieß S. Exploring Semantic Spaces for Detecting Clusterin...
Most document clustering algorithms operate in a high dimensional bag-of-words space. The inherent p...
International audienceWe applied different clustering algorithms to the task of clus- tering multi-w...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
An automated method for clustering terms/concepts from a set of documents on the same topic was deve...
International audienceWe consider a challenging clustering task: the clustering of multi-word terms ...
thesisTerm co-occurrence data has been extensively used in many applications ranging from informatio...
This paper presents work in progress on clustering methods that identify semantic concepts in a docu...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
In the information age we much depend on our ability to find information hidden in mostly unstructur...
Cimiano P, Hotho A, Staab S. Clustering Concept Hierarchies from Text. In: Proceedings of the Confe...
We consider a challenging clustering task: the clustering of muti-word terms without document co-occ...
Concepts play a central role in many applications. This includes settings where concepts have to be ...
Abstract- Usually in text mining techniques the basic measures like term frequency of a term (word o...
Terms are linguistic signifiers of domain–specific concepts. Semantic similarity between terms refer...
Alaçam Ö, Schüz S, Wegrzyn M, Kißler J, Zarrieß S. Exploring Semantic Spaces for Detecting Clusterin...
Most document clustering algorithms operate in a high dimensional bag-of-words space. The inherent p...
International audienceWe applied different clustering algorithms to the task of clus- tering multi-w...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
An automated method for clustering terms/concepts from a set of documents on the same topic was deve...
International audienceWe consider a challenging clustering task: the clustering of multi-word terms ...
thesisTerm co-occurrence data has been extensively used in many applications ranging from informatio...
This paper presents work in progress on clustering methods that identify semantic concepts in a docu...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
In the information age we much depend on our ability to find information hidden in mostly unstructur...
Cimiano P, Hotho A, Staab S. Clustering Concept Hierarchies from Text. In: Proceedings of the Confe...
We consider a challenging clustering task: the clustering of muti-word terms without document co-occ...
Concepts play a central role in many applications. This includes settings where concepts have to be ...
Abstract- Usually in text mining techniques the basic measures like term frequency of a term (word o...
Terms are linguistic signifiers of domain–specific concepts. Semantic similarity between terms refer...
Alaçam Ö, Schüz S, Wegrzyn M, Kißler J, Zarrieß S. Exploring Semantic Spaces for Detecting Clusterin...
Most document clustering algorithms operate in a high dimensional bag-of-words space. The inherent p...