Category hierarchies often evolve at a much slower pace than the documents that reside in them. As newly available documents keep being added to a hierarchy, new topics emerge and documents within the same category become less topically cohesive. In this paper, we propose a novel automatic approach to modifying a given category hierarchy by redistributing its documents into more topically cohesive categories. The modification is achieved with three operations (namely, sprout, merge, and assign) with reference to an auxiliary hierarchy for additional semantic information; the auxiliary hierarchy covers a similar set of topics to the hierarchy to be modified. Our user study shows that the modified category hierarchy is semantically meaningful. As an ext...
Hierarchical categorization of documents is a task receiving growing interest due to the widespread ...
Hierarchical classifications are concept hierarchies used to organize large amounts of documents. Fi...
Hierarchies have long been used for organization, summarization, and access to information. In this ...
Hierarchical models have been shown to be effective in content classification. However, the performa...
Category hierarchy is an abstraction mechanism for efficiently managing large-scale resources. In an...
Hierarchical models have been shown to be effective in content classification. However, we observe t...
Most of the research on text categorization has focused on classifying text documents into a set of ...
This paper presents a means of automatically deriving a hierarchical organization of concepts from a...
The need to classify text documents within topic hierarchies has given rise to techniques that use t...
Hierarchies are a natural way for people to organize information, as reflected by the common use of ...
Document hierarchies provide views of a collection at different levels of granularity, making it eas...
The management of hierarchically organized data is starting to play a key role in the knowledge manag...
Preserving the user’s preference in document-category management is essential because it affects his...
In this paper, the problem of classifying HTML documents into a hierarchy of categories is invest...