In this paper we present two methodologies for rapidly inducing multiple subject-specific taxonomies from crawled data. The first method involves a sentence-level words co-occurrence frequency method for building the taxonomy, while the second involves the bootstrapping of a Word2Vec based algorithm with a directed crawler. We exploit the multilingual open-content directory of the World Wide Web, DMOZ 1 to seed the crawl, and the domain name to direct the crawl. This domain corpus is then input to our algorithm that can automatically induce taxonomies. The induced taxonomies provide hierarchical semantic dimensions for the purposes of faceted browsing. As part of an ongoing personal semantics project, we applied the resulting taxonomies to ...
Building taxonomies for Web content manually is costly and time-consuming. An alternative is to allo...
We present a system for taxonomy construction that reached the first place in all subtasks of the Se...
This archive contains a collection of computational models called word embeddings. These are vectors...
In this paper we present two methodologies for rapidly inducing multiple subject-specific taxonomies...
We introduce EXTASEM!, a novel approach for the automatic learning of lexical taxonomies from domain...
We introduce ExTaSem!, a novel approach for the automatic learning of lexical taxonomies from domain...
Users of the web are currently the main content authors. Social networks form a valuable source of l...
In this paper we present a graph-based approach aimed at learning a lexical taxonomy automatically s...
Semantic taxonomies are powerful tools that provide structured knowledge to Natural Language Process...
We propose a novel algorithm for inducing semantic taxonomies. Previous algorithms for taxonomy indu...
Knowledge structures, such as taxonomies, are key to the organization and management of Web content,...
With the rise of social media, learning from informal text has become increasingly important. We pre...
Collaborative tagging becomes a common feature of current web sites, facilitating ordinary users to ...
In this paper we investigate whether it is possible to create a computational approach that allows u...
Collaborative tagging allows users to tag online resources. We refer to the large database of tags a...
Building taxonomies for Web content manually is costly and time-consuming. An alternative is to allo...
We present a system for taxonomy construction that reached the first place in all subtasks of the Se...
This archive contains a collection of computational models called word embeddings. These are vectors...
In this paper we present two methodologies for rapidly inducing multiple subject-specific taxonomies...
We introduce EXTASEM!, a novel approach for the automatic learning of lexical taxonomies from domain...
We introduce ExTaSem!, a novel approach for the automatic learning of lexical taxonomies from domain...
Users of the web are currently the main content authors. Social networks form a valuable source of l...
In this paper we present a graph-based approach aimed at learning a lexical taxonomy automatically s...
Semantic taxonomies are powerful tools that provide structured knowledge to Natural Language Process...
We propose a novel algorithm for inducing semantic taxonomies. Previous algorithms for taxonomy indu...
Knowledge structures, such as taxonomies, are key to the organization and management of Web content,...
With the rise of social media, learning from informal text has become increasingly important. We pre...
Collaborative tagging becomes a common feature of current web sites, facilitating ordinary users to ...
In this paper we investigate whether it is possible to create a computational approach that allows u...
Collaborative tagging allows users to tag online resources. We refer to the large database of tags a...
Building taxonomies for Web content manually is costly and time-consuming. An alternative is to allo...
We present a system for taxonomy construction that reached the first place in all subtasks of the Se...
This archive contains a collection of computational models called word embeddings. These are vectors...