In this paper we present CrumbTrail, an algorithm to clean large and dense knowledge graphs. CrumbTrail removes cycles, out-of-domain nodes and non-essential nodes, i.e., those that can be safely removed without breaking the knowledge graph’s connectivity. It achieves this through a bottom-up topological pruning on the basis of a set of input concepts that, for instance, a user can select in order to identify a domain of interest. Our technique can be applied to both noisy hypernymy graphs – typically generated by ontology learning algorithms as intermediate representations – as well as crowdsourced resources like Wikipedia, in order to obtain clean, domain-focused concept hierarchies. CrumbTrail overcomes the time and space complexity limi...
The spread and abundance of electronic documents requires automatic techniques for extracting useful...
( † = corresponding author) Abstract. We consider the problem of identifying inherited content in k...
Knowledge graphs are used to represent relational information in terms of triples. To enable learnin...
In this paper we present CrumbTrail, an algorithm to clean large and dense knowledge graphs. CrumbTr...
In this paper we present an efficient and highly accurate algorithm to prune noisy or over-ambiguous...
In this paper, we present ContrastMedium, an algorithm that transforms noisy semantic networks into ...
In 2004 we published in this journal an article describing OntoLearn, one of the first systems to au...
9th International ACM Web Science Conference 2017, held from June 26 to June 28, 2017 in Troy, NY (U...
In 2004 we published in this journal an article describing OntoLearn, one of the first systems to au...
Knowledge graphs are a way to represent complex structured and unstructured information integrated ...
Abstract. The spread and abundance of electronic documents requires automatic techniques for extract...
Domain knowledge plays a significant role in powering a number of intelligent applications such as e...
We present MKGDB, a large-scale graph database created as a combination of multiple taxonomy backbon...
Conventional representation learning algorithms for knowledge graphs (KG) map each entity to a uniqu...
In modern machine learning,raw data is the preferred input for our models. Where a decade ago data s...
The spread and abundance of electronic documents requires automatic techniques for extracting useful...
( † = corresponding author) Abstract. We consider the problem of identifying inherited content in k...
Knowledge graphs are used to represent relational information in terms of triples. To enable learnin...
In this paper we present CrumbTrail, an algorithm to clean large and dense knowledge graphs. CrumbTr...
In this paper we present an efficient and highly accurate algorithm to prune noisy or over-ambiguous...
In this paper, we present ContrastMedium, an algorithm that transforms noisy semantic networks into ...
In 2004 we published in this journal an article describing OntoLearn, one of the first systems to au...
9th International ACM Web Science Conference 2017, held from June 26 to June 28, 2017 in Troy, NY (U...
In 2004 we published in this journal an article describing OntoLearn, one of the first systems to au...
Knowledge graphs are a way to represent complex structured and unstructured information integrated ...
Abstract. The spread and abundance of electronic documents requires automatic techniques for extract...
Domain knowledge plays a significant role in powering a number of intelligent applications such as e...
We present MKGDB, a large-scale graph database created as a combination of multiple taxonomy backbon...
Conventional representation learning algorithms for knowledge graphs (KG) map each entity to a uniqu...
In modern machine learning,raw data is the preferred input for our models. Where a decade ago data s...
The spread and abundance of electronic documents requires automatic techniques for extracting useful...
( † = corresponding author) Abstract. We consider the problem of identifying inherited content in k...
Knowledge graphs are used to represent relational information in terms of triples. To enable learnin...