Summarization: Recently, there has been increasing interest in extending relational query processing to include data obtained from unstructured sources. A common approach is to use stand-alone Information Extraction (IE) techniques to identify and label entities within blocks of text; the resulting entities are then imported into a standard database and processed using relational queries. This two-part approach, however, suffers from two main drawbacks. First, IE is inherently probabilistic, but traditional query processing does not properly handle probabilistic data, resulting in reduced answer quality. Second, performance inefficiencies arise due to the separation of IE from query processing. In this paper, we address these two problems b...
This paper proposes a new approach for approximate evaluation of #P-hard queries with probabilistic ...
Probabilistic databases have received considerable attention recently due to the need for storing un...
Incorporating probabilities into the semantics of incomplete databases has posed many challenges, fo...
Summarization: Unstructured text represents a large fraction of the world’s data. It often contains ...
Summarization: In the database community, work on information extraction (IE) has centered on two th...
Probabilistic Databases (PDBs) lie at the expressive intersection of databases, first-order logic, a...
Unstructured data like emails, addresses, invoices, call transcripts, reviews, and press releases ar...
AbstractMany applications today need to manage uncertain data, such as information extraction (IE), ...
During the past few years, the number of applications that need to process large-scale data has grow...
Probabilistic data and knowledge bases are becoming increasingly important in academia and industry....
Over the past decade, the two research areas of probabilistic databases and probabilistic programmin...
Abstract—Many applications today need to manage data that is uncertain, such as information extracti...
Abstract Probabilistic inference over large data sets is an increasingly important data management c...
Summarization: Recent entity resolution approaches exhibit benefits when addressing the problem thro...
Although information extraction and data mining appear together in many applications, their interfac...
This paper proposes a new approach for approximate evaluation of #P-hard queries with probabilistic ...
Probabilistic databases have received considerable attention recently due to the need for storing un...
Incorporating probabilities into the semantics of incomplete databases has posed many challenges, fo...
Summarization: Unstructured text represents a large fraction of the world’s data. It often contains ...
Summarization: In the database community, work on information extraction (IE) has centered on two th...
Probabilistic Databases (PDBs) lie at the expressive intersection of databases, first-order logic, a...
Unstructured data like emails, addresses, invoices, call transcripts, reviews, and press releases ar...
AbstractMany applications today need to manage uncertain data, such as information extraction (IE), ...
During the past few years, the number of applications that need to process large-scale data has grow...
Probabilistic data and knowledge bases are becoming increasingly important in academia and industry....
Over the past decade, the two research areas of probabilistic databases and probabilistic programmin...
Abstract—Many applications today need to manage data that is uncertain, such as information extracti...
Abstract Probabilistic inference over large data sets is an increasingly important data management c...
Summarization: Recent entity resolution approaches exhibit benefits when addressing the problem thro...
Although information extraction and data mining appear together in many applications, their interfac...
This paper proposes a new approach for approximate evaluation of #P-hard queries with probabilistic ...
Probabilistic databases have received considerable attention recently due to the need for storing un...
Incorporating probabilities into the semantics of incomplete databases has posed many challenges, fo...