A significant amount of information is expressed as the semi-structured, non-grammatical text found in auction listings and classified advertisements. There would be value in automatically fitting this type of information to relational database schemas. Work has been conducted on automatically populating such a database, and automatic schema mapping is a large area of research, but the problem of automatically generating such a schema is relatively unaddressed. Schema Discovery is the recent research thrust that addresses this. In this paper, we introduce the TESS system for knowledge-guided schema discovery from semi-structured text. TESS performs term extraction over listing text and applies semantic reasoning to differentiate common term...
Relational Keyword Search (R-KwS) systems enable naive/informal users to explore and retrieve inform...
Over the past decade, modern search engines have made significant progress towards better understand...
Data mining and data warehousing are two key technologies which have made significant contributions ...
International audienceMore and more weakly structured, and irregular data sources are becoming avail...
We present a query language for searching collections of structured text. Documents within the colle...
Schema matching is a critical problem for integrating heterogeneous information sources. Traditional...
Semistructured data is one of the new challenging research areas in the database community. We belie...
The schema of a database models the knowledge content of the database. However, database users often...
Tabular data is an abundant source of information on the Web, but remains mostly isolated from the l...
Knowledge Discovery in Databases (KDD), also known as data mining, focuses on the computerized explo...
This thesis looks into the process of automatically expanding image searches based on tags and the d...
Abstract. Although an increasing number of RDF knowledge bases are published, many of those consist ...
The emerging field of semistructured data leads to new ways of representing data as 'schemales...
The information age is characterized by a rapid growth in the amount of information available in ele...
Traditional database management requires design and ensures declarativity. In the context of semistr...
Relational Keyword Search (R-KwS) systems enable naive/informal users to explore and retrieve inform...
Over the past decade, modern search engines have made significant progress towards better understand...
Data mining and data warehousing are two key technologies which have made significant contributions ...
International audienceMore and more weakly structured, and irregular data sources are becoming avail...
We present a query language for searching collections of structured text. Documents within the colle...
Schema matching is a critical problem for integrating heterogeneous information sources. Traditional...
Semistructured data is one of the new challenging research areas in the database community. We belie...
The schema of a database models the knowledge content of the database. However, database users often...
Tabular data is an abundant source of information on the Web, but remains mostly isolated from the l...
Knowledge Discovery in Databases (KDD), also known as data mining, focuses on the computerized explo...
This thesis looks into the process of automatically expanding image searches based on tags and the d...
Abstract. Although an increasing number of RDF knowledge bases are published, many of those consist ...
The emerging field of semistructured data leads to new ways of representing data as 'schemales...
The information age is characterized by a rapid growth in the amount of information available in ele...
Traditional database management requires design and ensures declarativity. In the context of semistr...
Relational Keyword Search (R-KwS) systems enable naive/informal users to explore and retrieve inform...
Over the past decade, modern search engines have made significant progress towards better understand...
Data mining and data warehousing are two key technologies which have made significant contributions ...