Abstract. A large number of e-commerce websites have started to markup their products using standards such as Microdata, Microfor-mats, and RDFa. However, the markup is mostly not as fine-grained as desirable for applications and mostly consists of free text properties. This paper discusses the challenges that arise in the task of matching descriptions of electronic products from several thousand e-shops that o↵er Microdata markup. Specifically, our goal is to extract product at-tributes from product o↵ers, by means of regular expressions, in order to build well structured product specifications. For this purpose we present a technique for learning regular expressions. We evaluate our attribute extraction approach using 1.9 million product ...
Abstract. We develop an unsupervised learning framework for extract-ing popular product attributes f...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
International audienceRegular expressions (REs) are a widely used tool when con- sidering textual da...
A large number of e-commerce websites have started tomarkup their products using standards such as M...
Comparison shopping portals integrate product offers from large numbers of e-shops in order to suppo...
Large numbers of websites have started to markup their content using standards such as Microdata, Mi...
Resources of professional companies operating on the medical services market contain data from a hug...
Online product search engines such as Google and Yahoo shopping, rely on having extensive and comple...
A “marketplace” is an e-commerce medium where product and inventory information is provided by varyi...
A large class of entity extraction tasks from text that is either semistructured or fully unstructur...
On-line retailers as well as e-shoppers are very interested in gathering product records from the We...
International audienceFeature Models (FMs) are used extensively in software product line engineering...
Technology world has greatly evolved over the past decades, which led to inflated data volume. This ...
This list of guides was provided as a supporting document as part of the presentation, "E-Resource C...
We propose a system for the automatic generation of regular expressions for text-extraction tasks. T...
Abstract. We develop an unsupervised learning framework for extract-ing popular product attributes f...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
International audienceRegular expressions (REs) are a widely used tool when con- sidering textual da...
A large number of e-commerce websites have started tomarkup their products using standards such as M...
Comparison shopping portals integrate product offers from large numbers of e-shops in order to suppo...
Large numbers of websites have started to markup their content using standards such as Microdata, Mi...
Resources of professional companies operating on the medical services market contain data from a hug...
Online product search engines such as Google and Yahoo shopping, rely on having extensive and comple...
A “marketplace” is an e-commerce medium where product and inventory information is provided by varyi...
A large class of entity extraction tasks from text that is either semistructured or fully unstructur...
On-line retailers as well as e-shoppers are very interested in gathering product records from the We...
International audienceFeature Models (FMs) are used extensively in software product line engineering...
Technology world has greatly evolved over the past decades, which led to inflated data volume. This ...
This list of guides was provided as a supporting document as part of the presentation, "E-Resource C...
We propose a system for the automatic generation of regular expressions for text-extraction tasks. T...
Abstract. We develop an unsupervised learning framework for extract-ing popular product attributes f...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
International audienceRegular expressions (REs) are a widely used tool when con- sidering textual da...