Product matching is a central task within e-commerce applications such as price comparison portals and online market places. State-of-the-art product matching methods achieve F1 scores above 0.90 using deep learning techniques combined with huge amounts of training data (e.g > 100K pairs of offers). Gathering and maintaining such large training corpora is costly, as it implies labeling pairs of offers as matches or non-matches. Acquiring the ability to be good at product matching thus means a major investment for an e-commerce company. This paper shows that the manual labeling of training data for product matching can be replaced by relying exclusively on schema.org annotations gathered from the public Web. We show that using only schema.or...
Comparison shopping portals integrate product offers from large numbers of e-shops in order to suppo...
Schema matching is the process of establishing correspondences between the attributes of database sc...
the date of receipt and acceptance should be inserted later Abstract Web-scale data integration invo...
Product matching is a central task within e-commerce applications such as price comparison portals a...
Product matching is the task of deciding whether two product descriptions refer to the same real-wor...
Deep neural networks are increasingly used for tasks such as entity resolution, sentiment analysis, ...
A current research question in the area of entity resolution (also called link discovery or duplicat...
Schema matching is a critical problem for integrating heterogeneous information sources. Traditional...
Transformer-based models like BERT have pushed the state-of the-art for a wide range of tasks in nat...
Transformer-based entity matching methods have significantly moved the state of the art for less-str...
Contrastive learning has moved the state of the art for many tasks in computer vision and informatio...
Schema/ontology matching consists in finding matches between types, properties and entities in heter...
Schema matching has been a researched topic for over 20 years. Therefore, many schema matching solut...
Discovering correspondences between schema elements is a crucial task for data integration. Most mat...
An e-commerce catalog typically comprises of specifications for millions of products. The search eng...
Comparison shopping portals integrate product offers from large numbers of e-shops in order to suppo...
Schema matching is the process of establishing correspondences between the attributes of database sc...
the date of receipt and acceptance should be inserted later Abstract Web-scale data integration invo...
Product matching is a central task within e-commerce applications such as price comparison portals a...
Product matching is the task of deciding whether two product descriptions refer to the same real-wor...
Deep neural networks are increasingly used for tasks such as entity resolution, sentiment analysis, ...
A current research question in the area of entity resolution (also called link discovery or duplicat...
Schema matching is a critical problem for integrating heterogeneous information sources. Traditional...
Transformer-based models like BERT have pushed the state-of the-art for a wide range of tasks in nat...
Transformer-based entity matching methods have significantly moved the state of the art for less-str...
Contrastive learning has moved the state of the art for many tasks in computer vision and informatio...
Schema/ontology matching consists in finding matches between types, properties and entities in heter...
Schema matching has been a researched topic for over 20 years. Therefore, many schema matching solut...
Discovering correspondences between schema elements is a crucial task for data integration. Most mat...
An e-commerce catalog typically comprises of specifications for millions of products. The search eng...
Comparison shopping portals integrate product offers from large numbers of e-shops in order to suppo...
Schema matching is the process of establishing correspondences between the attributes of database sc...
the date of receipt and acceptance should be inserted later Abstract Web-scale data integration invo...