Large pre-trained neural networks are ubiquitous and critical to the success of many downstream tasks in natural language processing and computer vision. However, within the field of web information retrieval, there is a stark contrast in the lack of similarly flexible and powerful pre-trained models that can properly parse webpages. Consequently, we believe that common machine learning tasks like content extraction and information mining from webpages have low-hanging gains that yet remain untapped. We aim to close the gap by introducing an agnostic deep graph neural network feature extractor that can ingest webpage structures, pre-train self-supervised on massive unlabeled data, and fine-tune to arbitrary tasks on webpages effectually. ...
Recently, neural network based methods have shown their power in learning more expressive features o...
An artificial neural network model, capable of processing general types of graph structured data, ha...
There has been a rising interest in graph neural networks (GNNs) for representation learning over th...
This paper tackles the under-explored problem of DOM tree element representation learning. We advanc...
Deep learning has become the dominant approach in coping with various tasks in Natural LanguageProce...
In this paper, we propose mining the growing amount of information present on the internet in the fo...
© 2020 Yimeng DaiThe number of webpages is growing exponentially, which results in a great volume of...
Graph Neural Networks (GNNs) are a promising deep learning approach for circumventing many real-worl...
In recent years, the usage of the Internet has increased tremendously, and the total number of web p...
International audienceThe keep-growing content of Web images is probably the next important data sou...
Recently, text classification model based on graph neural network (GNN) has attracted more and more ...
This paper utilizes Ant-Miner - the first Ant Colony algorithm for discovering classification rules ...
Fast-developing fields such as Artificial Intelligence (AI) often outpace the efforts of encyclopedi...
Part 3: Ontology-Web and Social Media AI Modeling (OWESOM)International audienceWeb wrappers are sys...
Web mining related exploration is getting the chance to be more essential these days in view of the ...
Recently, neural network based methods have shown their power in learning more expressive features o...
An artificial neural network model, capable of processing general types of graph structured data, ha...
There has been a rising interest in graph neural networks (GNNs) for representation learning over th...
This paper tackles the under-explored problem of DOM tree element representation learning. We advanc...
Deep learning has become the dominant approach in coping with various tasks in Natural LanguageProce...
In this paper, we propose mining the growing amount of information present on the internet in the fo...
© 2020 Yimeng DaiThe number of webpages is growing exponentially, which results in a great volume of...
Graph Neural Networks (GNNs) are a promising deep learning approach for circumventing many real-worl...
In recent years, the usage of the Internet has increased tremendously, and the total number of web p...
International audienceThe keep-growing content of Web images is probably the next important data sou...
Recently, text classification model based on graph neural network (GNN) has attracted more and more ...
This paper utilizes Ant-Miner - the first Ant Colony algorithm for discovering classification rules ...
Fast-developing fields such as Artificial Intelligence (AI) often outpace the efforts of encyclopedi...
Part 3: Ontology-Web and Social Media AI Modeling (OWESOM)International audienceWeb wrappers are sys...
Web mining related exploration is getting the chance to be more essential these days in view of the ...
Recently, neural network based methods have shown their power in learning more expressive features o...
An artificial neural network model, capable of processing general types of graph structured data, ha...
There has been a rising interest in graph neural networks (GNNs) for representation learning over th...