Abstract. This paper proposes an automatic method for extracting information from academic conference Web pages, and organizes these information as on-tologies, then matches these ontologies to the academic linked data. The main contributions include: (1) A page segmentation algorithm is proposed to divide conference Web pages into text blocks. (2) According to vision, key words and other text features, all text blocks are classified as 10 categories using bayes network model. The context information of text blocks are introduced to repair the initial classified results, which are improved to 96 % precision and 98 % re-call. (3) An ontology is generated for each conference website, then all ontolo-gies are matched as an academic linked data
In this paper, we describe a system to collect information about academic affiliation (organisations...
The aim of this paper is to introduce a conference platform that will provide scientists with an eas...
While the explosive increase in information published on the Web, researchers have to filter informa...
Traditional information extraction methods mainly rely on visual feature assisted techniques; but wi...
The world today witnessed an important transmission to the virtual world across the web. After years...
From the importance of the conference and its constructive role in the studies discussion, there mus...
We address the problem of academic conference homepage understanding for the Semantic Web. This prob...
Data extraction from web document is becoming more popular and widely used for many tasks. The obje...
International audienceThis paper deals with the automation of ontology building process from HTML pa...
The Semantic Web Dog Food (SWDF) is the reference linked dataset of the Semantic Web community about...
Web-based, free-text documents on science and technology have been increasing growing on the web. Ho...
We study possibilities to automatically extract information from the Internet, by structuring and co...
BACKGROUND: Web-based, free-text documents on science and technology have been increasing growing on...
none4siThe SemanticWeb Dog Food (SWDF) is the reference linked dataset of the Semantic Web community...
Abstract. The amount of electronically stored textual information is continuously increasing both on...
In this paper, we describe a system to collect information about academic affiliation (organisations...
The aim of this paper is to introduce a conference platform that will provide scientists with an eas...
While the explosive increase in information published on the Web, researchers have to filter informa...
Traditional information extraction methods mainly rely on visual feature assisted techniques; but wi...
The world today witnessed an important transmission to the virtual world across the web. After years...
From the importance of the conference and its constructive role in the studies discussion, there mus...
We address the problem of academic conference homepage understanding for the Semantic Web. This prob...
Data extraction from web document is becoming more popular and widely used for many tasks. The obje...
International audienceThis paper deals with the automation of ontology building process from HTML pa...
The Semantic Web Dog Food (SWDF) is the reference linked dataset of the Semantic Web community about...
Web-based, free-text documents on science and technology have been increasing growing on the web. Ho...
We study possibilities to automatically extract information from the Internet, by structuring and co...
BACKGROUND: Web-based, free-text documents on science and technology have been increasing growing on...
none4siThe SemanticWeb Dog Food (SWDF) is the reference linked dataset of the Semantic Web community...
Abstract. The amount of electronically stored textual information is continuously increasing both on...
In this paper, we describe a system to collect information about academic affiliation (organisations...
The aim of this paper is to introduce a conference platform that will provide scientists with an eas...
While the explosive increase in information published on the Web, researchers have to filter informa...