The Concordia INdexing and DIscovery system (CINDI) is an information discovery and retrieval system that enables a reader to discover resources in a bibliographic database. It describes each information resource with a metadata description called a semantic header, whose content includes the title, author name, subject and sub-subject, etc. The Automatic Semantic Header Generator (ASHG) automatically generates a draft version of the semantic header from a resource. The existing system handles four document formats: HTML, TEXT, LATEX, and RTF. Since more and more people use PDF for document exchange and for reading online or in print, owing to its ease of use and cross-platform portability, more documents are publ...
Machine-understandable data constitutes the foundation for the Semantic Web. This paper presents a ...
Paper presented at the Language Resources and Evaluation Conference (LREC) 2018, held on ...
Indexing large bodies of data is necessary to enable satisfactory search results. Ontologies serve a...
The problem of indexing and retrieval of electronic information resources becomes more critical as t...
As the amount of information and the number of Internet users grow, the problem of indexing and retr...
The Concordia INdexing and DIscovery system (CINDI) is an indexing system. It enables a user to inde...
Accurate representation of electronic information on the Internet underlies a solid foundation for p...
Title: A Tool for Transformation of PDF to Text Author: Jonáš Bujok Department: Institute of Formal ...
We introduce PDFMEF, a multi-entity knowledge extraction framework for scholarly documents in the P...
The Semantic Web is a vision of the Internet's future, where machines and humans can understand the s...
This thesis explores the domain of document analysis and document classification within the PDF docu...
In digital libraries, a table, as a specific document component as well as a condensed way to presen...
This paper evaluates the performance of tools for the extraction of metadata from scientific article...