With 13,000,000 volumes comprising 4.5 billion pages of text, it is currently very difficult for scholars to locate relevant sets of documents that are useful in their research from the HathiTrust Digital Libary (HTDL) using traditional lexically-based retrieval techniques. Existing document search tools and document clustering approaches use purely lexical analysis, which cannot address the inherent ambiguity of natural language. A semantic search approach offers the potential to overcome the shortcoming of lexical search, but even if an appropriate network of ontologies could be decided upon it would require a full semantic markup of each document. In this paper, we present a conceptual design and report on the initial implementation of a...
Search systems help users locate relevant information in the form of text documents for keyword quer...
The ongoing astounding growth of text data has created an enormous need for fast and efficient Text ...
Processing of unstructured documents according to their content is required in many disciplines; e.g...
With 13,000,000 volumes comprising 4.5 billion pages of text, it is currently very difficult for sch...
With 13,000,000 volumes comprising 4.5 billion pages of text, it is currently very difficult for sch...
With 13,000,000 volumes comprising 4.5 billion pages of text, it is currently very difficult for sch...
Most existing digital libraries use traditional lexically-based retrieval techniques. For establishe...
Most existing digital libraries use traditional lexically-based retrieval techniques. For establishe...
Most existing digital libraries use traditional lexically-based retrieval techniques. For establishe...
The Capisco project developed a suite of tools that analyze documents by the semantics of their cont...
International audienceThe exponential growth of available electronic data is almost useless without ...
International audienceThe exponential growth of available electronic data is almost useless without ...
AbstractIn a World Wide Web continuously growing the need for searching information keeps also growi...
For scientists and researchers, it is very critical to ensure knowledge is accessible for re-use and...
For scientists and researchers, it is very critical to ensure knowledge is accessible for re-use and...
Search systems help users locate relevant information in the form of text documents for keyword quer...
The ongoing astounding growth of text data has created an enormous need for fast and efficient Text ...
Processing of unstructured documents according to their content is required in many disciplines; e.g...
With 13,000,000 volumes comprising 4.5 billion pages of text, it is currently very difficult for sch...
With 13,000,000 volumes comprising 4.5 billion pages of text, it is currently very difficult for sch...
With 13,000,000 volumes comprising 4.5 billion pages of text, it is currently very difficult for sch...
Most existing digital libraries use traditional lexically-based retrieval techniques. For establishe...
Most existing digital libraries use traditional lexically-based retrieval techniques. For establishe...
Most existing digital libraries use traditional lexically-based retrieval techniques. For establishe...
The Capisco project developed a suite of tools that analyze documents by the semantics of their cont...
International audienceThe exponential growth of available electronic data is almost useless without ...
International audienceThe exponential growth of available electronic data is almost useless without ...
AbstractIn a World Wide Web continuously growing the need for searching information keeps also growi...
For scientists and researchers, it is very critical to ensure knowledge is accessible for re-use and...
For scientists and researchers, it is very critical to ensure knowledge is accessible for re-use and...
Search systems help users locate relevant information in the form of text documents for keyword quer...
The ongoing astounding growth of text data has created an enormous need for fast and efficient Text ...
Processing of unstructured documents according to their content is required in many disciplines; e.g...