WikiSearch is an information retrieval system (based on the vector space model) that can be used for searching Wikipedia, one of the largest knowledge bases in the world. Clustering techniques are utilized to group semantically related documents and improve the efficiency of the search system. Clustering allows relevant documents that do not match a query’s explicit form to be retrieved. Cluster labels are automatically generated using document features to provide a faceted browsing service for exploration and discovery. We also propose a storage scheme for creating and managing inverted index and clustering information using a NoSQL database. Finally, performance results are provided for the search system
The need for immediate and accurate access to the current literature at one side and t...
This paper discusses the issues involved in the design of a complete information retrieval system ba...
This thesis presents new methods for classification and thematic grouping of billions of web pages, ...
Users of search systems are often reluctant to explicitly build profiles to indicate their search in...
This article describes an improvement for K-means algorithm and its application in the form of a sys...
This thesis presents a system for web-based information retrieval that supports precise and informat...
People use web search engines to fill a wide variety of navigational, informational and transactiona...
There are many opportunities to improve the interactivity of information retrieval systems beyond th...
Successful knowledge management requires efficient tools to manage information in the form of text. ...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Today, various types of vast amount of information have been publishing on the World Wide Web. To di...
Reflecting the rapid growth of science, technology, and culture, it has become common practice to co...
The increasing availability of documents in digital form, together with the corresponding volume of ...
Cette thèse propose un système d'aide à l'indexation et à la recherche de documents pédagogiques fon...
The need for immediate and accurate access to the current literature at one side and t...
This paper discusses the issues involved in the design of a complete information retrieval system ba...
This thesis presents new methods for classification and thematic grouping of billions of web pages, ...
Users of search systems are often reluctant to explicitly build profiles to indicate their search in...
This article describes an improvement for K-means algorithm and its application in the form of a sys...
This thesis presents a system for web-based information retrieval that supports precise and informat...
People use web search engines to fill a wide variety of navigational, informational and transactiona...
There are many opportunities to improve the interactivity of information retrieval systems beyond th...
Successful knowledge management requires efficient tools to manage information in the form of text. ...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Today, various types of vast amount of information have been publishing on the World Wide Web. To di...
Reflecting the rapid growth of science, technology, and culture, it has become common practice to co...
The increasing availability of documents in digital form, together with the corresponding volume of ...
Cette thèse propose un système d'aide à l'indexation et à la recherche de documents pédagogiques fon...
The need for immediate and accurate access to the current literature at one side and t...
This paper discusses the issues involved in the design of a complete information retrieval system ba...
This thesis presents new methods for classification and thematic grouping of billions of web pages, ...