Abstract—Software component repositories have adopted semi-structured data models for representing syntactic and semantic features of handled assets. Such models imply key challenges to search engines, which are related to the design of indexing techniques that ought to be efficient in terms of storage space requirements. In such a context, by applying clustering techniques before indexing component repositories, this paper proposes an approach for reducing the number of assets in the repository, and consequently, the size of index files. Based on an illustrative repository, outcomes indicate a significant optimization in the number of assets to be indexed, and, as a consequence, produces significant gains in storage requirements. Besides, ...
We review the time and storage costs of search and clustering algorithms. We exemplify these, based ...
In the field of Software Maintenance the definition of effective approaches to partition a software...
Keyword search has failed to adequately meet the needs of enterprise users. This is largely due to t...
AbstractA Software Repository is a collection of library files and function codes. Programmers and E...
Index partitioning techniques - where indexes are broken into multiple distinct sub-indexes - are a ...
<p>Data repositories and data indexes help to find the relevant datasets among those open and availa...
Better system resource utilization for search engine clusters can result in significant benefits. By...
Software repositories contain a wealth of information about the aspects related to software developm...
Careful architectural decisions are required in order to create a highly available and scalable sear...
he metric space model abstracts many proximity search problems, from nearest-neighbor classifiers to...
Managing digital information is an integral part of our society. Efficient access to data is support...
Abstract: Problem statement: To improve the performance of data retrieval in a homogeneous large XML...
A classified, or clustered file is one where related, or similar records are grouped into classes, ...
In this paper, we introduce a new collection selection strategy to be operated in search engines wit...
We review the time and storage costs of search and clustering algorithms. We exemplify these, based ...
In the field of Software Maintenance the definition of effective approaches to partition a software...
Keyword search has failed to adequately meet the needs of enterprise users. This is largely due to t...
AbstractA Software Repository is a collection of library files and function codes. Programmers and E...
Index partitioning techniques - where indexes are broken into multiple distinct sub-indexes - are a ...
<p>Data repositories and data indexes help to find the relevant datasets among those open and availa...
Better system resource utilization for search engine clusters can result in significant benefits. By...
Software repositories contain a wealth of information about the aspects related to software developm...
Careful architectural decisions are required in order to create a highly available and scalable sear...
he metric space model abstracts many proximity search problems, from nearest-neighbor classifiers to...
Managing digital information is an integral part of our society. Efficient access to data is support...
Abstract: Problem statement: To improve the performance of data retrieval in a homogeneous large XML...
A classified, or clustered file is one where related, or similar records are grouped into classes, ...
In this paper, we introduce a new collection selection strategy to be operated in search engines wit...
We review the time and storage costs of search and clustering algorithms. We exemplify these, based ...
In the field of Software Maintenance the definition of effective approaches to partition a software...
Keyword search has failed to adequately meet the needs of enterprise users. This is largely due to t...