Searching text or documents in large unstructured and semi-structured data sources is not trivial. A search engine is supposed to make more search efficient and effective. It supports to build a query that can be applied automatically to extract the information that complies with the user’s intention. Controlled vocabularies and ontologies help improving the search and make it domain-aware. In this document, we explain the notion of a controlled vocabulary, its construction methods and its use in smart search engines. Manual construction of controlled vocabularies and ontologies can be achieved using several existing tools,which require specific technical skills. Therefore, we refer to the ROC+ tool, developed within WFBR, which helps domai...
We describe a novel approach to precise searching in the full content of digital libraries. The Sear...
In this paper we present how resources and tools developed within the Human Language Technology Grou...
The unavailability of very large corpora with semantically disambiguated words is a major limitation...
Searching text or documents in large unstructured and semi-structured data sources is not trivial. A...
The question whether controlled vocabulary or natural language free-text terms is the most effectiv...
Search providers in domains from medicine to news have long labelled documents with controlled vocab...
Semantic processing system (SPS) is a system that performs phrase search of web content. SPS takes a...
Keyword searching and controlled vocabularies such as Library of Congress subject headings (LCSH) pr...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
Purpose: Controlled vocabularies play an important role in information retrieval. Numerous studies h...
This paper presents a natural language interface system to an Internet search engine that provides t...
Studies on ontologies are receiving a growing attention due to their well-known nature of explicit k...
Controlled vocabularies are one of the most essential components in achieving machine-actionability ...
Traditional search techniques establish a direct connection between the information provided by user...
Domain-specific search engines are becoming increasingly popular because they offer increased accura...
We describe a novel approach to precise searching in the full content of digital libraries. The Sear...
In this paper we present how resources and tools developed within the Human Language Technology Grou...
The unavailability of very large corpora with semantically disambiguated words is a major limitation...
Searching text or documents in large unstructured and semi-structured data sources is not trivial. A...
The question whether controlled vocabulary or natural language free-text terms is the most effectiv...
Search providers in domains from medicine to news have long labelled documents with controlled vocab...
Semantic processing system (SPS) is a system that performs phrase search of web content. SPS takes a...
Keyword searching and controlled vocabularies such as Library of Congress subject headings (LCSH) pr...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
Purpose: Controlled vocabularies play an important role in information retrieval. Numerous studies h...
This paper presents a natural language interface system to an Internet search engine that provides t...
Studies on ontologies are receiving a growing attention due to their well-known nature of explicit k...
Controlled vocabularies are one of the most essential components in achieving machine-actionability ...
Traditional search techniques establish a direct connection between the information provided by user...
Domain-specific search engines are becoming increasingly popular because they offer increased accura...
We describe a novel approach to precise searching in the full content of digital libraries. The Sear...
In this paper we present how resources and tools developed within the Human Language Technology Grou...
The unavailability of very large corpora with semantically disambiguated words is a major limitation...