We participated in the WebCLEF 2005 monolingual task. In this task, a search system aims to retrieve relevant documents from a multilingual corpus of Web documents from Web sites of European governments. Both the documents and the queries are written in a wide range of European languages. A challenge in this setting is to detect the language of documents and topics, and to process them appropriately. We develop a language specific technique for applying the correct stemming approach, as well as for removing the correct stopwords from the queries. We represent documents using three fields, namely content, title, and anchor text of incoming hyperlinks. We use a technique called per-field normalisation, which extends the Divergence From Random...
Abstract Research into unsupervised ways of stemming has resulted, in the past few years, in the dev...
Now a day’s text documents are advancing over internet, e-mails and web pages. As the use of interne...
We test the utility of European language stemmers created using the Snowball language [1]. This allo...
We participated in the WebCLEF 2005 monolingual task. In this task, a search system aims to retrieve...
This paper describes the participation of the REINA Research Group of the University of Salamanca at...
Per-field normalisation has been shown to be effective for Web search tasks, e.g. named-page finding...
Se describe la participación del Grupo de Investigación REINA de la Universidad de Salamanca en foro...
Traditionally, stemming has been applied to Information Retrieval tasks by transforming words in doc...
This report presents results for the TREC 2009 adhoc task, the diversity task, and the relevance fee...
Experiments with a multi-lingual web collection are presented. The EuroGOV corpus is the first multi...
Abstract. This paper reports on a statistical stemming algorithm based on link analysis. Considering...
Abstract. The paper describes statistical methods and experiments for stemming and for the translati...
The University of Exeter group participated in the monolingual, bilingual and multilingual-4 retriev...
We describe our participation in the TREC 2003 Robust and Web tracks. For the Robust track, we exp...
Hummingbird participated in the WebCLEF mixed monolingual retrieval task of the Cross-Language Evalu...
Abstract Research into unsupervised ways of stemming has resulted, in the past few years, in the dev...
Now a day’s text documents are advancing over internet, e-mails and web pages. As the use of interne...
We test the utility of European language stemmers created using the Snowball language [1]. This allo...
We participated in the WebCLEF 2005 monolingual task. In this task, a search system aims to retrieve...
This paper describes the participation of the REINA Research Group of the University of Salamanca at...
Per-field normalisation has been shown to be effective for Web search tasks, e.g. named-page finding...
Se describe la participación del Grupo de Investigación REINA de la Universidad de Salamanca en foro...
Traditionally, stemming has been applied to Information Retrieval tasks by transforming words in doc...
This report presents results for the TREC 2009 adhoc task, the diversity task, and the relevance fee...
Experiments with a multi-lingual web collection are presented. The EuroGOV corpus is the first multi...
Abstract. This paper reports on a statistical stemming algorithm based on link analysis. Considering...
Abstract. The paper describes statistical methods and experiments for stemming and for the translati...
The University of Exeter group participated in the monolingual, bilingual and multilingual-4 retriev...
We describe our participation in the TREC 2003 Robust and Web tracks. For the Robust track, we exp...
Hummingbird participated in the WebCLEF mixed monolingual retrieval task of the Cross-Language Evalu...
Abstract Research into unsupervised ways of stemming has resulted, in the past few years, in the dev...
Now a day’s text documents are advancing over internet, e-mails and web pages. As the use of interne...
We test the utility of European language stemmers created using the Snowball language [1]. This allo...