Due to the lack of explicit word boundaries in Chinese and Japanese, and to some extent in Korean, an additional problem in IR in these languages is to determine the appropriate indexing units. For CLIR with these languages, we also need to determine translation units. Both words and character n-grams have been used as indexing units in IR in these languages; however, only words have been used as translation units in previous studies. In this paper, we compare the use of words and n-grams for both monolingual and cross-lingual IR in these languages. Our experiments show that Chinese character n-grams are reasonable alternatives to words as indexing and translation units, and they lead to retrieval effectiveness comparable to or higher than that of words. Fo...
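To make the notion of character n-grams as indexing units concrete, the following is a minimal sketch of overlapping n-gram extraction from unsegmented text. It is an illustration only, not the implementation used in the paper: the function name `char_ngrams`, the default of bigrams (`n=2`), and the choice to drop whitespace are all assumptions made for this example.

```python
def char_ngrams(text, n=2):
    """Return the overlapping character n-grams of `text`.

    Hypothetical helper for illustration: whitespace is dropped, and a
    string shorter than n characters is returned as a single unit.
    """
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return []
    if len(chars) < n:
        return ["".join(chars)]
    # Slide a window of width n over the character sequence.
    return ["".join(chars[i:i + n]) for i in range(len(chars) - n + 1)]


if __name__ == "__main__":
    # Four characters yield three overlapping bigrams.
    print(char_ngrams("信息检索"))        # ['信息', '息检', '检索']
    # With n=1 every character is its own indexing unit.
    print(char_ngrams("信息检索", n=1))   # ['信', '息', '检', '索']
```

Because no word segmenter is involved, the same routine can be applied to documents and queries alike; whether unigrams, bigrams, or a mixture of the two works best is an empirical question of the kind the experiments above address.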
The majority of recent Cross-Language Information Retrieval (CLIR) research has focused on European ...
Bilingual term lists are extensively used as a resource for dictionary-based Cross-Language Informat...
Based on the NTCIR-4 test-collection, our first objective is to present an overview of the retrieval...
This paper reports on the work of Berkeley group at the second NTCIR workshop on Japanese & Engl...
Thomson Legal and Regulatory participated in the CLIR task of the NTCIR-3 workshop. We submitted for...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identif...
In cross-language information retrieval (CLIR), novel or non-standard expressions, technica...
This paper describes our third participation in an evaluation campaign involving the Chinese, Japane...
This paper describes our second participation in an evaluation campaign involving the Chinese, Japa...
In NTCIR-5 our focus is to see if web-assisted query expansion is useful, and to test an English-Kor...
Cross-language information retrieval (CLIR), where queries and documents are in different languages...
This paper describes our retrieval system for the NTCIR-3 CLIR task, focusing on Japanese and Engli...
We participated in the monolingual Chinese, English-Chinese cross language and multilingual retrieva...
This paper presents results for the Japanese/English cross-language information retrieval task on te...