The paper describes an innovative approach to expanding the domain coverage of the Slovene wordnet (sloWNet) by exploiting multiple resources. In the experiment described here we use a large monolingual Slovene corpus of texts from the domain of informatics to harvest terminology, together with a parallel English-Slovene corpus and an online dictionary as bilingual resources to facilitate the mapping of terms to sloWNet. We first identify the core terms of the domain in English using Princeton WordNet 2.1, then translate them into Slovene using a bilingual lexicon produced from the parallel corpus. In the next step we extract multi-word terms from the Slovene domain-specific corpus using a hybrid approach, and final...
Various efforts have been made for the development of tools and methods dedicated to the automatic p...
One of the main tasks of the Natural Language Processing Group at the Faculty ...
The availability of large collections of text (language corpora) is crucial for empirically supporte...
sloWNet is the Slovene WordNet developed in the expand approach: it contains the complete Princeton ...
In this paper we present an automatic, language-independent approach to extend...
In this paper we present a language-independent, fully modular and automatic a...
This paper compares automatically generated sets of synonyms in French and Slo...
Open Slovene WordNet (OSWN) is derived from Open English WordNet (https://en-word.net/), which itsel...
We present the current implementation state of our work consisting in interlinking language data and...
This is an automatically created Slovene thesaurus from Slovene data available in a comprehensive En...
In this paper we present a set of tools that will help developers of wordnets not only to increase ...
The paper describes the project whose main purpose is the creation of the Slovene terminology web po...
In this paper we present the mapping between WordNet domains and WordNet topics, and the emergent Wi...
In many scientific, technological or political fields terminology and the production of up-to-date ...
The objective of this paper is to present a method to automatically enrich WordNet with sub-trees of...