The aim of this thesis is training named entity recognition model on a dataset created using structured data. Datasets were created from the names of products and books extracted from structured data in JSON-LD and Microdata format. Structured data were extracted from e-shop and social cataloging websites by web scraping. Names were used as a dataset by themselves as well as webpage text with automatically annotated matches of the names. In total eight models in Czech language were trained for recognizing names of products and books using spaCy library. F-score results are up to 89.94 for products and up to 84.26 for books evaluated on a created testing dataset
Nestrukturirani dokumenti zajemajo informacije v oblikah in postavitvah, ki se lahko od enega primer...
The presented thesis deals with the task of automatic information extraction from HTML documents for...
This bachelor thesis is dedicated to mechanical techniques that are used in the natural language pro...
Named entity recognition from natural language texts is getting more important every day, because it...
Named entity recognition from natural language texts is getting more important every day, because it...
The target of this thesis is to analyze articles on the Wikipedia internet encyclopedia and to conve...
This master thesis deals with the extraction of relationships between named entities in the text. In...
This paper deals with recognition of named entities in Czech texts. We present a recently released c...
Named entities are collocations used to refer to real world objects in text. Named entity normalizat...
Effective Web content filtering is a necessity in educational and workplace environments, but curren...
This dissertation addresses the problem of classification of entities in text represented by noun ph...
The goal of this master thesis is to design and implement a named entity recognition and linking alg...
Title: Neural Network Based Named Entity Recognition Author: Jana Straková Institute: Institute of F...
Title: Neural Network Based Named Entity Recognition Author: Jana Straková Institute: Institute of F...
We present a collection of Named Entity Recognition (NER) systems for six Slavic languages: Bulgaria...
Nestrukturirani dokumenti zajemajo informacije v oblikah in postavitvah, ki se lahko od enega primer...
The presented thesis deals with the task of automatic information extraction from HTML documents for...
This bachelor thesis is dedicated to mechanical techniques that are used in the natural language pro...
Named entity recognition from natural language texts is getting more important every day, because it...
Named entity recognition from natural language texts is getting more important every day, because it...
The target of this thesis is to analyze articles on the Wikipedia internet encyclopedia and to conve...
This master thesis deals with the extraction of relationships between named entities in the text. In...
This paper deals with recognition of named entities in Czech texts. We present a recently released c...
Named entities are collocations used to refer to real world objects in text. Named entity normalizat...
Effective Web content filtering is a necessity in educational and workplace environments, but curren...
This dissertation addresses the problem of classification of entities in text represented by noun ph...
The goal of this master thesis is to design and implement a named entity recognition and linking alg...
Title: Neural Network Based Named Entity Recognition Author: Jana Straková Institute: Institute of F...
Title: Neural Network Based Named Entity Recognition Author: Jana Straková Institute: Institute of F...
We present a collection of Named Entity Recognition (NER) systems for six Slavic languages: Bulgaria...
Nestrukturirani dokumenti zajemajo informacije v oblikah in postavitvah, ki se lahko od enega primer...
The presented thesis deals with the task of automatic information extraction from HTML documents for...
This bachelor thesis is dedicated to mechanical techniques that are used in the natural language pro...