This thesis follows up text categorization. In the first part are described several chosen algorithms for a categorization of documents - the Bayesian model, a categorization with a neural networks and a vector model. Practice part is focused on a algorithm vector model. The vector model is based on idea of two vectors. One vector represents a pattern and second a query. In our case first vector corresponds with a category and the second one with the document. Coordinates of the vector are weights of single words in the text or in the branch depends on, which vector we think about. For comparing are possible to use several procedures like Dice coefficient similarity, Jaccard coefficient or cosine similarity. In my thesis is used cosine simi...
Text categorization is the process of sorting text documents into one or more predefined categories ...
Text data mining is the process of extracting and analyzing valuable information from text. A text d...
Natural language processing is an interdisciplinary field of research which studies the problems and...
Práce se zabývá porovnáváním textu a jeho kategorizaci. Kategorie, které je program schopen určit, z...
Text categorization (also known as text classification) is the task of automatically assigning docum...
Text categorization is the task in which text documents are classified into one or more of predefine...
With the development of online data, text categorization has become one of the key procedures for ta...
Supervised text categorization is a machine learning task where a predefined category label is autom...
Master of ScienceDepartment of Computer ScienceWilliam HsuThis work describes a comparative study of...
Master of ScienceDepartment of Computer ScienceWilliam HsuThis work describes a comparative study of...
Natural language processing is an interdisciplinary field of research which studies the problems and...
This thesis presents the application of various classification techniques on text documents. Since t...
Because of the explosion of digital and online text information, automatic organization of documents...
Text categorization is the task of discovering the category or class text documents belongs to, or i...
This paper focuses on a comparative evaluation of a wide-range of text categorization methods, inclu...
Text categorization is the process of sorting text documents into one or more predefined categories ...
Text data mining is the process of extracting and analyzing valuable information from text. A text d...
Natural language processing is an interdisciplinary field of research which studies the problems and...
Práce se zabývá porovnáváním textu a jeho kategorizaci. Kategorie, které je program schopen určit, z...
Text categorization (also known as text classification) is the task of automatically assigning docum...
Text categorization is the task in which text documents are classified into one or more of predefine...
With the development of online data, text categorization has become one of the key procedures for ta...
Supervised text categorization is a machine learning task where a predefined category label is autom...
Master of ScienceDepartment of Computer ScienceWilliam HsuThis work describes a comparative study of...
Master of ScienceDepartment of Computer ScienceWilliam HsuThis work describes a comparative study of...
Natural language processing is an interdisciplinary field of research which studies the problems and...
This thesis presents the application of various classification techniques on text documents. Since t...
Because of the explosion of digital and online text information, automatic organization of documents...
Text categorization is the task of discovering the category or class text documents belongs to, or i...
This paper focuses on a comparative evaluation of a wide-range of text categorization methods, inclu...
Text categorization is the process of sorting text documents into one or more predefined categories ...
Text data mining is the process of extracting and analyzing valuable information from text. A text d...
Natural language processing is an interdisciplinary field of research which studies the problems and...