The purpose of this thesis is to examine how topic modeling can be used as a tool to explore large sets of text data. This thesis is written on assignment from Nofima Food Research Institute. A set of about 52 000 unknown texts of various lengths was downloaded through an external web-harvesting company (Webhose.io). The texts were collected with a specific search query consisting of food-related vegetarian and vegan keywords, as this is a field of interest to Nofima. Latent Dirichlet Allocation (LDA) is used to create and model the topics in these texts. LDA is a method in which unobserved groups of similar data are explained by a set of words known as a topic. The collected texts are split into smaller subsections based on the ty...
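To make the idea of a topic as a group of words more concrete, the following is a minimal sketch of LDA fitted with collapsed Gibbs sampling in plain Python. It is not the implementation used in the thesis: the function name `lda_gibbs`, the hyperparameter values, and the toy documents are all assumptions chosen for illustration only.

```python
import random


def lda_gibbs(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA to tokenized documents with collapsed Gibbs sampling.

    Hypothetical helper for illustration; alpha and beta are assumed
    symmetric Dirichlet priors, not values taken from the thesis.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]                # tokens in doc d assigned to topic k
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # times word w is assigned to topic k
    topic_total = [0] * num_topics                              # tokens assigned to topic k overall
    assignments = []

    # Start from a random topic for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                # Remove the token's current assignment from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][wid] -= 1
                topic_total[k] -= 1
                # Resample its topic from the conditional distribution.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][wid] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][wid] += 1
                topic_total[k] += 1

    def top_words(k, n=3):
        ranked = sorted(range(vocab_size), key=lambda j: topic_word[k][j], reverse=True)
        return [vocab[j] for j in ranked[:n]]

    return [top_words(k) for k in range(num_topics)], doc_topic


# Toy corpus, invented for this example.
docs = [
    "tofu vegan recipe tofu soy".split(),
    "vegan soy recipe tofu".split(),
    "climate water emissions climate".split(),
    "water emissions climate land".split(),
]
topics, doc_topic = lda_gibbs(docs, num_topics=2)
for k, words in enumerate(topics):
    print(f"topic {k}: {words}")
```

Each topic is reported as its most frequently assigned words, and `doc_topic` gives, per document, how many tokens were assigned to each topic. On a corpus this small the result depends heavily on the seed and the priors, so the output should be read as a demonstration of the mechanics rather than as a meaningful model.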
With the rapid proliferation of social networking sites (SNS), automatic topic extraction from vario...
Ekinci, Ekin (ORCID 0000-0003-0658-592X); Ilhan Omurca, Sevinc (ORCID 0000-0003-1214-9235). Topic models, such as late...
This thesis focuses on finding an end-to-end unsupervised solution to solve a two-step problem of ex...
It is estimated that the world’s data will increase to roughly 160 billion terabytes by 2025, with m...
This paper is in the field of natural language processing. It applied unsupervised machine learning ...
With the increasing prevalence of unstructured online data generated (e.g., social media, online for...
Natural Language Processing is a complex method of data mining the vast trove of documents created a...
Topic models like latent Dirichlet allocation (LDA) provide a framework for analyzing large datasets...
Meat consumption has caused several problems in terms of overusing freshwater, underground water con...
This work aims at discovering topics in a text corpus and classifying the most relevant terms for ea...
Increased meat consumption has been associated with the overuse of fresh water, underground water co...