Application of Topic Models in text mining of educational data and more specifically, the text data obtained from lecture videos, is an area of research which is largely unexplored yet holds great potential. This work seeks to find empirical evidence for an improvement in Topic Modeling by pre- extracting bigram tokens and adding them as additional features in the Latent Dirichlet Allocation (LDA) algorithm, a widely-recognized topic modeling technique. The dataset considered for analysis is a collection of transcripts of video lectures on Machine Learning scraped from YouTube. Using the cosine similarity distance measure as a metric, the experiment showed a statistically significant improvement in topic model performance against the baseli...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
With the advent and popularity of big data mining and huge text analysis in modern times, automated ...
The main aim of this article is to present the results of different experiments focused on the probl...
Topic modeling has been widely adopted by researchers for a variety of different research problems t...
This study demonstrates a way to generate a Topic model using LDA (Latent Dirichlet Allocation) topi...
This work aims at discovering topics in a text corpus and classifying the most relevant terms for ea...
Probabilistic topic modeling is a powerful tool to uncover hidden thematic structure of documents. T...
Natural Language Processing is a complex method of data mining the vast trove of documents created a...
Topic modeling has been used widely to extract the structures (topics) in a collection (corpus) of d...
This dataset contains scraped and processed text from roughly 100 years of articles published in the...
In this thesis, we investigate several extensions of the basic Latent Dirichlet Allocation model for...
Text mining has a wide range of applications in education. In this paper, we review Latent Dirichlet...
Latent Dirichlet Allocation (LDA) is a popular machine-learning technique that identifies latent str...
Abstract: The development and rapid popularization of the internet has led to an exponential growth ...
Topic modeling is often perceived as a relatively new development in information retrieval sciences,...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
With the advent and popularity of big data mining and huge text analysis in modern times, automated ...
The main aim of this article is to present the results of different experiments focused on the probl...
Topic modeling has been widely adopted by researchers for a variety of different research problems t...
This study demonstrates a way to generate a Topic model using LDA (Latent Dirichlet Allocation) topi...
This work aims at discovering topics in a text corpus and classifying the most relevant terms for ea...
Probabilistic topic modeling is a powerful tool to uncover hidden thematic structure of documents. T...
Natural Language Processing is a complex method of data mining the vast trove of documents created a...
Topic modeling has been used widely to extract the structures (topics) in a collection (corpus) of d...
This dataset contains scraped and processed text from roughly 100 years of articles published in the...
In this thesis, we investigate several extensions of the basic Latent Dirichlet Allocation model for...
Text mining has a wide range of applications in education. In this paper, we review Latent Dirichlet...
Latent Dirichlet Allocation (LDA) is a popular machine-learning technique that identifies latent str...
Abstract: The development and rapid popularization of the internet has led to an exponential growth ...
Topic modeling is often perceived as a relatively new development in information retrieval sciences,...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
With the advent and popularity of big data mining and huge text analysis in modern times, automated ...
The main aim of this article is to present the results of different experiments focused on the probl...