Context: Latent Dirichlet Allocation (LDA) has been successfully used in the literature to extract topics from software documents and support developers in various software engineering tasks. While LDA has been mostly used with default settings, previous studies showed that default hyperparameter values generate sub-optimal topics from software documents. Objective: Recent studies applied meta-heuristic search (mostly evolutionary algorithms) to configure LDA in an unsupervised and automated fashion. However, previous work advocated for different meta-heuristics and surrogate metrics to optimize. The objective of this paper is to shed light on the influence of these two factors when tuning LDA for SE tasks. Method: We empirically evaluated ...
One of the difficulties in maintaining a large software system is the absence of documented business...
Latent Dirichlet Allocation (LDA) is a probability model for grouping hidden topics in documents by ...
With the vast amount of information available on the Internet today, helping users find relevant con...
Latent Dirichlet Allocation (LDA) has been used to support many software engineering tasks. Previous...
Abstract Background: Unstructured and textual data is increasing rapidly and Latent Dirichlet Alloca...
Latent Dirichlet Allocation is a generative technique, the application of which has recently gained ...
A number of approaches in traceability link recovery and other software engineering tasks incorporat...
Latent Dirichlet Allocation (LDA) is a popular machine-learning technique that identifies latent str...
Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requ...
SNPD 2014 : 15th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence...
In today's digital world, customers give their opinions on a product that they have purchased online...
In this paper, I apply latent dirichlet allocation(LDA) to cluster 100,000 health related articles u...
ToPIC (Tuning of Parameters for Inference of Concepts) is a distributed self-tuning engine whose aim...
Many approaches have been introduced to enable Latent Dirichlet Allocation (LDA) models to be update...
Hyper-parameter optimization methods allow efficient and robust hyperparameter search-ing without th...
One of the difficulties in maintaining a large software system is the absence of documented business...
Latent Dirichlet Allocation (LDA) is a probability model for grouping hidden topics in documents by ...
With the vast amount of information available on the Internet today, helping users find relevant con...
Latent Dirichlet Allocation (LDA) has been used to support many software engineering tasks. Previous...
Abstract Background: Unstructured and textual data is increasing rapidly and Latent Dirichlet Alloca...
Latent Dirichlet Allocation is a generative technique, the application of which has recently gained ...
A number of approaches in traceability link recovery and other software engineering tasks incorporat...
Latent Dirichlet Allocation (LDA) is a popular machine-learning technique that identifies latent str...
Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requ...
SNPD 2014 : 15th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence...
In today's digital world, customers give their opinions on a product that they have purchased online...
In this paper, I apply latent dirichlet allocation(LDA) to cluster 100,000 health related articles u...
ToPIC (Tuning of Parameters for Inference of Concepts) is a distributed self-tuning engine whose aim...
Many approaches have been introduced to enable Latent Dirichlet Allocation (LDA) models to be update...
Hyper-parameter optimization methods allow efficient and robust hyperparameter search-ing without th...
One of the difficulties in maintaining a large software system is the absence of documented business...
Latent Dirichlet Allocation (LDA) is a probability model for grouping hidden topics in documents by ...
With the vast amount of information available on the Internet today, helping users find relevant con...