Trying to make sense of large sets of data is becoming a task very central to computer science in general. Topic models, capable of uncovering the semantic themes pervading through large collections of documents, have seen a surge in popularity in recent years, with applications in a variety of domains. In this thesis, topic models are applied to source code repositories, specifically for the purpose of concept location - offering an overview of which features are contained within asystem, the relationships between such features, and their locality within the system. Topic models are high level statistical tools; their raw output is given in terms of probability distributions, suited neither for simple interpretation nor deep analysis.Inter...
Topic models remain a black box both for modelers and for end users in many respects. From the model...
Studying the evolution of topics (collections of co-occurring words) in a software project is an eme...
Using topic models to mine domain topics from source code has been a promising way for developers to...
Trying to make sense of large sets of data is becoming a task very central to computer science in ge...
Abstract-Topic modeling has seen a surge in use for software comprehension. Although the models infe...
Software repositories, such as source code, email archives, and bug databases, contain unstructured ...
Topics are collections of words that co-occur fre-quently in a text corpus. Topics have been found t...
Software maintenance and the understanding of where in the source code features are implemented are ...
ware development by mining and analyzing software repositories. Since the ma-jority of the software ...
Abstract—Exploring linguistic topics in source code is a pro-gram comprehension activity that shows ...
Text contents are overloaded with the digitization of the data and new contents are transmitted thro...
Topic Modeling for Research Software ABSTRACT Currently, the amount of daily publications in diffe...
Latent Direchlet Allocation (LDA) is a statistical topic modeling approach that has been used to sup...
We present TopicNets, a Web-based system for visual and interactive analysis of large sets of docume...
Statistical topic models provide a general data-driven framework for automated discovery of high-lev...
Topic models remain a black box both for modelers and for end users in many respects. From the model...
Studying the evolution of topics (collections of co-occurring words) in a software project is an eme...
Using topic models to mine domain topics from source code has been a promising way for developers to...
Trying to make sense of large sets of data is becoming a task very central to computer science in ge...
Abstract-Topic modeling has seen a surge in use for software comprehension. Although the models infe...
Software repositories, such as source code, email archives, and bug databases, contain unstructured ...
Topics are collections of words that co-occur fre-quently in a text corpus. Topics have been found t...
Software maintenance and the understanding of where in the source code features are implemented are ...
ware development by mining and analyzing software repositories. Since the ma-jority of the software ...
Abstract—Exploring linguistic topics in source code is a pro-gram comprehension activity that shows ...
Text contents are overloaded with the digitization of the data and new contents are transmitted thro...
Topic Modeling for Research Software ABSTRACT Currently, the amount of daily publications in diffe...
Latent Direchlet Allocation (LDA) is a statistical topic modeling approach that has been used to sup...
We present TopicNets, a Web-based system for visual and interactive analysis of large sets of docume...
Statistical topic models provide a general data-driven framework for automated discovery of high-lev...
Topic models remain a black box both for modelers and for end users in many respects. From the model...
Studying the evolution of topics (collections of co-occurring words) in a software project is an eme...
Using topic models to mine domain topics from source code has been a promising way for developers to...