Using topic models to mine domain topics from source code has been a promising way for developers to comprehend the functional concerns implemented in the source code of a software system. However, not all the topics mined from source code are domain topics that represent functional concerns of the software. Besides domain topics, other topics may represent cross-cutting concerns or other concerns. These topics are noises in the context of helping developers to comprehend the functional concerns. In this paper, we propose an approach to filter out noises and mine Cohesive Domain Topics (CDTs) from source code. A topic is a CDT if its associated words represent certain functional concern and its associated source code elements collaborativel...
In this paper we investigate the potential benefits of Latent Dirichlet Allocation (LDA) as a techni...
As the information load grows, it becomes increasingly difficult to follow-up new trends in business...
Address email Large repositories of source code create new challenges and opportunities for sta-tist...
One of the difficulties in maintaining a large software system is the absence of documented business...
LNCS, volume 10086Developers nowadays can leverage existing systems to build their own applications....
Software repositories, such as source code, email archives, and bug databases, contain unstructured ...
Abstract Programming question and answer (Q&A) websites, such as Stack Overflow, leverage the kn...
Topics are collections of words that co-occur fre-quently in a text corpus. Topics have been found t...
Trying to make sense of large sets of data is becoming a task very central to computer science in ge...
Information Retrieval (IR) methods, and in particular topic models, have recently been used to suppo...
Studying the evolution of topics (collections of co-occurring words) in a software project is an eme...
Abstract — Software maintenance tasks require familiarity with the entire software system to make pr...
Community website for programming related Q&A (Question and Answer), Stack Overflow serves as a ...
Abstract—Exploring linguistic topics in source code is a pro-gram comprehension activity that shows ...
Abstract—Locating buggy code is a time-consuming task in software development. Given a new bug repor...
In this paper we investigate the potential benefits of Latent Dirichlet Allocation (LDA) as a techni...
As the information load grows, it becomes increasingly difficult to follow-up new trends in business...
Address email Large repositories of source code create new challenges and opportunities for sta-tist...
One of the difficulties in maintaining a large software system is the absence of documented business...
LNCS, volume 10086Developers nowadays can leverage existing systems to build their own applications....
Software repositories, such as source code, email archives, and bug databases, contain unstructured ...
Abstract Programming question and answer (Q&A) websites, such as Stack Overflow, leverage the kn...
Topics are collections of words that co-occur fre-quently in a text corpus. Topics have been found t...
Trying to make sense of large sets of data is becoming a task very central to computer science in ge...
Information Retrieval (IR) methods, and in particular topic models, have recently been used to suppo...
Studying the evolution of topics (collections of co-occurring words) in a software project is an eme...
Abstract — Software maintenance tasks require familiarity with the entire software system to make pr...
Community website for programming related Q&A (Question and Answer), Stack Overflow serves as a ...
Abstract—Exploring linguistic topics in source code is a pro-gram comprehension activity that shows ...
Abstract—Locating buggy code is a time-consuming task in software development. Given a new bug repor...
In this paper we investigate the potential benefits of Latent Dirichlet Allocation (LDA) as a techni...
As the information load grows, it becomes increasingly difficult to follow-up new trends in business...
Address email Large repositories of source code create new challenges and opportunities for sta-tist...