Software repositories, such as source code, email archives, and bug databases, contain unstructured and unlabeled text that is difficult to analyze with traditional techniques. We propose the use of statistical topic models to automatically discover structure in these textual repositories. This dis-covered structure has the potential to be used in software engineering tasks, such as bug prediction and traceability link recovery. Our research goal is to address the challenges of applying topic models to software repositories
LNCS, volume 10086Developers nowadays can leverage existing systems to build their own applications....
MINING SOFTWARE REPOSITORIES, which is the process of analyzing the data re-lated to software develo...
International audienceBackground: Software development results in the production of various types of...
ware development by mining and analyzing software repositories. Since the ma-jority of the software ...
Trying to make sense of large sets of data is becoming a task very central to computer science in ge...
Reporting bugs is one of the vital activities for evolving software systems. Given such reports, dev...
Abstract Context Mining software repositories has emerged as a research direction over the past deca...
Topics are collections of words that co-occur fre-quently in a text corpus. Topics have been found t...
Large repositories of source code create new challenges and opportunities for statistical machine le...
Address email Large repositories of source code create new challenges and opportunities for sta-tist...
Abstract—Locating buggy code is a time-consuming task in software development. Given a new bug repor...
Information Retrieval (IR) methods, and in particular topic models, have recently been used to suppo...
Using topic models to mine domain topics from source code has been a promising way for developers to...
Much of what is written about a software project is soon forgotten. Software repositories are full o...
Abstract — Software maintenance tasks require familiarity with the entire software system to make pr...
LNCS, volume 10086Developers nowadays can leverage existing systems to build their own applications....
MINING SOFTWARE REPOSITORIES, which is the process of analyzing the data re-lated to software develo...
International audienceBackground: Software development results in the production of various types of...
ware development by mining and analyzing software repositories. Since the ma-jority of the software ...
Trying to make sense of large sets of data is becoming a task very central to computer science in ge...
Reporting bugs is one of the vital activities for evolving software systems. Given such reports, dev...
Abstract Context Mining software repositories has emerged as a research direction over the past deca...
Topics are collections of words that co-occur fre-quently in a text corpus. Topics have been found t...
Large repositories of source code create new challenges and opportunities for statistical machine le...
Address email Large repositories of source code create new challenges and opportunities for sta-tist...
Abstract—Locating buggy code is a time-consuming task in software development. Given a new bug repor...
Information Retrieval (IR) methods, and in particular topic models, have recently been used to suppo...
Using topic models to mine domain topics from source code has been a promising way for developers to...
Much of what is written about a software project is soon forgotten. Software repositories are full o...
Abstract — Software maintenance tasks require familiarity with the entire software system to make pr...
LNCS, volume 10086Developers nowadays can leverage existing systems to build their own applications....
MINING SOFTWARE REPOSITORIES, which is the process of analyzing the data re-lated to software develo...
International audienceBackground: Software development results in the production of various types of...