Studying the evolution of topics (collections of co-occurring words) in a software project is an emerging technique to automatically shed light on how the project is changing over time: which topics are becoming more actively devel-oped, which ones are dying down, or which topics are lately more error-prone and hence require more testing. Existing techniques for modeling the evolution of topics in software projects suffer from issues of data duplication, i.e., when the repository contains multiple copies of the same document, as is the case in source code histories. To address this is-sue, we propose the Diff model, which applies a topic model only to the changes of the documents in each version instead of to the whole document at each vers...
AbstractIn order to study software evolution, it is necessary to measure artefacts representative of...
Software systems have to evolve over their life-cycle or they become progressively less useful. The ...
Using topic models to mine domain topics from source code has been a promising way for developers to...
Studying the evolution of topics (collections of co-occurring words) in a software project is an eme...
Topics are collections of words that co-occur fre-quently in a text corpus. Topics have been found t...
Abstract—As the development of a software project progresses, its complexity grows accordingly, maki...
A large document collection that builds up over time usually contains a number of different ...
A large document collection that builds up over time usually contains a number of different themes. ...
Software change histories are results of incremental updates made by developers. As a byproduct of t...
Multiple studies found that developer questions about the history of code were among the hardest and...
Over the past three decades, more and more research has been spent on understanding software evoluti...
Trying to make sense of large sets of data is becoming a task very central to computer science in ge...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
Version control and bug tracking systems contain large amounts of historical information that can gi...
International audienceVersion Control Systems are key elements of modern software development. They ...
AbstractIn order to study software evolution, it is necessary to measure artefacts representative of...
Software systems have to evolve over their life-cycle or they become progressively less useful. The ...
Using topic models to mine domain topics from source code has been a promising way for developers to...
Studying the evolution of topics (collections of co-occurring words) in a software project is an eme...
Topics are collections of words that co-occur fre-quently in a text corpus. Topics have been found t...
Abstract—As the development of a software project progresses, its complexity grows accordingly, maki...
A large document collection that builds up over time usually contains a number of different ...
A large document collection that builds up over time usually contains a number of different themes. ...
Software change histories are results of incremental updates made by developers. As a byproduct of t...
Multiple studies found that developer questions about the history of code were among the hardest and...
Over the past three decades, more and more research has been spent on understanding software evoluti...
Trying to make sense of large sets of data is becoming a task very central to computer science in ge...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
Version control and bug tracking systems contain large amounts of historical information that can gi...
International audienceVersion Control Systems are key elements of modern software development. They ...
AbstractIn order to study software evolution, it is necessary to measure artefacts representative of...
Software systems have to evolve over their life-cycle or they become progressively less useful. The ...
Using topic models to mine domain topics from source code has been a promising way for developers to...