Changeset-Based Topic Modeling of Software Repositories


Changeset-Based Topic Modeling of Software Repositories is a scholarly work, published in 2020 in ''IEEE Transactions on Software Engineering''. The main subjects of the publication include information retrieval, topic modeling, code, source code, snapshot, data mining, web scraping, retraining, software engineering, obsolescence, glossary of archaeology, scientific workflow system, software, and computer science. The authors expand the authors' work by investigating: a second task (developer identification), the effects of including different changeset parts in the model, the repository characteristics that affect the accuracy of the approach, and the effects of the time invariance assumption on evaluation results.

Related Works