Sprint2Vec: a deep characterization of sprints in iterative software development

dc.contributor.authorChoetkiertikul M.
dc.contributor.authorBanyongrakkul P.
dc.contributor.authorRagkhitwetsagul C.
dc.contributor.authorTuarob S.
dc.contributor.authorDam H.K.
dc.contributor.authorSunetnanta T.
dc.contributor.correspondenceChoetkiertikul M.
dc.contributor.otherMahidol University
dc.date.accessioned2024-12-13T18:13:28Z
dc.date.available2024-12-13T18:13:28Z
dc.date.issued2024-01-01
dc.description.abstractIterative approaches like Agile Scrum are commonly adopted to enhance the software development process. However, challenges such as schedule and budget overruns still persist in many software projects. Several approaches employ machine learning techniques, particularly classification, to facilitate decision-making in iterative software development. Existing approaches often concentrate on characterizing a sprint to predict solely productivity. We introduce Sprint2Vec, which leverages three aspects of sprint information - sprint attributes, issue attributes, and the developers involved in a sprint, to comprehensively characterize it for predicting both productivity and quality outcomes of the sprints. Our approach combines traditional feature extraction techniques with automated deep learning-based unsupervised feature learning techniques. We utilize methods like Long Short-Term Memory (LSTM) to enhance our feature learning process. This enables us to learn features from unstructured data, such as textual descriptions of issues and sequences of developer activities. We conducted an evaluation of our approach on two regression tasks: predicting the deliverability (i.e., the amount of work delivered from a sprint) and quality of a sprint (i.e., the amount of delivered work that requires rework). The evaluation results on five well-known open-source projects (Apache, Atlassian, Jenkins, Spring, and Talendforge) demonstrate our approach's superior performance compared to baseline and alternative approaches.
dc.identifier.citationIEEE Transactions on Software Engineering (2024)
dc.identifier.doi10.1109/TSE.2024.3509016
dc.identifier.eissn19393520
dc.identifier.issn00985589
dc.identifier.scopus2-s2.0-85210995629
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/102347
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titleSprint2Vec: a deep characterization of sprints in iterative software development
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85210995629&origin=inward
oaire.citation.titleIEEE Transactions on Software Engineering
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationUniversity of Wollongong

Files

Collections