Reusing My Own Code: Preliminary Results for Competitive Coding in Jupyter Notebooks

dc.contributor.authorRitta N.
dc.contributor.authorSettewong T.
dc.contributor.authorKula R.G.
dc.contributor.authorRagkhitwetsagul C.
dc.contributor.authorSunetnanta T.
dc.contributor.authorMatsumoto K.
dc.contributor.otherMahidol University
dc.date.accessioned2023-06-18T17:02:28Z
dc.date.available2023-06-18T17:02:28Z
dc.date.issued2022-01-01
dc.description.abstractThe reuse of already existing code is widely considered a popular software development practice, that provides both benefits and drawbacks for all stakeholders involved. Prior work reports on how code reuse is a common practice in software development projects and data science projects such as machine learning pipelines. Recently, there has been much code reuse work in the context of competitive programming. Although there is work such as detecting plagiarism, there is no work that studies how a competitor will reuse their own code. In this paper, we present a preliminary study on the code reuse behavior of three grandmasters' Jupyter notebooks in the Kaggle Competitions, an online competition platform for data scientists, and report the types of code they often reuse. Grandmasters are the highest level reached in competitions (novice, expert, master, and grandmaster). We find that Grandmasters are less likely to reuse specialized code, but instead, tend to reuse common functions like importing packages (importing the pandas library). They are most likely to reuse common abstractions like importing packages, configurations, file IO operations, show data, plotting graphs, defining functions, and exploring files. The work opens up new research potential into recommending how developers can reuse their own code.
dc.identifier.citationProceedings - Asia-Pacific Software Engineering Conference, APSEC Vol.2022-December (2022) , 457-461
dc.identifier.doi10.1109/APSEC57359.2022.00062
dc.identifier.issn15301362
dc.identifier.scopus2-s2.0-85149168830
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/84311
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titleReusing My Own Code: Preliminary Results for Competitive Coding in Jupyter Notebooks
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85149168830&origin=inward
oaire.citation.endPage461
oaire.citation.startPage457
oaire.citation.titleProceedings - Asia-Pacific Software Engineering Conference, APSEC
oaire.citation.volume2022-December
oairecerif.author.affiliationNara Institute of Science and Technology
oairecerif.author.affiliationMahidol University

Files

Collections