Why Visualize Data When Coding? Preliminary Categories for Coding in Jupyter Notebooks

dc.contributor.authorSettewong T.
dc.contributor.authorRitta N.
dc.contributor.authorKula R.G.
dc.contributor.authorRagkhitwetsagul C.
dc.contributor.authorSunetnanta T.
dc.contributor.authorMatsumoto K.
dc.contributor.otherMahidol University
dc.date.accessioned2023-06-18T17:02:27Z
dc.date.available2023-06-18T17:02:27Z
dc.date.issued2022-01-01
dc.description.abstractData visualization becomes a crucial component in data analytics, especially data exploration, understanding, and analysis. Effective data visualization impacts decision-making and aids in discovering and understanding relationships. It leads to benefits in data-intensive software development tasks e.g., feature engineering in machine learning-based software projects. However, it is unknown how visualizations are used in competitive programming. The idea of this paper is to report early results on what visualizations are prevalent in competitive programming. Grandmasters are the highest level reached in competitions (novice, expert, master, and grandmaster). Analyzing the visualizations of 7 high-rank competitors (i.e., Grandmaster) in Kaggle, we identify and present a catalog of visualizations used to both tell a story from the data, as well as explain the process and pipelines involved to explain their coding solutions. Our taxonomy includes nine types from over 821 visualizations in 68 instances of Jupyter notebooks. Furthermore, most visualizations are for data analysis for distribution (DA Distribution), and frequency (DA Frequency) are most used. We envision that this catalog can be useful to better understand different situations in which to employ these visualizations.
dc.identifier.citationProceedings - Asia-Pacific Software Engineering Conference, APSEC Vol.2022-December (2022) , 462-466
dc.identifier.doi10.1109/APSEC57359.2022.00063
dc.identifier.issn15301362
dc.identifier.scopus2-s2.0-85149180580
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/84309
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titleWhy Visualize Data When Coding? Preliminary Categories for Coding in Jupyter Notebooks
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85149180580&origin=inward
oaire.citation.endPage466
oaire.citation.startPage462
oaire.citation.titleProceedings - Asia-Pacific Software Engineering Conference, APSEC
oaire.citation.volume2022-December
oairecerif.author.affiliationNara Institute of Science and Technology
oairecerif.author.affiliationMahidol University

Files

Collections