Adaptive Lasso sparse logistic regression on high-dimensional data with multicollinearity

dc.contributor.authorSudjai N.
dc.contributor.authorDuangsaphon M.
dc.contributor.authorChandhanayingyong C.
dc.contributor.correspondenceSudjai N.
dc.contributor.otherMahidol University
dc.date.accessioned2025-11-16T18:15:18Z
dc.date.available2025-11-16T18:15:18Z
dc.date.issued2025-02-25
dc.description.abstractA combination of high-dimensional sparse data and multicollinearity problems can lead to instabilities in a predictive model when applied to a new data set. The least absolute shrinkage and selection operator (Lasso) is widely employed in machine-learning algorithm for variable selection and parameter estimations. Although this method is computationally feasible for high-dimensional data, it has some drawbacks. Thus, the adaptive Lasso was developed using the adaptive weight on penalty function. This adaptive weight is related to the power order of the estimators. Hence, we focus on the power of adaptive weight on two penalty functions: adaptive Lasso and adaptive elastic net. This study aimed to compare the performances of the power of the adaptive Lasso and adaptive elastic net methods under high-dimensional sparse data with multicollinearity. Moreover, the performances of four penalized methods were compared: Lasso, elastic net, adaptive Lasso, and adaptive elastic net. They were compared using the mean of the predicted mean squared error for the simulation study and the classification accuracy for a real-data application. The results showed that the higher-order of the adaptive Lasso method performed best on very high-dimensional sparse data with multicollinearity when the initial weight was determined using a ridge estimator. However, in the case of high-dimensional sparse data with multicollinearity, the square root of the adaptive Lasso together with the initial weight using Lasso was the best option.
dc.identifier.citationScience Engineering and Health Studies Vol.19 (2025)
dc.identifier.doi10.69598/sehs.19.25020002
dc.identifier.eissn26300087
dc.identifier.scopus2-s2.0-105020747709
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/113025
dc.rights.holderSCOPUS
dc.subjectMultidisciplinary
dc.titleAdaptive Lasso sparse logistic regression on high-dimensional data with multicollinearity
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105020747709&origin=inward
oaire.citation.titleScience Engineering and Health Studies
oaire.citation.volume19
oairecerif.author.affiliationThammasat University
oairecerif.author.affiliationSiriraj Hospital

Files

Collections