Publication:
Predicting formation of haloacetic acids by chlorination of organic compounds using machine-learning-assisted quantitative structure-activity relationships

dc.contributor.authorJosé Andrés Corderoen_US
dc.contributor.authorKai Heen_US
dc.contributor.authorKanjira Janyaen_US
dc.contributor.authorShinya Echigoen_US
dc.contributor.authorSadahiko Itohen_US
dc.contributor.otherResearch Center for Environmental Quality Managementen_US
dc.contributor.otherMahidol Universityen_US
dc.contributor.otherKyoto Universityen_US
dc.date.accessioned2020-12-28T05:29:04Z
dc.date.available2020-12-28T05:29:04Z
dc.date.issued2020-01-01en_US
dc.description.abstract© 2020 Elsevier B.V. The presence of disinfection byproducts (DBPs) in drinking water is a major public health concern, and an effective strategy to limit the formation of these DBPs is to prevent their precursors. In silico prediction from chemical structure would allow rapid identification of precursors and could be used as a prescreening tool to prioritize testing. We present models using machine learning algorithms (i.e., support vector regressor, random forest regressor, and multilayer perceptron regressor) and chemical descriptors as features to predict the formation of haloacetic acids (HAAs). A robust model with good predictivity (i.e., leave-one-out cross-validated Q2 > 0.5) to predict the formation of trichloroacetic acid (TCAA) was developed using a random forest regressor. The number of aromatic bonds, hydrophilicity, and electrotopological descriptors related to electrostatic interactions and the atomic distribution of electronegativity were identified as important predictors of TCAA formation potentials (FPs). However, the prediction of dichloroacetic acid was less accurate, which is congruent with the presence of different types of precursors exhibiting distinct mechanisms. This study demonstrates that nonlinear combinations of general chemical descriptors can adequately estimate HAAFPs, and we hope that our study can be used to predict precursors of other disinfection byproducts based on chemical structures using a similar workflow.en_US
dc.identifier.citationJournal of Hazardous Materials. (2020)en_US
dc.identifier.doi10.1016/j.jhazmat.2020.124466en_US
dc.identifier.issn18733336en_US
dc.identifier.issn03043894en_US
dc.identifier.other2-s2.0-85096010084en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/60481
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85096010084&origin=inwarden_US
dc.subjectEnvironmental Scienceen_US
dc.titlePredicting formation of haloacetic acids by chlorination of organic compounds using machine-learning-assisted quantitative structure-activity relationshipsen_US
dc.typeArticleen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85096010084&origin=inwarden_US

Files

Collections