Publication:
Optimizing exoplanet atmosphere retrieval using unsupervised machine-learning classification

dc.contributor.authorJ. J.C. Hayesen_US
dc.contributor.authorE. Kerinsen_US
dc.contributor.authorS. Awiphanen_US
dc.contributor.authorI. McDonalden_US
dc.contributor.authorJ. S. Morganen_US
dc.contributor.authorP. Chuanraksasaten_US
dc.contributor.authorS. Komonjindaen_US
dc.contributor.authorN. Sanguansaken_US
dc.contributor.authorP. Kittaraen_US
dc.contributor.otherSuranaree University of Technologyen_US
dc.contributor.otherMahidol Universityen_US
dc.contributor.otherThe University of Manchesteren_US
dc.contributor.otherChiang Mai Universityen_US
dc.contributor.otherNational Astronomical Research Institute of Thailanden_US
dc.date.accessioned2022-08-04T08:32:38Z
dc.date.available2022-08-04T08:32:38Z
dc.date.issued2021-01-01en_US
dc.description.abstractOne of the principal bottlenecks to atmosphere characterization in the era of all-sky surveys is the availability of fast, autonomous, and robust atmospheric retrieval methods. We present a new approach using unsupervised machine learning to generate informed priors for retrieval of exoplanetary atmosphere parameters from transmission spectra. We use principal component analysis (PCA) to efficiently compress the information content of a library of transmission spectra forward models generated using the PLATON package. We then apply a k-means clustering algorithm in PCA space to segregate the library into discrete classes. We show that our classifier is almost always able to instantaneously place a previously unseen spectrum into the correct class, for low-to-moderate spectral resolutions, R, in the range R = 30−300 and noise levels up to 10 per cent of the peak-to-trough spectrum amplitude. The distribution of physical parameters for all members of the class therefore provides an informed prior for standard retrieval methods such as nested sampling. We benchmark our informed-prior approach against a standard uniform-prior nested sampler, finding that our approach is up to a factor of 2 faster, with negligible reduction in accuracy. We demonstrate the application of this method to existing and near-future observatories, and show that it is suitable for real-world application. Our general approach is not specific to transmission spectroscopy and should be more widely applicable to cases that involve the repetitive fitting of trusted high-dimensional models to large data catalogues, including beyond exoplanetary science.en_US
dc.identifier.citationMonthly Notices of the Royal Astronomical Society. Vol.494, No.3 (2021), 4492-4508en_US
dc.identifier.doi10.1093/mnras/staa978en_US
dc.identifier.issn13652966en_US
dc.identifier.issn00358711en_US
dc.identifier.other2-s2.0-85088572475en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/76871
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85088572475&origin=inwarden_US
dc.subjectEarth and Planetary Sciencesen_US
dc.subjectPhysics and Astronomyen_US
dc.titleOptimizing exoplanet atmosphere retrieval using unsupervised machine-learning classificationen_US
dc.typeArticleen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85088572475&origin=inwarden_US

Files

Collections