TY - JOUR
T1 - Cycle-starnet
T2 - Bridging the gap between theory and data by leveraging large data sets
AU - O'Briain, Teaghan
AU - Ting, Yuan Sen
AU - Fabbro, Sébastien
AU - Yi, Kwang M.
AU - Venn, Kim
AU - Bialek, Spencer
N1 - Publisher Copyright:
© 2021. The American Astronomical Society.
PY - 2021/1/10
Y1 - 2021/1/10
N2 - Advancements in stellar spectroscopy data acquisition have made it necessary to accomplish similar improvements in efficient data analysis techniques. Current automated methods for analyzing spectra are either (a) data driven, which requires prior knowledge of stellar parameters and elemental abundances, or (b) based on theoretical synthetic models that are susceptible to the gap between theory and practice. In this study, we present a hybrid generative domain-adaptation method that turns simulated stellar spectra into realistic spectra by applying unsupervised learning to large spectroscopic surveys. We apply our technique to the APOGEE H-band spectra at R = 22,500 and the Kurucz synthetic models. As a proof of concept, two case studies are presented. The first is the calibration of synthetic data to become consistent with observations. To accomplish this, synthetic models are morphed into spectra that resemble observations, thereby reducing the gap between theory and observations. Fitting the observed spectra shows an improved average cR 2 reduced from 1.97 to 1.22, along with a mean residual reduced from 0.16 to-0.01 in normalized flux. The second case study is the identification of the elemental source of missing spectral lines in the synthetic modeling. A mock data set is used to show that absorption lines can be recovered when they are absent in one of the domains. This method can be applied to other fields that use large data sets and are currently limited by modeling accuracy.
AB - Advancements in stellar spectroscopy data acquisition have made it necessary to accomplish similar improvements in efficient data analysis techniques. Current automated methods for analyzing spectra are either (a) data driven, which requires prior knowledge of stellar parameters and elemental abundances, or (b) based on theoretical synthetic models that are susceptible to the gap between theory and practice. In this study, we present a hybrid generative domain-adaptation method that turns simulated stellar spectra into realistic spectra by applying unsupervised learning to large spectroscopic surveys. We apply our technique to the APOGEE H-band spectra at R = 22,500 and the Kurucz synthetic models. As a proof of concept, two case studies are presented. The first is the calibration of synthetic data to become consistent with observations. To accomplish this, synthetic models are morphed into spectra that resemble observations, thereby reducing the gap between theory and observations. Fitting the observed spectra shows an improved average cR 2 reduced from 1.97 to 1.22, along with a mean residual reduced from 0.16 to-0.01 in normalized flux. The second case study is the identification of the elemental source of missing spectral lines in the synthetic modeling. A mock data set is used to show that absorption lines can be recovered when they are absent in one of the domains. This method can be applied to other fields that use large data sets and are currently limited by modeling accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85100309935&partnerID=8YFLogxK
U2 - 10.3847/1538-4357/abca96
DO - 10.3847/1538-4357/abca96
M3 - Article
SN - 0004-637X
VL - 906
JO - Astrophysical Journal
JF - Astrophysical Journal
IS - 2
M1 - abca96
ER -