TY - JOUR
T1 - The EPOCH Project
T2 - I. Periodic variable stars in the EROS-2 LMC database â
AU - Kim, Dae Won
AU - Protopapas, Pavlos
AU - Bailer-Jones, Coryn A.L.
AU - Byun, Yong Ik
AU - Chang, Seo Won
AU - Marquette, Jean Baptiste
AU - Shin, Min Su
PY - 2014/6
Y1 - 2014/6
N2 - The EPOCH (EROS-2 periodic variable star classification using machine learning) project aims to detect periodic variable stars in the EROS-2 light curve database. In this paper, we present the first result of the classification of periodic variable stars in the EROS-2 LMC database. To classify these variables, we first built a training set by compiling known variables in the Large Magellanic Cloud area from the OGLE and MACHO surveys. We crossmatched these variables with the EROS-2 sources and extracted 22 variability features from 28âE 392 light curves of the corresponding EROS-2 sources. We then used the random forest method to classify the EROS-2 sources in the training set. We designed the model to separate not only δ Scuti stars, RR Lyraes, Cepheids, eclipsing binaries, and long-period variables, the superclasses, but also their subclasses, such as RRab, RRc, RRd, and RRe for RR Lyraes, and similarly for the other variable types. The model trained using only the superclasses shows 99% recall and precision, while the model trained on all subclasses shows 87% recall and precision. We applied the trained model to the entire EROS-2 LMC database, which contains about 29 million sources, and found 117234 periodic variable candidates. Out of these 117234 periodic variables, 55 285 have not been discovered by either OGLE or MACHO variability studies. This set comprises 1906 δ Scuti stars, 6607 RR Lyraes, 638 Cepheids, 178 Type II Cepheids, 34562 eclipsing binaries, and 11394 long-period variables.
AB - The EPOCH (EROS-2 periodic variable star classification using machine learning) project aims to detect periodic variable stars in the EROS-2 light curve database. In this paper, we present the first result of the classification of periodic variable stars in the EROS-2 LMC database. To classify these variables, we first built a training set by compiling known variables in the Large Magellanic Cloud area from the OGLE and MACHO surveys. We crossmatched these variables with the EROS-2 sources and extracted 22 variability features from 28âE 392 light curves of the corresponding EROS-2 sources. We then used the random forest method to classify the EROS-2 sources in the training set. We designed the model to separate not only δ Scuti stars, RR Lyraes, Cepheids, eclipsing binaries, and long-period variables, the superclasses, but also their subclasses, such as RRab, RRc, RRd, and RRe for RR Lyraes, and similarly for the other variable types. The model trained using only the superclasses shows 99% recall and precision, while the model trained on all subclasses shows 87% recall and precision. We applied the trained model to the entire EROS-2 LMC database, which contains about 29 million sources, and found 117234 periodic variable candidates. Out of these 117234 periodic variables, 55 285 have not been discovered by either OGLE or MACHO variability studies. This set comprises 1906 δ Scuti stars, 6607 RR Lyraes, 638 Cepheids, 178 Type II Cepheids, 34562 eclipsing binaries, and 11394 long-period variables.
KW - Magellanic Clouds
KW - Methods: data analysis
KW - Stars: variables: general
UR - http://www.scopus.com/inward/record.url?scp=84902076715&partnerID=8YFLogxK
U2 - 10.1051/0004-6361/201323252
DO - 10.1051/0004-6361/201323252
M3 - Article
SN - 0004-6361
VL - 566
JO - Astronomy and Astrophysics
JF - Astronomy and Astrophysics
M1 - A43
ER -