TY - JOUR
T1 - Identification of Single Spectral Lines in Large Spectroscopic Surveys Using UMLAUT
T2 - An Unsupervised Machine-learning Algorithm Based on Unbiased Topology
AU - Baronchelli, I.
AU - Scarlata, C. M.
AU - Rodríguez-Muoz, L.
AU - Bonato, M.
AU - Morselli, L.
AU - Vaccari, M.
AU - Carraro, R.
AU - Barrufet, L.
AU - Henry, A.
AU - Mehta, V.
AU - Rodighiero, G.
AU - Baruffolo, A.
AU - Bagley, M.
AU - Battisti, A.
AU - Colbert, J.
AU - Dai, Y. S.
AU - De Pascale, M.
AU - Dickinson, H.
AU - Malkan, M.
AU - Mancini, C.
AU - Rafelski, M.
AU - Teplitz, H. I.
N1 - Publisher Copyright:
© 2021. The American Astronomical Society. All rights reserved..
PY - 2021/12
Y1 - 2021/12
N2 - The identification of an emission line is unambiguous when multiple spectral features are clearly visible in the same spectrum. However, in many cases, only one line is detected, making it difficult to correctly determine the redshift. We developed a freely available unsupervised machine-learning algorithm based on unbiased topology (UMLAUT) that can be used in a very wide variety of contexts, including the identification of single emission lines. To this purpose, the algorithm combines different sources of information, such as the apparent magnitude, size and color of the emitting source, and the equivalent width and wavelength of the detected line. In each specific case, the algorithm automatically identifies the most relevant ones (i.e., those able to minimize the dispersion associated with the output parameter). The outputs can be easily integrated into different algorithms, allowing us to combine supervised and unsupervised techniques and increasing the overall accuracy. We tested our software on WISP (WFC3 IR Spectroscopic Parallel) survey data. WISP represents one of the closest existing analogs to the near-IR spectroscopic surveys that are going to be performed by the future Euclid and Roman missions. These missions will investigate the large-scale structure of the universe by surveying a large portion of the extragalactic sky in near-IR slitless spectroscopy, detecting a relevant fraction of single emission lines. In our tests, UMLAUT correctly identifies real lines in 83.2% of the cases. The accuracy is slightly higher (84.4%) when combining our unsupervised approach with a supervised approach we previously developed.
AB - The identification of an emission line is unambiguous when multiple spectral features are clearly visible in the same spectrum. However, in many cases, only one line is detected, making it difficult to correctly determine the redshift. We developed a freely available unsupervised machine-learning algorithm based on unbiased topology (UMLAUT) that can be used in a very wide variety of contexts, including the identification of single emission lines. To this purpose, the algorithm combines different sources of information, such as the apparent magnitude, size and color of the emitting source, and the equivalent width and wavelength of the detected line. In each specific case, the algorithm automatically identifies the most relevant ones (i.e., those able to minimize the dispersion associated with the output parameter). The outputs can be easily integrated into different algorithms, allowing us to combine supervised and unsupervised techniques and increasing the overall accuracy. We tested our software on WISP (WFC3 IR Spectroscopic Parallel) survey data. WISP represents one of the closest existing analogs to the near-IR spectroscopic surveys that are going to be performed by the future Euclid and Roman missions. These missions will investigate the large-scale structure of the universe by surveying a large portion of the extragalactic sky in near-IR slitless spectroscopy, detecting a relevant fraction of single emission lines. In our tests, UMLAUT correctly identifies real lines in 83.2% of the cases. The accuracy is slightly higher (84.4%) when combining our unsupervised approach with a supervised approach we previously developed.
UR - http://www.scopus.com/inward/record.url?scp=85122561910&partnerID=8YFLogxK
U2 - 10.3847/1538-4365/ac250c
DO - 10.3847/1538-4365/ac250c
M3 - Article
SN - 0067-0049
VL - 257
JO - Astrophysical Journal, Supplement Series
JF - Astrophysical Journal, Supplement Series
IS - 2
M1 - 67
ER -