Abstract
Random projections are used to estimate parameters of interest in large scale data sets by projecting data into a lower dimensional space. Some parameters of interest between pairs of vectors are the Euclidean distance and the inner product, while parameters of interest for the whole data set could be its singular values or singular vectors. We show how we can borrow an idea from Monte Carlo integration by using control variates to reduce the variance of the estimates of Euclidean distances and inner products by storing marginal information of our data set. We demonstrate this variance reduction through experiments on synthetic data as well as the colon and kos datasets. We hope that this inspires future work which incorporates control variates in further random projection applications.
Original language | English |
---|---|
Title of host publication | Random projections with control variates |
Editors | De Marsico M., di Baja G.S., Fred A. |
Place of Publication | Online |
Publisher | SciTePress |
Pages | 138 - 147 |
ISBN (Print) | 978-989758222-6 |
DOIs | |
Publication status | Published - 2017 |
Event | ICPRAM 2017 - Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Porto Duration: 1 Jan 2017 → … https://link.springer.com/chapter/10.1007/978-3-319-93647-5_1 |
Conference
Conference | ICPRAM 2017 - Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods |
---|---|
Period | 1/01/17 → … |
Other | 24 to 26 of February 2017 |
Internet address |