Deep learning exoplanets detection by combining real and synthetic data

Sara Cuéllar, Paulo Granados, Ernesto Fabregas, Michel Curé, Héctor Vargas, Sebastián Dormido-Canto, Gonzalo Farias

Research output: Contribution to journal › Article › peer-review

2 Scopus citations


Scientists and astronomers attach great importance to discovering new exoplanets, especially those in the habitable zone. To date, NASA has confirmed more than 4300 exoplanets using various discovery techniques, including planetary transits, and drawing on databases provided by space and ground-based telescopes. This article proposes a deep learning system for detecting planetary transits in Kepler Telescope light curves. The approach builds on related work from the literature and extends it with validation on real light curves. A CNN classification model is trained on a mixture of real and synthetic data. The model is then validated only with unseen real data. The best ratio of synthetic data is determined by means of an optimisation technique and a sensitivity analysis. The precision, accuracy and true positive rate of the best model obtained are determined and compared with those of similar works. The results demonstrate that using synthetic data in the training stage can improve transit detection performance on real light curves.
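The data-mixing and evaluation steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names `mix_training_set` and `binary_metrics`, the `synthetic_ratio` parameter, and the choice to keep every real sample are assumptions made for illustration, and the optimisation technique used in the article to select the ratio is not reproduced here.

```python
import random


def mix_training_set(real, synthetic, synthetic_ratio, seed=0):
    """Build a shuffled training list in which about `synthetic_ratio`
    of the samples are synthetic. All real samples are kept (an assumption)."""
    if not 0.0 <= synthetic_ratio < 1.0:
        raise ValueError("synthetic_ratio must be in [0, 1)")
    rng = random.Random(seed)
    # Solve n_syn / (n_real + n_syn) = ratio for n_syn.
    n_syn = round(synthetic_ratio * len(real) / (1.0 - synthetic_ratio))
    # Cannot draw more synthetic samples than are available.
    n_syn = min(n_syn, len(synthetic))
    mixed = list(real) + rng.sample(list(synthetic), n_syn)
    rng.shuffle(mixed)
    return mixed


def binary_metrics(y_true, y_pred):
    """Precision, accuracy and true positive rate for 0/1 labels,
    where 1 means 'transit present'."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
        "tpr": tp / (tp + fn) if (tp + fn) else 0.0,
    }
```

In a setup like the one the abstract describes, the mixed list would feed the training stage only, while `binary_metrics` would be computed on held-out real light curves, so that the reported scores reflect performance on real data.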

Original language: English
Article number: e0268199
Journal: PLoS ONE
Issue number: 5 May
State: Published - May 2022

