Synthetic EPIC-MOS1 Camera Flare events
Synthetic Solar-Flare dataset generated with a VAE generative model for EPIC-MOS1 camera of XMM-Newton telescope.
ESPAI
This website introduces the ESPAI project — what it is, what it does, and why it matters. You’ll also find access to the project’s main outputs, from datasets and AI models to source code and publications. Explore our resources to dive into cutting-edge tools for classification and generation of solar-flare datasets.
Everything is open and accessible. See the Outputs section below to learn more, or open the ESPAI Resource Hub to access the project resources. If you’d like to download datasets or code, we’ll just ask you to fill out a short form with your affiliation and intended use.
Learn in detail about ESPAI, its scientific goals, and the motivation behind using Artificial Intelligence (AI) to remove solar-flare contaminations from XMM-Newton observations. Explore ESPAI’s scientific goals and the motivation behind using AI to mitigate solar- flare contamination from XMM-Newton observations. Understand how the project combines astrophysical models with advanced machine learning techniques to recover data that would otherwise be discarded.
Learn MoreBrowse a growing collection of datasets, source code, plots and documentation. These resources are designed to support both the astrophysics and AI communities, offering practical tools for analysis, experimentation, and reproducible science.
ExploreFind clear and transparent information about licensing conditions, recommended citation formats, and guidelines for the responsible reuse of materials. These terms help ensure that ESPAI's outputs are properly credited and can be applied ethically in future research.
View TermsThe ESPAI project leverages advanced machine-learning methods to improve the signal purity of XMM-Newton X-ray observations, focusing specifically on data from the EPIC cameras. Its primary objective is to develop an autonomous framework that can identify and mitigate solar-flare contamination, improving data quality while reducing the amount of scientifically useful data that is discarded during filtering.
To tackle this, ESPAI builds a binary, photon-level classifier that labels each detected event as either “flare” or “signal.” The classifier’s output is then used to isolate high-confidence solar-flare photons, which serve as the seed for a generative AI model that produces a large sample of “pure” solar-flare events. This synthetic but physically grounded dataset is designed to support detailed studies of this key source of background in X-ray observations, while also enabling more targeted mitigation in EPIC data processing.
The project delivers openly accessible tools: curated datasets, generative frameworks and source code repositories, all provided under transparent licensing terms. In this way, ESPAI contributes to reproducible science and accelerates discovery for both the astrophysics and AI communities.
Development of a generative AI framework that produces high-purity synthetic solar- flare events from classifier-selected flare photons, enabling detailed flare studies, benchmarking, and downstream model development.
Photon-level solar-flare vs signal classification to support more targeted filtering and automated EPIC data processing at scale.
Consistency checks of the predicted results on the Good Time Intervals (GTIs) using energy spectrum analysis. Performance reporting focuses on statistical validation using the Kolmogorov-Smirnov (K-S) test to ensure the classified signal distributions match physical expectations.
A unified pipeline links (i) data pre-processing using XMM-Scientific Analysis System (SAS), including cheesing task to remove astrophysical sources, (ii) robust classification of solar-flares using MLPs with data augmentation, and (iii) controlled synthetic data generation via VAEs. The workflow supports observation-based inference constraints and scales to handle the large Blank Sky dataset.
ESPAI is conceived and developed under the leadership of Koexai S.r.l., which coordinates all project activities and ensures their delivery. The project benefits from the valuable scientific input of Prof. Pasquale Mazzotta at the University of Rome Tor Vergata, who contribute expertise in Observational Cosmology and Galaxy Clusters and act as scientific advisor, nominated by INAF (Spoke 3 leader).
ESPAI has been selected and funded under the ICSC – Centro Nazionale di Ricerca in HPC, Big Data e Quantum Computing, as part of the PNRR, and supported by the European Union – NextGenerationEU.
Explore ESPAI’s outputs: datasets, source code, plots and documentation. For downloads and full details, open the Resource Hub.
Synthetic Solar-Flare dataset generated with a VAE generative model for EPIC-MOS1 camera of XMM-Newton telescope.
Synthetic Solar-Flare dataset generated with a VAE generative model for EPIC-MOS2 camera of XMM-Newton telescope.
Synthetic Solar-Flare dataset generated with a VAE generative model for EPIC-pn camera of XMM-Newton telescope.
Large files & licensing: full releases are provided for non-commercial research on a short request.
For resource access requests, collaboration inquiries, or technical questions:
info@koexai.comKoexai S.r.l. — Project lead and coordinating body.
Pasquale Mazzotta (University of Rome Tor Vergata, Istituto Nazionale di Fisica Nucleare) — Scientific advisor and contributor, nominated by INAF (Spoke 3 leader).
Federico De Luca (University of Rome Tor Vergata, Istituto Nazionale di Fisica Nucleare) — Post Doc at University of Rome Tor Vergata.