[Cosmological LambdaCDM Simulations for Inference with Machine Learning and Bayesian statistics]
The project aims to build a Simulation-based Inference pipeline that predicts the four cosmological parameters {$\Omega_{\Lambda}$,
A selection of some of the plots from the project can be found below.
Projection of the mean density of neutral hydrogen at 
Gas and Dark Matter for the most massive halo in 10 of the boxes from the CLIMB suite at z = 0. The gray circles represent the regions, where the mean density is larger than 500 times the
critical density of the universe.

Halo mass function for all 50 simulated boxes of the CLIMB suite. The shape of all lines have
a similar slope, however they show a shift in the number of halos for different cosmologies. Three reference
lines from the TNG50, TNG100 and TNG300 simulations (Nelson et al. 2021) are shown as black lines.

Comparison of the CLIMB suits to other works with varying cosmologies. All plots shown here are made from CLIMB high.

Example gallery of single spectra created from the CLIMB high simulations using TEMET. All spectra shown were made from the same box and different lines of sight.

To increase the amount of information per input spectrum and allow the network to use longer sections of real spectra in the inference mode, the spectra are augmented. Different short spectra (upper panel) are randomly shuffled and
patched together to make one long spectrum (lower panel).

Comparison of different noise models. In the upper plot no noise is added to the synthetic
spectrum. In the middle plot a constant random Gaussian noise with Signal-to-noise (SNR) 5 is added. In
the lower plot the mean SNR ratio per pixel of the SDSS catalog spectra with median SNR > 5 is assumed.

As a reference, two observed spectra from the SDSS DR9 Lyman alpha catalog.

Flow chart of the Transformer Network used in this work. The final model has about 4 million trainable parameters.

From the 500,00 available spectra 70% is used as a training set, 15% for a validation set and 15% for a test set. The Transformer is trained for 6 Epochs on the trainind dataset. An example training curve can be seen here.

To judge the performence of the Transformer, it is first applied to spectra from a reference box. This box has the cosmological parameters found by the Planck 2015 study and was never seen during training. 
Finally the Transformer is also applied to observed spectra from the SDSS survey. The predictions for 