Appendix
This appendix includes further information about studies, detailed investigations and tests.
First extended history simulations
New CORSIKA datasets are simulated and stored at:
/data/sim/IceCube/2023/generated/CORSIKA_EHISTORY/
The simulation is divided into 4 different energy ranges. Since the simulation is done out ourselves, the dataset numbers are not provided in iceprod.
30010: 600 GeV - 1 PeV
30011: 1 PeV - 100 PeV
30012: 100 PeV - 1 EeV
30013: 1 EeV - 50 EeV
The following settings are used:
CORSIKA version 77420
SIBYLL 2.3d
Icetray 1.5.1
5 components (p, He, N, Al, Fe)
Component norm: [10, 5, 3, 2, 1]
Zenith angle: 0 - 90 degrees
Polyplopia: True
Ecuts1: 273 GeV (hadron min energy)
Ecuts2: 273 GeV (muon min energy)
Ecuts3: \(10^{20}\) GeV (electron min energy)
Ecuts4: \(10^{20}\) GeV (photon min energy)
TrimShower: True
Atmosphere:
ratmo: 4
(April)Spectrum : -1 (increase high energy statistics)
This simulation is performed to test the extended history and prompt tagging software. Thus, the statistics are not yet sufficient.
The built CORSIKA software is stored at: /data/user/pgutjahr/software/CORSIKA/corsika-77420/bin/
and also available in the cvmfs:
/cvmfs/icecube.opensciencegrid.org/users/pgutjahr/software/CORSIKA/
Dataset exploration - Level 2
For the dataset exploration, the definition of a leading muon is defined as following: The leading muon is the muon with the highest energy in the muon bundle. This can be expressed in “Leadingness”. Leadingness \(L_{\mathrm{E}}\) describes the ratio of the highest energetic muon \(E_{\mathrm{max}}\) in a muon bundle to the total energy \(E_{\mathrm{tot}}\) of the muon bundle:
If there is no specific leadingness stated, the term leading muon refers to the muon with the highest energy in the muon bundle.
In the following, unweighted and weighted distributions of the simulated events are shown. In Fig. 388 and Fig. 389, the primary distributions are shown for each dataset.

Fig. 388 : The energy distribution of primary particles is shown for the four different simulation datasets.

Fig. 389 : The energy distribution of primary particles is shown for the four different simulation datasets, weighted to GlobalSplineFit5Comp (GSF).
In Fig. 390 and Fig. 391, the primary distributions are shown for each dataset, separated by the primary particle type.

Fig. 390 : The energy distribution of primary particles is shown for the four different simulation datasets, separated by the primary particle type.

Fig. 391 : The energy distribution of primary particles is shown for the four different simulation datasets, separated by the primary particle type, weighted to GlobalSplineFit5Comp (GSF).
In Fig. 392 and Fig. 393, the energy distribution of the leading muon is shown for each dataset. The leading muon is defined as the muon with the highest energy in the muon bundle. The shown energy corresponds to the energy at the detector entry.

Fig. 392 : The energy distribution of the leading muon is shown for the four different simulation datasets.

Fig. 393 : The energy distribution of the leading muon is shown for the four different simulation datasets, weighted to GlobalSplineFit5Comp (GSF).
In Fig. 394 and Fig. 395, the energy distribution of the muon bundle is shown for each dataset. The muon bundle is defined as the the sum of the energy of all muons entering the detector.

Fig. 394 : The energy distribution of the muon bundle is shown for the four different simulation datasets.

Fig. 395 : The energy distribution of the muon bundle is shown for the four different simulation datasets, weighted to GlobalSplineFit5Comp (GSF).
Estimation of the simulated statistics
The estimation of the simulated statistics needed for this analysis is not easy to determine. The statistics should be sufficient in the phase space of the analysis. This will probably be defined by the zenith angle of the incoming muon and the muon energy. Here, both the leading and bundle energy at detector entry and at the surface are considered. Furthermore, the systematic uncertainties in this phase space need to be known to create a simulation with statistical uncertainties lower than the systematic uncertainties. However, to get a first impression of the statistics simulated so far, Fig. 396 and Fig. 397 show the energy spectrum of the primary and leading muon energy. The simulated events are shown in blue, in orange the events are weighted to the expected statistics of 1 year of IceCube data using GlobalSplineFit5Comp (GSF) weighting. Here, the muon filter is applied and an energy cut of 200 TeV is applied to the muon bundle energy at the surface. For leading muon energies above 1 PeV, more muons are simulated than expected for 1 year. (The cuts applied here are not the final cuts for the analysis.)

Fig. 396 : Primary energy spectrum is shown to estimate the simulated statistics.

Fig. 397 : Leading muon energy spectrum is shown to estimate the simulated statistics.
General Simulation Questions
Before we have started the large-scale IceProd simulation, we have discussed the following questions:
- Does cutting off the electromagnetic shower component have any impact on our phase space (high energy muons)? This is done by Ecuts3 and Ecuts4.
10% effect possible on the muon energy spectrum, but no significant effect on the runtime and disk space
EM component will be turned on, which is
done by setting Ecuts3 and Ecuts4 to the same value as Ecuts2 and Ecuts1, thus 273 GeV
- Shall we stay with Icetray 1.5.1 which was used for the first test simulation?
Use latest version of Icetray to include any possible bug fixes and up-to-date software + latest ice model
- We haven’t oversampled our showers yet. Which factor for oversampling is usual?
At low energies, oversampling up to 10 is common, but this should be decreased at higher energies
We decided not to oversample the showers, since this results in a “fake statistics”
- How can we reduce the disk space?
For the final simulation, we will store step 0 and level 2 files. The extended I3MCTrees can be removed, since we can re-simulate them using PROPOSAL if needed
- How much disk storage do we need for the final simulation?
Roughly 50 TB
- Which seasons do we want to simulate? 4 seasons?
We want to simulate all 12 seasons as defined here.
This enables further studies of the seasonal variations in the future.
- Do we want to set the TrimShower option?
For large zenith angles, even high energy muon can be cut off. For the calculation of the effective area, we have to turn off trimshower
Thus, we don’t use the TrimShower option
Stochasticity
This section is based on datasets 30010-30013
A muon loses its energy in stochastic processes. Thus, a single muon deposits stochastic energy losses along a track. In a bundle of many muons, every muon has its own stochastic energy losses, which appear as a more continuous energy loss in the detector. Hence, if there are very stochastic energy losses detected inside the detector, there are probably only a few muons or a single muon (at low energies). If we extend this to high energies, the largest energy losses are caused by the most energetic muon in the bundle. In a bundle in which the muon energies are distributed more equally, also the losses appear more continuously. The idea is to search for events that deposit their energy more stochastically to select and/or to improve the energy reconstruction of muons with a high leadingness.
For the stochasticity calculation of the leading muon, the energy depositions and corresponding distances are needed. These can be determined by the function get_track_energy_depositions. The stochasticity is then calculated by the function compute_stochasticity. This function calculates the stochasticity of energy losses along a track by measuring the area between the cumulative distribution function (CDF) of the energy losses and the relative distances. It returns three values: the stochasticity (a float between 0 and 1, normalized by 0.5), the total area above the diagonal (a float), and the total area below the diagonal (a float). An extreme case of 1 means, the muon loses all it’s energy in one interaction, the extreme case of 0 means, the muon loses all it’s energy continuously.
As mentioned above, usually there is not only one muon, but several muons entering the detector. The energy losses of individual muons overlap. For this calculation, all energy losses of all muons with respect to their propagated distance are determined by the function get_bundle_energy_depositions. Here, it is assumed that all tracks travel on the same trajectory. The stochasticity is then calculated with the same function stated above. In this analysis, it is referred to as the bundle stochasticity.
Monte Carlo studies
In Fig. 398, the leadingness is shown as a function of the bundle stochasticity. If the muon event has a large stochasticity, this is caused by a high leadingness, but this is the case only for a small amount of events. Hence, a high leadingness does not necessary results to a large stochasticity.

Fig. 398 : The leadingness is shown as a function of the bundle stochasticity as a weighted distribution.
To get an idea of the correlation between the leading muon energy and the bundle stochasticity, in Fig. 399, the energy of the leading muon is shown as a function of the bundle stochasticity.

Fig. 399 : The energy of the leading muon is shown as a function of the bundle stochasticity.
In the following, the title of the plots shows a cut applied on the bundle energy in GeV. Hence, from left to right only high energy muons are selected.
In Fig. 400, the leadingness is shown as a function of the bundle stochasticity. High stochasticities lead to a large leadingness, but it removes the entire statistics.

Fig. 400 : The leadingness is shown as a function of the bundle stochasticity.
In Fig. 401, the leadingness is shown as a function of the largest energy loss. It results that considering only the largest energy loss does not indicate the leadingness.

Fig. 401 : The leadingness is shown as a function of the largest energy loss.
In Fig. 402, the energy of the leading muon is shown as a function of the largest energy loss. The largest energy loss is correlated with the energy of the leading muon. The larger the energy loss, the higher the energy of the leading muon.

Fig. 402 : The energy of the leading muon is shown as a function of the largest energy loss.
In Fig. 403, the energy spectrum of the leading muon is shown for different cuts on the stochasticity. The plot is divided into a prompt and conventional component. A cut on the stochasticity removes high energy muons. Due to the low statistics expected at high energies for 10 years, we do not apply any cuts on the stochasticity in this analysis.

Fig. 403 : The energy spectrum of the leading muon is shown for different cuts on the stochasticity.
Impact on the energy reconstruction
The impact of the stochasticity on the energy reconstruction is shown in the following plots.
The bundle energy reconstruction for different cuts on the stochasticity is shown in Fig. 404 and Fig. 405. A cut on the stochasticity does not improve the bundle energy reconstruction.

Fig. 404 : The bundle energy reconstruction for stochasticities below a certain cut is shown.

Fig. 405 : The bundle energy reconstruction for stochasticities above a certain cut is shown.
The leading muon energy reconstruction for different cuts on the stochasticity is shown in Fig. 406 and Fig. 407. A cut on the stochasticity does not improve the leading muon energy reconstruction.

Fig. 406 : The leading muon energy reconstruction for stochasticities below a certain cut is shown.

Fig. 407 : The leading muon energy reconstruction for stochasticities above a certain cut is shown.
In summary, a cut on the stochasticity does not improve the bundle or leading muon energy reconstruction.
Bundle radius
This section is based on datasets 30010-30013
Another idea to investigate muons with a high leadingness is to analyze the bundle radius. Depending on the fraction of the energy the most energetic muons carries, the projected radius of the entire bundle should differ. Here, different radii for the fractional amount of energy inside the projected area are studied. To quantify this, the perpendicular distance between the leading muon and the closest approach position to the center of the detector is calculated. Then, the closest approach point to the center is calculated for all muons in the bundle. With these positions, the distances between the leading muon and the other muons are calculated. Finally, the distances are weighted by the energy. For example, 100% means that the largest distance between a muon and the leading muon is considered. 90% means that the distance between the leading muon and the muon that accumulates 90 % of the bundle energy is considered. In the following, this distance is referred to as the bundle radius. The calculation can be performed with the function get_bundle_radius.
Monte Carlo studies
In Fig. 408, the bundle radius is shown for different bundle radius quantiles. These range from the energy inside the projected area from 50% to 100%. The same plot is shown for different scalings on the axes. The distributions peak between 5m and 20m, but also radii above 100m are observed.

Fig. 408 : The bundle radius is shown for different bundle radius quantiles.
In Fig. 409, the leadingness is shown as a function of the bundle radius for a bundle radius quantile of 100%. If the bundle radius is very small, the leadingness is high.

Fig. 409 : The leadingness is shown as a function of the bundle radius for a bundle radius quantile of 100% as a weighted distribution.
In the following Fig. 410, the leadingness is shown as a function of the bundle radius for different bundle energy cuts. If the bundle radius is high, the leadingness is low.






Fig. 410 : The leadingness is shown as a function of the bundle radius for different bundle energy cuts.
In Fig. 411, the muon bundle energy is shown as a function of the bundle radius for different bundle energy cuts. For a small amount of events, a large bundle radius indicates a low bundle energy.

Fig. 411 : The muon bundle energy is shown as a function of the bundle radius for different bundle energy cuts.
In Fig. 412, the leading muon energy is shown as a function of the bundle radius for different bundle energy cuts. For a small amount of events, a large bundle radius indicates a low leading muon energy.

Fig. 412 : The leading muon energy is shown as a function of the bundle radius for different bundle energy cuts.
In Fig. 413, the leading muon energy spectrum is shown for different cuts on the bundle radius. A bundle radius quantile of 99% is chosen as a cut parameter.

Fig. 413 : The leading muon energy spectrum is shown for different cuts on the bundle radius of the 99% quantile.
In Fig. 414, the leading muon energy spectrum is shown for different cuts on the bundle radius. A bundle radius quantile of 100% is chosen as a cut parameter.

Fig. 414 : The leading muon energy spectrum is shown for different cuts on the bundle radius of the 100% quantile.
Selecting events below a certain bundle radius does not increase the sensitivity to distinguish between prompt and conventional, but it removes statistics. Thus, there is no selection performed using the bundle radius.
Impact on the energy reconstruction
In Fig. 415, the impact of the bundle radius on the reconstruction of the leading muon energy is shown. A bundle radius quantile of 100% is chosen as a cut parameter.

Fig. 415 : The impact of the bundle radius of the 10% quantile on the reconstruction of the leading muon energy is shown.
There is no significant reconstruction improvement due to the application of a bundle radius cut. Instead, high energy events are rejected. Hence, no cut on the bundle radius is performed.
Network evaluation
This section is based on datasets 30010-30013
In the following, the evaluation of the networks is shown. Each figure contains two plots. The left plots show the evaluation of all events, the right plot shows an uncertainty cut applied on the estimated uncertainty by the network. The evaluation is performed on our own extended history simulation dataset (datasets 30010 - 30013). Each plot has the network prediction on the y-axis and the true value on the x-axis. In general, networks are trained with 3 or 9 inputs and a time window of 6ms or the internal DNN time window cleaning is applied to the SplitInIceDSTPulses. Furthermore, the CNN layers and nodes are varied. The runtime prediction is presented for the usage of a GPU. The preprocessing runtime represents the time needed to create the input features for the network based on the input pulses.
Bundle energy at surface
precut networks:

Fig. 416 : The bundle energy at the surface is shown for the network DeepLearningReco_precut_bundle_energy_3inputs_6ms_at_surface_01
. It uses 3 inputs
and a 6ms time window.

Fig. 417 : The bundle energy at the surface is shown for the network DeepLearningReco_precut_surface_bundle_energy_3inputs_6ms_01
. It uses 3 inputs
and a 6ms time window.

Fig. 418 : The bundle energy at the surface is shown for the network DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
. It uses 9 inputs and a 6ms time window.

Fig. 419 : The bundle energy at the surface is shown for the network DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_large_log_02
. It uses 9 inputs and the internal DNN time window cleaning.
Bundle energy at entry

Fig. 420 : The bundle energy at the entry is shown for the network DeepLearningReco_leading_bundle_energy_OC_inputs9_6ms_large_log_02
. It uses 9 inputs and a 6ms time window.

Fig. 421 : The bundle energy at the entry is shown for the network DeepLearningReco_leading_bundle_OC_inputs9_large_log_02
. It uses 9 inputs and the internal DNN time window cleaning.

Fig. 422 : The bundle energy at the entry is shown for the network DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
. It uses 9 inputs and a 6ms time window.

Fig. 423 : The bundle energy at the entry is shown for the network DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_large_log_02
. It uses 9 inputs and the internal DNN time window cleaning.
Leading muon energy at surface

Fig. 424 : The leading muon energy at the surface is shown for the network DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
. It uses 9 inputs and a 6ms time window.

Fig. 425 : The leading muon energy at the surface is shown for the network DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_large_log_02
. It uses 9 inputs and the internal DNN time window cleaning.
Leading muon energy at entry

Fig. 426 : The leading muon energy at the entry is shown for the network DeepLearningReco_leading_bundle_energy_OC_inputs9_6ms_large_log_02
. It uses 9 inputs and a 6ms time window.

Fig. 427 : The leading muon energy at the entry is shown for the network DeepLearningReco_leading_bundle_OC_inputs9_large_log_02
. It uses 9 inputs and the internal DNN time window cleaning.

Fig. 428 : The leading muon energy at the entry is shown for the network DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
. It uses 9 inputs and a 6ms time window.

Fig. 429 : The leading muon energy at the entry is shown for the network DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_large_log_02
. It uses 9 inputs and the internal DNN time window cleaning.
The reconstruction of the leading muon is a difficult task, since the leading muon is accompanied by a bundle of muons. Thus, the emitted cherenkov light of the leading muon is superimposed by the light of the other muons. In Fig. 13, the true muon energy fraction is shown as a function of the true bundle energy, at entry. There is a clear correlation between the true muon energy fraction and the true bundle energy. The distribution is smeared. In Fig. 14, the reconstructed muon energy fraction is shown as a function of the reconstructed bundle energy, at entry. This distribution is less smeared. Hence, the network seems to reconstruct the bundle energy and tries to refer to the leading muon energy.

Fig. 430 : The true muon energy fraction is shown as a function of the true bundle energy, at entry.

Fig. 431 : The reconstructed muon energy fraction is shown as a function of the true bundle energy, at entry.
Track geometry
Center time:

Fig. 432 : The center time is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 433 : The center time is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Entry time:

Fig. 434 : The entry time is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 435 : The entry time is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Center position x:

Fig. 436 : The center position x is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 437 : The center position x is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Center position y:

Fig. 438 : The center position y is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 439 : The center position y is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Center position z:

Fig. 440 : The center position z is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 441 : The center position z is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Entry position x:

Fig. 442 : The entry position x is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 443 : The entry position x is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Entry position y:

Fig. 444 : The entry position y is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 445 : The entry position y is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Entry position z:

Fig. 446 : The entry position z is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 447 : The entry position z is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Total track length:

Fig. 448 : The track length is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 449 : The track length is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Track length in detector:

Fig. 450 : The track length in the detector is shown for the network DeepLearningREco_track_geometry_9inputs_6ms_medium_01
. It uses 9 inputs and a 6ms time window.

Fig. 451 : The track length in the detector is shown for the network DeepLearningREco_track_geometry_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Direction
Zenith angle:

Fig. 452 : The zenith angle is shown for the network DeepLearningReco_direction_9inputs_6ms_medium_02_03
. It uses 9 inputs and a 6ms time window.

Fig. 453 : The zenith angle is shown for the network DeepLearningReco_direction_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Azimuth angle:

Fig. 454 : The azimuth angle is shown for the network DeepLearningReco_direction_9inputs_6ms_medium_02_03
. It uses 9 inputs and a 6ms time window.

Fig. 455 : The azimuth angle is shown for the network DeepLearningReco_direction_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Angular resolution:

Fig. 456 : The angular resolution is shown for the network DeepLearningReco_direction_9inputs_6ms_medium_02_03
. It uses 9 inputs and a 6ms time window.

Fig. 457 : The angular resolution is shown for the network DeepLearningReco_direction_9inputs_uncleaned_01
. It uses 9 inputs and the internal DNN time window cleaning.
Multiplicity
The multiplicity means the number of muons entering the detector in a bundle. So far, we do not use this information for the analysis, but we just wanted to check if it is possible to reconstruct the multiplicity.

Fig. 458 : The multiplicity is shown for the network DeepLearningReco_precut_bundle_energy_multi_OC_6ms_01
. It uses 3 inputs and a 6ms time window.

Fig. 459 : The multiplicity is shown for the network DeepLearningReco_precut_bundle_energy_multi_OC_6ms_02
. It uses 3 inputs and a 6ms time window.

Fig. 460 : The multiplicity is shown for the network DeepLearningReco_precut_bundle_energy_multi_OC_6ms_03
. It uses 3 inputs and a 6ms time window.

Fig. 461 : The multiplicity is shown for the network DeepLearningReco_precut_bundle_energy_multi_OC_6ms_04
. It uses 3 inputs and a 6ms time window.
Networks used for pseudo analysis
This section is based on datasets 30010-30013
The following networks are the networks used for the pseudo analysis. These networks are at an early stage as it can be seen in the performance in comparison to the plots presented above. Thus, this networks will not be used for the final analysis.
Angular reconstructions



Left side: only L2 muon filter, right side: L2 muon filter and cut on bundle energy: \(E > 10\,\mathrm{TeV}\)


Energy reconstructions: muon bundle


Energy reconstruction: leading muon


Z-vertex investigations (L4)
In the following, the center position z is investigated on level 4. Several zenith cuts, distance to center cuts, and energy cuts are applied. Even more plots can be found here.
Cos zenith cuts

Fig. 462 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 463 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 464 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 465 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Distance to center cuts

Fig. 466 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 467 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 468 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 469 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Energy cuts

Fig. 470 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 471 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 472 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 473 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Cos zenith 2D correlation

Fig. 474 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 475 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 476 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 477 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Increase neutrino impact by factor 200

Fig. 478 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 479 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 480 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 481 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Z-vertex investigations (L5)
In the following, the center position z is investigated on level 5. Several zenith cuts, distance to center cuts, and energy cuts are applied. Even more plots can be found here.
Cos zenith cuts

Fig. 482 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 483 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 484 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 485 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Distance to center cuts

Fig. 486 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 487 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 488 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 489 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Energy cuts

Fig. 490 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 491 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 492 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 493 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Cos zenith 2D correlation

Fig. 494 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 495 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 496 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 497 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
Increase neutrino impact by factor 200

Fig. 498 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 499 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 500 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.

Fig. 501 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
.
z-vertex re-weighting (L5)
Overall, the data-MC agreement is good. However, the z-vertex distribution shows a slope in the data. This slope is removed by re-weighting the z-vertex distribution. The re-weighting is done bin-wise to get a flat distribution in the data-MC agreement. This is shown in Figure Fig. 502.

Fig. 502 : Center position z reconstructed by DeepLearningReco_track_geometry_9inputs_6ms_medium_01
. The solid line represent a reweighted distribution of the z-vertex, the dashed line the original distribution. The reweighting is done bin-wise to get a flat distribution in the data-MC agreement.
By re-weighting the z-vertex distribution, the impact on other variables needs to be investigated. This is done in the following plots for the several energy reconstructions and the cos zenith angle distribution. In all distributions, the re-weighting has only a small impact on the energy reconstructions and the cos zenith distribution. Additionally, the small impact leads to an improvement in the data-MC agreement.

Fig. 503 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 504 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 505 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 506 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 507 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 508 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 509 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 510 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 511 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 512 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 513 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 514 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 515 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 516 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 517 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 518 : Energy reconstruction by DeepLearningReco_leading_bundle_surface_leading_bundle_energy_OC_inputs9_6ms_large_log_02
to check the impact of the z-vertex re-weighting.

Fig. 519 : Zenith reconstruction by DeepLearningReco_direction_9inputs_6ms_medium_02_03
to check the impact of the z-vertex re-weighting.

Fig. 520 : Zenith reconstruction by DeepLearningReco_direction_9inputs_6ms_medium_02_03
to check the impact of the z-vertex re-weighting.

Fig. 521 : Zenith reconstruction by DeepLearningReco_direction_9inputs_6ms_medium_02_03
to check the impact of the z-vertex re-weighting.

Fig. 522 : Zenith reconstruction by DeepLearningReco_direction_9inputs_6ms_medium_02_03
to check the impact of the z-vertex re-weighting.
Old unfolding plots
Proxy Variable
At first, we need a proxy that correlates with the true physical quantity. In Figure Fig. 523, the muon energy of the leading muon at entry is used as a proxy variable. The target is the energy of the leading muon at the surface.

Fig. 523 : Proxy variable for unfolding. Here, the muon energy of the leading muon at entry is used. The target is the leading muon energy at surface.
Unfold Event Rate - one proxy
In Figure Fig. 524, the unfolded event rate of the muon energy of the leading muon at the surface is shown. In Figure Fig. 525, the same unfolding is performed with more steps and walkers.

Fig. 524 : Unfolded event rate of the muon energy of the leading muon at the surface. The true distribution is shown in blue, the unfolded distribution in red.

Fig. 525 : Unfolded event rate of the muon energy of the leading muon at the surface. The true distribution is shown in blue, the unfolded distribution in red.
Unfold Event Rate - multiple proxies / tree binning
Instead of using just one proxy, multiple proxies can be used. Due to the high dimensionality, a decision tree is used to bin the data. This is done in a way, that the tree is used to solve our problem: estimating the energy of the leading muon at the surface. But then, we don’t use the estimation of the energy, but just the leafs for the binning. This way, we can use the tree to bin the data. Of course, this approach can also be used with only one proxy. This is presented first, to compare the results with the classical binning by hand.
In Figures Fig. 526 and Fig. 527, the decision tree regression for the leading muon energy at the surface is shown. The tree is able to reconstruct the target variable.

Fig. 526 : Decision tree regression for the leading muon energy at the surface. The tree is used to bin the data.

Fig. 527 : Decision tree regression for the leading muon energy at the surface in 2D. The tree is used to bin the data.
Using this binning and the same number of steps and walkers a in Figure Fig. 524, the unfolding of the event rate results as in Figure Fig. 528. The tree binning improves the unfolding result.

Fig. 528 : Unfolded event rate of the muon energy of the leading muon at the surface. The true distribution is shown in blue, the unfolded in red. Tree binning is used. As a proxy, only the leading muon energy at entry is used.
Following, the leading muon energy at entry and the zenith angle are used as proxies. The tree binning is shown in Figure Fig. 529. The result of the unfolded event rate is shown in Figure Fig. 530.

Fig. 529 : 2D binning of the data using a decision tree. The tree is used to bin the data.

Fig. 530 : Unfolded event rate of the muon energy of the leading muon at the surface. The true distribution is shown in blue, the unfolded in red. Tree binning is used. As proxies, the leading muon energy at entry and the zenith angle are used.
Unfold Muon Flux - one proxy
For the unfolding of the muon flux at surface, an effective area is needed. This area is basically the information, how many muons correspond to a certain event measured by the detector.
In Figure Fig. 531 the muon flux at surface is unfolded using the leading muon energy at entry as a proxy with classical binning.

Fig. 531 : Unfolded muon flux at surface. The true distribution is shown in blue, the unfolded in red.