Appendix¶
This appendix includes further information about studies, detailed investigations and tests.
First extended history simulations¶
New CORSIKA datasets are simulated and stored at:
/data/sim/IceCube/2023/generated/CORSIKA_EHISTORY/
The simulation is divided into 4 different energy ranges. Since the simulation is done out ourselves, the dataset numbers are not provided in iceprod.
30010: 600 GeV - 1 PeV
30011: 1 PeV - 100 PeV
30012: 100 PeV - 1 EeV
30013: 1 EeV - 50 EeV
The following settings are used:
CORSIKA version 77420
SIBYLL 2.3d
Icetray 1.5.1
5 components (p, He, N, Al, Fe)
Component norm: [10, 5, 3, 2, 1]
Zenith angle: 0 - 90 degrees
Polyplopia: True
Ecuts1: 273 GeV (hadron min energy)
Ecuts2: 273 GeV (muon min energy)
Ecuts3: \(10^{20}\) GeV (electron min energy)
Ecuts4: \(10^{20}\) GeV (photon min energy)
TrimShower: True
Atmosphere:
ratmo: 4
(April)Spectrum : -1 (increase high energy statistics)
This simulation is performed to test the extended history and prompt tagging software. Thus, the statistics are not yet sufficient.
The built CORSIKA software is stored at: /data/user/pgutjahr/software/CORSIKA/corsika-77420/bin/
and also available in the cvmfs:
/cvmfs/icecube.opensciencegrid.org/users/pgutjahr/software/CORSIKA/
Dataset exploration - Level 2¶
For the dataset exploration, the definition of a leading muon is defined as following: The leading muon is the muon with the highest energy in the muon bundle. This can be expressed in “Leadingness”. Leadingness \(L_{\mathrm{E}}\) describes the ratio of the highest energetic muon \(E_{\mathrm{max}}\) in a muon bundle to the total energy \(E_{\mathrm{tot}}\) of the muon bundle:
If there is no specific leadingness stated, the term leading muon refers to the muon with the highest energy in the muon bundle.
In the following, unweighted and weighted distributions of the simulated events are shown. In Fig. 146 and Fig. 147, the primary distributions are shown for each dataset.
In Fig. 148 and Fig. 149, the primary distributions are shown for each dataset, separated by the primary particle type.
In Fig. 150 and Fig. 151, the energy distribution of the leading muon is shown for each dataset. The leading muon is defined as the muon with the highest energy in the muon bundle. The shown energy corresponds to the energy at the detector entry.
In Fig. 152 and Fig. 153, the energy distribution of the muon bundle is shown for each dataset. The muon bundle is defined as the the sum of the energy of all muons entering the detector.
Estimation of the simulated statistics¶
The estimation of the simulated statistics needed for this analysis is not easy to determine. The statistics should be sufficient in the phase space of the analysis. This will probably be defined by the zenith angle of the incoming muon and the muon energy. Here, both the leading and bundle energy at detector entry and at the surface are considered. Furthermore, the systematic uncertainties in this phase space need to be known to create a simulation with statistical uncertainties lower than the systematic uncertainties. However, to get a first impression of the statistics simulated so far, Fig. 154 and Fig. 155 show the energy spectrum of the primary and leading muon energy. The simulated events are shown in blue, in orange the events are weighted to the expected statistics of 1 year of IceCube data using GlobalSplineFit5Comp (GSF) weighting. Here, the muon filter is applied and an energy cut of 200 TeV is applied to the muon bundle energy at the surface. For leading muon energies above 1 PeV, more muons are simulated than expected for 1 year. (The cuts applied here are not the final cuts for the analysis.)
General Simulation Questions¶
Before we have started the large-scale IceProd simulation, we have discussed the following questions:
- Does cutting off the electromagnetic shower component have any impact on our phase space (high energy muons)? This is done by Ecuts3 and Ecuts4.
10% effect possible on the muon energy spectrum, but no significant effect on the runtime and disk space
EM component will be turned on, which is
done by setting Ecuts3 and Ecuts4 to the same value as Ecuts2 and Ecuts1, thus 273 GeV
- Shall we stay with Icetray 1.5.1 which was used for the first test simulation?
Use latest version of Icetray to include any possible bug fixes and up-to-date software + latest ice model
- We haven’t oversampled our showers yet. Which factor for oversampling is usual?
At low energies, oversampling up to 10 is common, but this should be decreased at higher energies
We decided not to oversample the showers, since this results in a “fake statistics”
- How can we reduce the disk space?
For the final simulation, we will store step 0 and level 2 files. The extended I3MCTrees can be removed, since we can re-simulate them using PROPOSAL if needed
- How much disk storage do we need for the final simulation?
Roughly 50 TB
- Which seasons do we want to simulate? 4 seasons?
We want to simulate all 12 seasons as defined here.
This enables further studies of the seasonal variations in the future.
- Do we want to set the TrimShower option?
For large zenith angles, even high energy muon can be cut off. For the calculation of the effective area, we have to turn off trimshower
Thus, we don’t use the TrimShower option
Stochasticity¶
This section is based on datasets 30010-30013
A muon loses its energy in stochastic processes. Thus, a single muon deposits stochastic energy losses along a track. In a bundle of many muons, every muon has its own stochastic energy losses, which appear as a more continuous energy loss in the detector. Hence, if there are very stochastic energy losses detected inside the detector, there are probably only a few muons or a single muon (at low energies). If we extend this to high energies, the largest energy losses are caused by the most energetic muon in the bundle. In a bundle in which the muon energies are distributed more equally, also the losses appear more continuously. The idea is to search for events that deposit their energy more stochastically to select and/or to improve the energy reconstruction of muons with a high leadingness.
For the stochasticity calculation of the leading muon, the energy depositions and corresponding distances are needed. These can be determined by the function get_track_energy_depositions. The stochasticity is then calculated by the function compute_stochasticity. This function calculates the stochasticity of energy losses along a track by measuring the area between the cumulative distribution function (CDF) of the energy losses and the relative distances. It returns three values: the stochasticity (a float between 0 and 1, normalized by 0.5), the total area above the diagonal (a float), and the total area below the diagonal (a float). An extreme case of 1 means, the muon loses all it’s energy in one interaction, the extreme case of 0 means, the muon loses all it’s energy continuously.
As mentioned above, usually there is not only one muon, but several muons entering the detector. The energy losses of individual muons overlap. For this calculation, all energy losses of all muons with respect to their propagated distance are determined by the function get_bundle_energy_depositions. Here, it is assumed that all tracks travel on the same trajectory. The stochasticity is then calculated with the same function stated above. In this analysis, it is referred to as the bundle stochasticity.
Monte Carlo studies¶
In Fig. 156, the leadingness is shown as a function of the bundle stochasticity. If the muon event has a large stochasticity, this is caused by a high leadingness, but this is the case only for a small amount of events. Hence, a high leadingness does not necessary results to a large stochasticity.
To get an idea of the correlation between the leading muon energy and the bundle stochasticity, in Fig. 157, the energy of the leading muon is shown as a function of the bundle stochasticity.
In the following, the title of the plots shows a cut applied on the bundle energy in GeV. Hence, from left to right only high energy muons are selected.
In Fig. 158, the leadingness is shown as a function of the bundle stochasticity. High stochasticities lead to a large leadingness, but it removes the entire statistics.
In Fig. 159, the leadingness is shown as a function of the largest energy loss. It results that considering only the largest energy loss does not indicate the leadingness.
In Fig. 160, the energy of the leading muon is shown as a function of the largest energy loss. The largest energy loss is correlated with the energy of the leading muon. The larger the energy loss, the higher the energy of the leading muon.
In Fig. 161, the energy spectrum of the leading muon is shown for different cuts on the stochasticity. The plot is divided into a prompt and conventional component. A cut on the stochasticity removes high energy muons. Due to the low statistics expected at high energies for 10 years, we do not apply any cuts on the stochasticity in this analysis.
Impact on the energy reconstruction¶
The impact of the stochasticity on the energy reconstruction is shown in the following plots.
The bundle energy reconstruction for different cuts on the stochasticity is shown in Fig. 162 and Fig. 163. A cut on the stochasticity does not improve the bundle energy reconstruction.
The leading muon energy reconstruction for different cuts on the stochasticity is shown in Fig. 164 and Fig. 165. A cut on the stochasticity does not improve the leading muon energy reconstruction.
In summary, a cut on the stochasticity does not improve the bundle or leading muon energy reconstruction.
Bundle radius¶
This section is based on datasets 30010-30013
Another idea to investigate muons with a high leadingness is to analyze the bundle radius. Depending on the fraction of the energy the most energetic muons carries, the projected radius of the entire bundle should differ. Here, different radii for the fractional amount of energy inside the projected area are studied. To quantify this, the perpendicular distance between the leading muon and the closest approach position to the center of the detector is calculated. Then, the closest approach point to the center is calculated for all muons in the bundle. With these positions, the distances between the leading muon and the other muons are calculated. Finally, the distances are weighted by the energy. For example, 100% means that the largest distance between a muon and the leading muon is considered. 90% means that the distance between the leading muon and the muon that accumulates 90 % of the bundle energy is considered. In the following, this distance is referred to as the bundle radius. The calculation can be performed with the function get_bundle_radius.
Monte Carlo studies¶
In Fig. 166, the bundle radius is shown for different bundle radius quantiles. These range from the energy inside the projected area from 50% to 100%. The same plot is shown for different scalings on the axes. The distributions peak between 5m and 20m, but also radii above 100m are observed.
In Fig. 167, the leadingness is shown as a function of the bundle radius for a bundle radius quantile of 100%. If the bundle radius is very small, the leadingness is high.
In the following Fig. 168, the leadingness is shown as a function of the bundle radius for different bundle energy cuts. If the bundle radius is high, the leadingness is low.
In Fig. 169, the muon bundle energy is shown as a function of the bundle radius for different bundle energy cuts. For a small amount of events, a large bundle radius indicates a low bundle energy.
In Fig. 170, the leading muon energy is shown as a function of the bundle radius for different bundle energy cuts. For a small amount of events, a large bundle radius indicates a low leading muon energy.
In Fig. 171, the leading muon energy spectrum is shown for different cuts on the bundle radius. A bundle radius quantile of 99% is chosen as a cut parameter.
In Fig. 172, the leading muon energy spectrum is shown for different cuts on the bundle radius. A bundle radius quantile of 100% is chosen as a cut parameter.
Selecting events below a certain bundle radius does not increase the sensitivity to distinguish between prompt and conventional, but it removes statistics. Thus, there is no selection performed using the bundle radius.
Impact on the energy reconstruction¶
In Fig. 173, the impact of the bundle radius on the reconstruction of the leading muon energy is shown. A bundle radius quantile of 100% is chosen as a cut parameter.
There is no significant reconstruction improvement due to the application of a bundle radius cut. Instead, high energy events are rejected. Hence, no cut on the bundle radius is performed.
Network evaluation¶
This section is based on datasets 30010-30013
In the following, the evaluation of the networks is shown. Each figure contains two plots. The left plots show the evaluation of all events, the right plot shows an uncertainty cut applied on the estimated uncertainty by the network. The evaluation is performed on our own extended history simulation dataset (datasets 30010 - 30013). Each plot has the network prediction on the y-axis and the true value on the x-axis. In general, networks are trained with 3 or 9 inputs and a time window of 6ms or the internal DNN time window cleaning is applied to the SplitInIceDSTPulses. Furthermore, the CNN layers and nodes are varied. The runtime prediction is presented for the usage of a GPU. The preprocessing runtime represents the time needed to create the input features for the network based on the input pulses.
Bundle energy at surface¶
precut networks:
Bundle energy at entry¶
Leading muon energy at surface¶
Leading muon energy at entry¶
The reconstruction of the leading muon is a difficult task, since the leading muon is accompanied by a bundle of muons. Thus, the emitted cherenkov light of the leading muon is superimposed by the light of the other muons. In Fig. 13, the true muon energy fraction is shown as a function of the true bundle energy, at entry. There is a clear correlation between the true muon energy fraction and the true bundle energy. The distribution is smeared. In Fig. 14, the reconstructed muon energy fraction is shown as a function of the reconstructed bundle energy, at entry. This distribution is less smeared. Hence, the network seems to reconstruct the bundle energy and tries to refer to the leading muon energy.
Track geometry¶
Center time:
Entry time:
Center position x:
Center position y:
Center position z:
Entry position x:
Entry position y:
Entry position z:
Total track length:
Track length in detector:
Direction¶
Zenith angle:
Azimuth angle:
Angular resolution:
Multiplicity¶
The multiplicity means the number of muons entering the detector in a bundle. So far, we do not use this information for the analysis, but we just wanted to check if it is possible to reconstruct the multiplicity.
Networks used for pseudo analysis¶
This section is based on datasets 30010-30013
The following networks are the networks used for the pseudo analysis. These networks are at an early stage as it can be seen in the performance in comparison to the plots presented above. Thus, this networks will not be used for the final analysis.
Angular reconstructions¶
Left side: only L2 muon filter, right side: L2 muon filter and cut on bundle energy: \(E > 10\,\mathrm{TeV}\)