from __future__ import division, print_function
import numpy, pylab, math

%matplotlib inline

Now that you have learned how to use dataio-shovel and steamshovel, you are ready to learn to manipulate information within an i3 file. You will use the Python skills you learned to write simple scripts that access information, select events, add objects to the frame and make simple plots.

The first step is to import the IceCube software:

import icecube

We now have access to all the projects that we built before. The first project we will learn about is dataio.

from icecube import dataio

The dataio project allows us to access i3 files. We will learn more about the dataio.I3File class using Python's built-in help() function:

help(dataio.I3File)

Help on class I3File in module icecube.dataio:

class I3File(Boost.Python.instance)
 |  Simple reader/writer for .i3 files, with optional on-the-fly gzipping
 |  
 |  Method resolution order:
 |      I3File
 |      Boost.Python.instance
 |      __builtin__.object
 |  
 |  Methods defined here:
 |  
 |  __init__(...)
 |      __init__( (object)arg1) -> None :
 |          Create an I3File object that is not attached to any on-disk file
 |      
 |          C++ signature :
 |              void __init__(_object*)
 |      
 |      __init__( (object)arg1, (I3File)arg2) -> None :
 |          Copy constructor
 |      
 |          C++ signature :
 |              void __init__(_object*,I3::dataio::python::I3SequentialFile)
 |      
 |      __init__( (object)arg1, (str)filename (may be .i3 or .i3.gz)) -> None :
 |          Create and open and I3File object for reading
 |      
 |          C++ signature :
 |              void __init__(_object*,std::string)
 |      
 |      __init__( (object)arg1, (str)filename (may be .i3 or .i3.gz) [, (Mode)Mode='Reading']) -> None :
 |          Create and open an I3File object, specifying the mode
 |      
 |          C++ signature :
 |              void __init__(_object*,std::string [,I3::dataio::python::I3SequentialFile::Mode='Reading'])
 |      
 |      __init__( (object)arg1, (str)filename, (str)mode (r, w, or x)) -> None :
 |          Create and open and I3File object, specifiying the mode
 |      
 |          C++ signature :
 |              void __init__(_object*,std::string,char)
 |  
 |  __iter__(...)
 |      __iter__( (I3File)arg1) -> I3File :
 |      
 |          C++ signature :
 |              I3::dataio::python::I3SequentialFile __iter__(I3::dataio::python::I3SequentialFile {lvalue})
 |  
 |  __reduce__ = <unnamed Boost.Python function>(...)
 |  
 |  close(...)
 |      close( (I3File)arg1) -> None :
 |          Close the file
 |      
 |          C++ signature :
 |              void close(I3::dataio::python::I3SequentialFile {lvalue})
 |  
 |  more(...)
 |      more( (I3File)arg1) -> bool :
 |          Return True if there are more frames in the .i3 file
 |      
 |          C++ signature :
 |              bool more(I3::dataio::python::I3SequentialFile {lvalue})
 |  
 |  next(...)
 |      next( (I3File)arg1) -> I3Frame :
 |          Return the next frame if one is available, else throw StopIteration
 |      
 |          C++ signature :
 |              boost::shared_ptr<I3Frame> next(I3::dataio::python::I3SequentialFile {lvalue})
 |  
 |  open_file(...)
 |      open_file( (I3File)arg1, (str)filename (may be .i3 or .i3.gz) [, (Mode)Mode='Reading']) -> int :
 |          Open a .i3 file
 |      
 |          C++ signature :
 |              int open_file(I3::dataio::python::I3SequentialFile {lvalue},std::string [,I3::dataio::python::I3SequentialFile::Mode='Reading'])
 |      
 |      open_file( (I3File)arg1, (str)arg2) -> int :
 |      
 |          C++ signature :
 |              int open_file(I3::dataio::python::I3SequentialFile {lvalue},std::string)
 |      
 |      open_file( (I3File)arg1, (str)arg2, (str)arg3) -> int :
 |      
 |          C++ signature :
 |              int open_file(I3::dataio::python::I3SequentialFile {lvalue},std::string,char)
 |  
 |  pop_daq(...)
 |      pop_daq( (I3File)arg1) -> I3Frame :
 |          Return the next DAQ frame from the file, skipping frames on other streams.
 |      
 |          C++ signature :
 |              boost::shared_ptr<I3Frame> pop_daq(I3::dataio::python::I3SequentialFile {lvalue})
 |  
 |  pop_frame(...)
 |      pop_frame( (I3File)arg1) -> I3Frame :
 |          Return the next frame on any stream from the file.
 |      
 |          C++ signature :
 |              boost::shared_ptr<I3Frame> pop_frame(I3::dataio::python::I3SequentialFile {lvalue})
 |      
 |      pop_frame( (I3File)arg1, (Stream)Stream) -> I3Frame :
 |          Return the next frame on stream 'Stream' from the file.
 |      
 |          C++ signature :
 |              boost::shared_ptr<I3Frame> pop_frame(I3::dataio::python::I3SequentialFile {lvalue},I3Frame::Stream)
 |  
 |  pop_physics(...)
 |      pop_physics( (I3File)arg1) -> I3Frame :
 |          Return the next physics frame from the file, skipping frames on other streams.
 |      
 |          C++ signature :
 |              boost::shared_ptr<I3Frame> pop_physics(I3::dataio::python::I3SequentialFile {lvalue})
 |  
 |  push(...)
 |      push( (I3File)arg1, (I3Frame)frame) -> None :
 |          Push a frame to the file (if file opened with Mode.Writing)
 |      
 |          C++ signature :
 |              void push(I3::dataio::python::I3SequentialFile {lvalue},boost::shared_ptr<I3Frame>)
 |  
 |  rewind(...)
 |      rewind( (I3File)arg1) -> None :
 |          Rewind to beginning of file and reopen
 |      
 |          C++ signature :
 |              void rewind(I3::dataio::python::I3SequentialFile {lvalue})
 |  
 |  ----------------------------------------------------------------------
 |  Data and other attributes defined here:
 |  
 |  Closed = icecube.dataio.Mode.Closed
 |  
 |  Mode = <class 'icecube.dataio.Mode'>
 |  
 |  
 |  Reading = icecube.dataio.Mode.Reading
 |  
 |  Writing = icecube.dataio.Mode.Writing
 |  
 |  __instance_size__ = 696
 |  
 |  ----------------------------------------------------------------------
 |  Data descriptors inherited from Boost.Python.instance:
 |  
 |  __dict__
 |  
 |  __weakref__
 |  
 |  ----------------------------------------------------------------------
 |  Data and other attributes inherited from Boost.Python.instance:
 |  
 |  __new__ = <built-in method __new__ of Boost.Python.class object>
 |      T.__new__(S, ...) -> a new object with type S, a subtype of T

We see that there are five ways to initialize an I3File. In order to access the files that we looked at earlier, we will use the third initializer. Since we have two files, a GCD file and a data file, we will open them both separately:

geofile = dataio.I3File('/home/mamday/IceCube/data/IC86/SantaIgel/GeoCalibDetectorStatus_IC86.55697_corrected_V2.i3')

infile = dataio.I3File('/home/mamday/IceCube/data/IC86/SantaIgel/JP/SebCutL8-JPVuvNuMu-JPVuvNuMu-SebCut-genie_ic.1450.000499.L7.i3')

If we want to be able to write frames into a new i3 file, we need to use either the fourth or fifth initializer. We can create a file using the fourth initializer called 'testfile.i3':

testfile = dataio.I3File('testfile.i3',dataio.I3File.Writing)

But this can be achieved with a little less typing using the fifth initializer:

outfile = dataio.I3File('newfile.i3','w')

So now we have our two input i3 files and our output i3 file. In order to access the information in the files, we have to access the frames inside. There are several methods for accessing the frame objects. First we can just pop the first frame in the file using pop_frame():

frame = infile.pop_frame()

Great. Now we have accessed our first frame! If we want to know what is contained in our frame, we can use keys():

frame.keys()

['FilterMask',
 'I3EventHeader',
 'DrivingTime',
 'I3MCWeightDict',
 'I3TriggerHierarchy',
 'MCPESeriesMap_withNoise',
 'OfflinePulses',
 'I3MCTree',
 'InIceRawData']

This is the first frame in the file. Since it is a data file, the first frame is a DAQ frame.

What happens if we use pop_frame again? Let's try:

frame = infile.pop_frame()

frame.keys()

['iSPEFitSingle_tt0FitParams',
 'MPEFit_ContainedFitParams',
 'MPEFit_ContainedFit_rusage',
 'iSPEFit32_tt0',
 'LineFit_DCParams',
 'MPEFit',
 'OfflinePulses_sRT',
 'HuberFit_tt0',
 'MPEFit_Finite',
 'FilterMask',
 'MPEFit_FiniteCuts',
 'DipoleFit_DC',
 'MPEFitFitParams',
 'TT0_delayCleaned',
 'iSPEFitSingle_tt0',
 'NoiseEngine_bool',
 'MPEFit_Contained',
 'MPEFit_ContainedFit_StartStopParams',
 'ToI_DCParams',
 'MaskedOfflinePulses',
 'DrivingTime',
 'I3EventHeader',
 'bayes',
 'ToI_DC',
 'bayesFitParams',
 'MPEFitCuts',
 'iLineFit_tt0Params',
 'ImpLineFit_tt0_rusage',
 'SPEFit2_DC',
 'OfflinePulses_TW',
 'I3MCWeightDict',
 'DipoleFit_DCParams',
 'iLineFit_tt0',
 'LineFit_DC',
 'OfflinePulses_cRT',
 'TT0_debiased',
 'I3TriggerHierarchy',
 'EarlyLaunches',
 'MCPESeriesMap_withNoise',
 'VertexReco_rusage',
 'OfflinePulses',
 'MPEFit_FiniteStopFitParams',
 'MPEFit_FiniteStop',
 'CascadeLast_DC',
 'I3MCTree',
 'iSPEFit32_tt0FitParams',
 'TT0',
 'InIceRawData',
 'TT0_debiased_1st']

Now we have a new frame. Because an I3File is read sequentially, popping a frame moves past it, and we can no longer access the previous frame of the file. However, to return to the first frame, we can use the rewind() method:

infile.rewind()

Now, if we pop the frame again, we will get the first frame in the file:

frame = infile.pop_frame()

frame.keys()

['FilterMask',
 'I3EventHeader',
 'DrivingTime',
 'I3MCWeightDict',
 'I3TriggerHierarchy',
 'MCPESeriesMap_withNoise',
 'OfflinePulses',
 'I3MCTree',
 'InIceRawData']
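
The pop and rewind behaviour seen so far can be mimicked in a few lines of plain Python. The sketch below is not the dataio implementation: `ToyFrameFile` and its list of strings are hypothetical stand-ins that only borrow the method names `pop_frame`, `more` and `rewind` from the help text, so that the forward-only reading pattern can be tried without an i3 file. How the real class behaves at the end of a file may differ from the `RuntimeError` used here.

```python
from __future__ import print_function


class ToyFrameFile(object):
    """Hypothetical stand-in for a forward-only frame reader (not dataio.I3File)."""

    def __init__(self, frames):
        self._frames = list(frames)
        self._pos = 0  # index of the next frame to hand out

    def more(self):
        # True while there are unread frames left
        return self._pos < len(self._frames)

    def pop_frame(self):
        # Hand out the next frame and advance; earlier frames are not reachable
        if not self.more():
            raise RuntimeError("no more frames")
        frame = self._frames[self._pos]
        self._pos += 1
        return frame

    def rewind(self):
        # Start again from the first frame
        self._pos = 0


toy = ToyFrameFile(["Q-frame", "P-frame", "Q-frame 2"])
first = toy.pop_frame()
second = toy.pop_frame()
toy.rewind()
again = toy.pop_frame()   # same frame as `first`
print(first, second, again)
```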

There are a few other ways of getting frames, depending on what types of frame you are interested in. If you only want Physics frames, you can use pop_physics():

frame = infile.pop_physics()

Or if you only want DAQ frames you can use pop_daq():

frame = infile.pop_daq()

For now we want to look at the first frame in the file, so we will rewind again:

infile.rewind()

Generally, in an analysis, you will want to select some frames in a file and reject others. Predominantly you will do this for a large number of events and you will want to use the IceTray module framework. However, if you have a small number of events, you can keep or reject frames using the dataio methods. Previously we learned how to create an output file. Now we will add a frame to the output file using the push method:

outfile.push(frame)

Our output file now contains a single frame. A simple use for this method would be to write out the first 100 frames of a file into a new i3 file:

for i in range(100):
    frame = infile.pop_frame()
    outfile.push(frame)
outfile.close()
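
The loop above assumes that the input holds at least 100 frames. A minimal sketch of a more defensive variant checks more() before every pop. Only more, pop_frame and push, all listed in the help text, are called on the file objects; the helper name `copy_frames` and the in-memory `ListFile` used to exercise it are hypothetical and not part of dataio.

```python
from __future__ import print_function


def copy_frames(src, dst, n_max):
    """Copy up to n_max frames from src to dst; return how many were copied."""
    n_copied = 0
    while n_copied < n_max and src.more():
        dst.push(src.pop_frame())
        n_copied += 1
    return n_copied


class ListFile(object):
    """Hypothetical in-memory stand-in offering more/pop_frame/push."""

    def __init__(self, frames=()):
        self.frames = list(frames)

    def more(self):
        return len(self.frames) > 0

    def pop_frame(self):
        return self.frames.pop(0)

    def push(self, frame):
        self.frames.append(frame)


src = ListFile(["frame%d" % i for i in range(5)])
dst = ListFile()
n_short = copy_frames(src, dst, 100)   # input is shorter than the request
print(n_short, dst.frames)
```

With the real files this would presumably be called as copy_frames(infile, outfile, 100), followed by outfile.close().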

Now if we look at our file in dataio-shovel, we see that it contains the frame we pushed earlier followed by the first 100 frames of the input file. Before we continue, we will rewind the input file and get the first frame again:

infile.rewind()
frame = infile.pop_frame()
frame.keys()

['FilterMask',
 'I3EventHeader',
 'DrivingTime',
 'I3MCWeightDict',
 'I3TriggerHierarchy',
 'MCPESeriesMap_withNoise',
 'OfflinePulses',
 'I3MCTree',
 'InIceRawData']

Now, if we want to actually be able to look at anything in the frame, we need to get an important new project, dataclasses:

from icecube import dataclasses

The dataclasses project contains the majority of the objects that you will see in most i3 files. Most analyses also depend on other projects in the IceCube software package as well, which will be covered further in the section on icetray and i3 modules later in the week. For now we are going to use our new project to get some information about our frame. The first frame object we will look at is the 'I3EventHeader'. Using the Get method:

evt_head = frame.Get('I3EventHeader')

We can also get access to this object by using frame["ObjectName"]

evt_head = frame["I3EventHeader"]

Now we have access to the I3EventHeader object. Generally the most useful pieces of information in the I3EventHeader are the Run ID and the Event ID:

evt_id = evt_head.event_id
run_id = evt_head.run_id
print(run_id, evt_id)

1 4255

Now we want to select events based on their Event ID. First we will create a new output file, eventfile.i3

outfile1 = dataio.I3File('eventfile.i3','w')

We would like to loop over all the frames, but we don't necessarily know how many frames are in the file. In order to do that, we can use the I3File.more method, which is True if there are more frames in the file and False otherwise:

while(infile.more()):
    frame = infile.pop_frame()
    evt_head = frame["I3EventHeader"]
    evt_id = evt_head.event_id
    if(evt_id<100):
        print(evt_id)

Exercise: Fill eventfile.i3 with events whose event_id ends with the number 5

Example Solution:

infile.rewind()
while(infile.more()):
    frame = infile.pop_frame()
    evt_head = frame["I3EventHeader"]
    evt_id = evt_head.event_id
    if(not(evt_id%5) and evt_id%10):
        outfile1.push(frame)
        
outfile1.close()

infile.rewind()
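
A test for "ends with 5" can be written either as "divisible by 5 but not by 10" or directly as evt_id % 10 == 5. As a quick check that needs no i3 file, the sketch below compares the two on a range of non-negative integers; both function names are hypothetical.

```python
from __future__ import print_function


def ends_in_five_two_step(evt_id):
    # Divisible by 5 but not by 10
    return (not (evt_id % 5)) and bool(evt_id % 10)


def ends_in_five_direct(evt_id):
    # Last decimal digit is 5
    return evt_id % 10 == 5


# Collect every integer on which the two tests disagree
mismatches = [n for n in range(10000)
              if ends_in_five_two_step(n) != ends_in_five_direct(n)]
selected = [n for n in range(40) if ends_in_five_direct(n)]
print(mismatches, selected)
```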

There are relatively few cases where you will want to cut events based on the Event ID. A more common object used to cut events is the FilterMask. The second level of processing for analysis files after triggering is to determine if the event passes certain predetermined cuts, or filters, for a specific type of analysis. The pass or fail result for the event is then stored in a dict in the FilterMask.

filt_mask = frame["FilterMask"]

You can determine which filters are included in the FilterMask simply by printing it:

print(filt_mask)

{ 
	CascadeFilter_11 : 0 , 0
	DeepCoreFilter_11 : 1 , 1
	EHEFilter_11 : 0 , 0
	FilterMinBias_11 : 1 , 0
	GCLEStarting_11 : 0 , 0
	ICOnlineL2Filter_11 : 0 , 0
	IceTopMuonCalibration_11 : 0 , 0
	IceTopSTA3_11 : 0 , 0
	IceTopSTA3_InIceSMT_11 : 0 , 0
	IceTopSTA8_11 : 0 , 0
	IceTopSTA8_InIceSMT_11 : 0 , 0
	IceTop_InFill_STA3_11 : 0 , 0
	InIceSMT_IceTopCoincidence_11 : 0 , 0
	MoonFilter_11 : 0 , 0
	MuonFilter_11 : 0 , 0
	PhysicsMinBiasTrigger_11 : 0 , 0
	SDST_GCHE_11 : 0 , 0
	SDST_GCMinBias_11 : 1 , 0
	SDST_GCNWStarting_11 : 1 , 1
	SDST_LowUp_11 : 1 , 1
	SDST_MoonFilter_11 : 0 , 0
	SDST_MuonFilter_11 : 0 , 0
	SDST_SunFilter_11 : 0 , 0
	SDST_VEF_11 : 0 , 0
	SlopFilterTime_11 : 0 , 0
	SlopFilterTrig_11 : 0 , 0
 }

Each filter name is a key in the filter mask. You can then access the pass or fail result of the event as a boolean using the condition_passed attribute:

filt_mask["CascadeFilter_11"].condition_passed

False

In this case the result of the CascadeFilter_11 is False.
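
To see at a glance which filters an event passed, the keys can be looped over and condition_passed checked for each. The sketch below uses a plain dict and a namedtuple as hypothetical stand-ins for the FilterMask and its entries, with four values copied from the printout above. The same keys() and indexing pattern should carry over to the real object, although this has not been run against an i3 file.

```python
from __future__ import print_function
from collections import namedtuple

# Hypothetical stand-in for one entry of the FilterMask
ToyFilterResult = namedtuple("ToyFilterResult", ["condition_passed"])

toy_mask = {
    "CascadeFilter_11": ToyFilterResult(False),
    "DeepCoreFilter_11": ToyFilterResult(True),
    "EHEFilter_11": ToyFilterResult(False),
    "SDST_LowUp_11": ToyFilterResult(True),
}


def passed_filters(mask):
    """Return the sorted names of all filters whose condition passed."""
    return sorted(name for name in mask.keys() if mask[name].condition_passed)


print(passed_filters(toy_mask))
```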

Exercise: Write a function that outputs frames to a file if they pass EHEFilter_11

Most of the information you want to use in your analysis is contained in the reconstructed pulses. Reconstructed pulses contain information about what was measured by each DOM in an event.

off_pulses = frame['OfflinePulses']

The pulses we extracted from the frame are indexed by the DOM that recorded them. We can look at all the DOMs in the event using a simple for loop:

for i,j in off_pulses:
    print(i,j)

OMKey(5,34,0) [[ I3RecoPulse Time : 11206.2
            Charge : 0.133119
             Width : 8.33333
             Flags : FADC 
]]
OMKey(10,26,0) [[ I3RecoPulse Time : 5448.1
            Charge : 1.03105
             Width : 8.33333
             Flags : FADC 
]]
OMKey(10,48,0) [[ I3RecoPulse Time : 5569.71
            Charge : 2.81031
             Width : 8.33333
             Flags : FADC 
]]
OMKey(12,37,0) [[ I3RecoPulse Time : 9352.07
            Charge : 2.09284
             Width : 8.33333
             Flags : FADC 
], [ I3RecoPulse Time : 12420.8
            Charge : 1.22696
             Width : 8.33333
             Flags : FADC 
]]
OMKey(16,20,0) [[ I3RecoPulse Time : 6504.84
            Charge : 1.42294
             Width : 8.33333
             Flags : FADC 
]]
OMKey(18,38,0) [[ I3RecoPulse Time : 12507.3
            Charge : 0.638218
             Width : 8.33333
             Flags : FADC 
]]
OMKey(25,55,0) [[ I3RecoPulse Time : 11757.4
            Charge : 1.42419
             Width : 8.33333
             Flags : FADC 
]]
OMKey(27,12,0) [[ I3RecoPulse Time : 3977.29
            Charge : 0.759791
             Width : 8.33333
             Flags : FADC 
]]
OMKey(28,54,0) [[ I3RecoPulse Time : 4598.82
            Charge : 1.07698
             Width : 8.33333
             Flags : FADC 
]]
OMKey(29,46,0) [[ I3RecoPulse Time : 8003.45
            Charge : 1.62029
             Width : 8.33333
             Flags : FADC 
]]
OMKey(36,57,0) [[ I3RecoPulse Time : 11722.2
            Charge : 0.624308
             Width : 8.33333
             Flags : FADC 
]]
OMKey(38,60,0) [[ I3RecoPulse Time : 15174.4
            Charge : 2.28419
             Width : 8.33333
             Flags : FADC 
]]
OMKey(40,7,0) [[ I3RecoPulse Time : 8075.85
            Charge : 0.809269
             Width : 8.33333
             Flags : FADC 
]]
OMKey(43,5,0) [[ I3RecoPulse Time : 4108.15
            Charge : 1.57164
             Width : 8.33333
             Flags : FADC 
]]
OMKey(46,27,0) [[ I3RecoPulse Time : 4171.12
            Charge : 0.771984
             Width : 8.33333
             Flags : FADC 
]]
OMKey(46,39,0) [[ I3RecoPulse Time : 11372
            Charge : 1.19782
             Width : 8.33333
             Flags : FADC 
]]
OMKey(47,35,0) [[ I3RecoPulse Time : 6086.95
            Charge : 1.36158
             Width : 8.33333
             Flags : FADC 
]]
OMKey(52,4,0) [[ I3RecoPulse Time : 6476.62
            Charge : 1.19959
             Width : 8.33333
             Flags : FADC 
]]
OMKey(53,42,0) [[ I3RecoPulse Time : 7174.29
            Charge : 0.764415
             Width : 8.33333
             Flags : FADC 
]]
OMKey(57,7,0) [[ I3RecoPulse Time : 7658.25
            Charge : 1.80153
             Width : 8.33333
             Flags : FADC 
]]
OMKey(57,49,0) [[ I3RecoPulse Time : 15819.1
            Charge : 1.26751
             Width : 8.33333
             Flags : FADC 
]]
OMKey(58,10,0) [[ I3RecoPulse Time : 10722.7
            Charge : 0.264381
             Width : 8.33333
             Flags : FADC 
]]
OMKey(60,22,0) [[ I3RecoPulse Time : 9534.72
            Charge : 0.812832
             Width : 8.33333
             Flags : FADC 
]]
OMKey(67,7,0) [[ I3RecoPulse Time : 9861.3
            Charge : 1.35874
             Width : 8.33333
             Flags : FADC 
], [ I3RecoPulse Time : 12987.1
            Charge : 0.96305
             Width : 8.33333
             Flags : FADC 
]]
OMKey(68,28,0) [[ I3RecoPulse Time : 5625.43
            Charge : 0.993418
             Width : 8.33333
             Flags : FADC 
]]
OMKey(70,28,0) [[ I3RecoPulse Time : 3960.37
            Charge : 1.18348
             Width : 8.33333
             Flags : FADC 
]]
OMKey(71,54,0) [[ I3RecoPulse Time : 3969.88
            Charge : 0.707554
             Width : 8.33333
             Flags : FADC 
]]
OMKey(74,4,0) [[ I3RecoPulse Time : 14532.2
            Charge : 2.19653
             Width : 8.33333
             Flags : FADC 
]]
OMKey(75,13,0) [[ I3RecoPulse Time : 4832.48
            Charge : 1.31349
             Width : 8.33333
             Flags : FADC 
], [ I3RecoPulse Time : 10907.4
            Charge : 0.675741
             Width : 8.33333
             Flags : FADC 
]]
OMKey(79,41,0) [[ I3RecoPulse Time : 10696.1
            Charge : 1.35165
             Width : 8.33333
             Flags : FADC 
]]
OMKey(82,34,0) [[ I3RecoPulse Time : 15351.2
            Charge : 1.65049
             Width : 8.33333
             Flags : FADC 
], [ I3RecoPulse Time : 15384.6
            Charge : 0.492013
             Width : 8.33333
             Flags : FADC 
]]
OMKey(83,52,0) [[ I3RecoPulse Time : 10603.2
            Charge : 1.34736
             Width : 8.33333
             Flags : FADC 
]]
OMKey(84,51,0) [[ I3RecoPulse Time : 10076.9
            Charge : 0.266619
             Width : 1.41283
             Flags : LC ATWD FADC 
], [ I3RecoPulse Time : 10086
            Charge : 1.50078
             Width : 1.6536
             Flags : LC ATWD FADC 
], [ I3RecoPulse Time : 10096.8
            Charge : 0.236888
             Width : 1.6536
             Flags : LC ATWD FADC 
]]
OMKey(84,52,0) [[ I3RecoPulse Time : 10074.9
            Charge : 1.40206
             Width : 0.910752
             Flags : LC ATWD FADC 
]]
OMKey(84,53,0) [[ I3RecoPulse Time : 9956.73
            Charge : 0.689426
             Width : 1.65344
             Flags : LC ATWD FADC 
]]
OMKey(84,55,0) [[ I3RecoPulse Time : 9905.45
            Charge : 0.248409
             Width : 1.65357
             Flags : LC ATWD FADC 
], [ I3RecoPulse Time : 9917.85
            Charge : 0.93582
             Width : 1.65357
             Flags : LC ATWD FADC 
], [ I3RecoPulse Time : 9926.94
            Charge : 0.221708
             Width : 1.13546
             Flags : LC ATWD FADC 
]]
OMKey(84,56,0) [[ I3RecoPulse Time : 9897.99
            Charge : 0.33384
             Width : 1.65392
             Flags : LC ATWD FADC 
]]
OMKey(84,57,0) [[ I3RecoPulse Time : 9882.78
            Charge : 2.05384
             Width : 1.22315
             Flags : LC ATWD FADC 
], [ I3RecoPulse Time : 9891.05
            Charge : 0.18448
             Width : 1.65409
             Flags : LC ATWD FADC 
]]
OMKey(84,58,0) [[ I3RecoPulse Time : 9890.02
            Charge : 0.774523
             Width : 1.57362
             Flags : LC ATWD FADC 
]]
OMKey(85,47,0) [[ I3RecoPulse Time : 10674.3
            Charge : 0.887308
             Width : 8.33333
             Flags : FADC 
]]
OMKey(85,50,0) [[ I3RecoPulse Time : 10237.1
            Charge : 1.28551
             Width : 8.33333
             Flags : FADC 
]]
OMKey(85,58,0) [[ I3RecoPulse Time : 10111.1
            Charge : 1.11807
             Width : 8.33333
             Flags : FADC 
]]
OMKey(86,3,0) [[ I3RecoPulse Time : 9901.62
            Charge : 0.414108
             Width : 8.33333
             Flags : FADC 
]]
OMKey(86,13,0) [[ I3RecoPulse Time : 4708.29
            Charge : 0.788198
             Width : 8.33333
             Flags : FADC 
]]

From this you can see that each entry has an OMKey, which identifies the DOM, and a pulse series. From the OMKey you can get the OM number and the string number for the DOM:

omstringTup = [(i.om,i.string) for i in off_pulses.keys()]
print(omstringTup[:10])

[(34, 5), (26, 10), (48, 10), (37, 12), (20, 16), (38, 18), (55, 25), (12, 27), (54, 28), (46, 29)]

off_pulses.keys()

[OMKey(5,34,0),
 OMKey(10,26,0),
 OMKey(10,48,0),
 OMKey(12,37,0),
 OMKey(16,20,0),
 OMKey(18,38,0),
 OMKey(25,55,0),
 OMKey(27,12,0),
 OMKey(28,54,0),
 OMKey(29,46,0),
 OMKey(36,57,0),
 OMKey(38,60,0),
 OMKey(40,7,0),
 OMKey(43,5,0),
 OMKey(46,27,0),
 OMKey(46,39,0),
 OMKey(47,35,0),
 OMKey(52,4,0),
 OMKey(53,42,0),
 OMKey(57,7,0),
 OMKey(57,49,0),
 OMKey(58,10,0),
 OMKey(60,22,0),
 OMKey(67,7,0),
 OMKey(68,28,0),
 OMKey(70,28,0),
 OMKey(71,54,0),
 OMKey(74,4,0),
 OMKey(75,13,0),
 OMKey(79,41,0),
 OMKey(82,34,0),
 OMKey(83,52,0),
 OMKey(84,51,0),
 OMKey(84,52,0),
 OMKey(84,53,0),
 OMKey(84,55,0),
 OMKey(84,56,0),
 OMKey(84,57,0),
 OMKey(84,58,0),
 OMKey(85,47,0),
 OMKey(85,50,0),
 OMKey(85,58,0),
 OMKey(86,3,0),
 OMKey(86,13,0)]

The pulses for each DOM contain information about the time, charge and the electronic read out of the pulse (we'll ignore width for now). We can use this information to learn about the physics of the event. One criterion that is often used is the number of DOMs in a pulse series, referred to as the "number of channels" in the event. We can easily get this information, which is simply the length of our pulse variable:

print(len(off_pulses))

44

You might also want to know the total number of pulses in the event. Each DOM may contain multiple pulses. To count the total number of pulses in the event, flatten the per-DOM pulse series into a single list and take its length:

all_pulses = [p for i,j in off_pulses for p in j]
print(len(all_pulses))

53

For technical reasons a single interaction can sometimes be split into multiple pulses. Therefore it is often better to use the total charge in an event instead of the total number of pulses. Since we created a list of all the pulses in the event already, we can calculate the total charge as follows:

tot_charge = sum([p.charge for p in all_pulses])
print(tot_charge)

56.5428167433
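
The three quantities introduced so far (number of channels, number of pulses and total charge) can be cross-checked on a small hand-made example. The sketch below uses a plain dict keyed by (string, om) tuples and a namedtuple as hypothetical stand-ins for the pulse map and I3RecoPulse; the times and charges are copied from three DOMs in the printout above. Note that a dict needs .items() to yield key and value pairs, whereas the pulse map in the tutorial yields them directly.

```python
from __future__ import print_function
from collections import namedtuple

# Hypothetical stand-in for a pulse; only time and charge are kept
ToyPulse = namedtuple("ToyPulse", ["time", "charge"])

# (string, om) -> list of pulses, values copied from the printout
toy_pulses = {
    (5, 34): [ToyPulse(11206.2, 0.133119)],
    (12, 37): [ToyPulse(9352.07, 2.09284), ToyPulse(12420.8, 1.22696)],
    (84, 51): [ToyPulse(10076.9, 0.266619),
               ToyPulse(10086.0, 1.50078),
               ToyPulse(10096.8, 0.236888)],
}

n_channels = len(toy_pulses)                      # number of hit DOMs
n_pulses = sum(len(series) for series in toy_pulses.values())
charge_per_dom = dict((key, sum(p.charge for p in series))
                      for key, series in toy_pulses.items())
total_charge = sum(charge_per_dom.values())
print(n_channels, n_pulses, round(total_charge, 4))
```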

We can now use what we learned about plotting to plot the number of channels for each event in the file:

n_chan = []
while(infile.more()):
    frame = infile.pop_frame()
    off_pulses = frame['OfflinePulses']
    n_chan.append(len(off_pulses))

bin_it = numpy.linspace(0,max(n_chan),max(n_chan)+1)
counts, bins, patches = pylab.hist(n_chan,bin_it,color='r',histtype='step')
pylab.ylim(0,max(counts)+1)
pylab.xlabel('NChan')

<matplotlib.text.Text at 0x3d81dd0>

Exercise: Plot number of pulses and total charge for all events in the file

Some pulses do not contain both fADC and ATWD information. If you want to treat pulses differently depending on whether or not they have ATWD information, you can do so by using flags as follows:

tot_atwd_charge = sum([p.charge for p in all_pulses if (p.flags & dataclasses.I3RecoPulse.PulseFlags.ATWD)])
print(tot_atwd_charge)

8.84839096665
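
The expression p.flags & ...ATWD is a bitwise AND: each flag occupies one bit, and the result is nonzero only if that bit is set. A minimal sketch with plain integers shows the idea. The bit values 1, 2 and 4 are illustrative assumptions; in real code the constants should always be taken from dataclasses.I3RecoPulse.PulseFlags.

```python
from __future__ import print_function

# Illustrative bit values only; use dataclasses.I3RecoPulse.PulseFlags in real code
LC, ATWD, FADC = 1, 2, 4

fadc_only = FADC                 # like the "Flags : FADC" pulses above
full_readout = LC | ATWD | FADC  # like the "Flags : LC ATWD FADC" pulses above


def has_atwd(flags):
    # Bitwise AND keeps only the ATWD bit; nonzero means the bit is set
    return bool(flags & ATWD)


print(has_atwd(fadc_only), has_atwd(full_readout))
```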

We can then look at the distribution of charge in the event like so:

charges = numpy.array([p.charge for p in all_pulses])
bin_it = numpy.linspace(0,math.ceil(max(charges)),int(math.ceil(max(charges)))+1)
counts, bins, patches = pylab.hist(charges,bin_it,color='r',histtype='step')
pylab.ylim(0,max(counts)+1)
pylab.xlabel('Pulse Charge')

<matplotlib.text.Text at 0x3d13350>

And compare this to the plot of the distribution of charge for pulses that have ATWD information:

charges = numpy.array([p.charge for p in all_pulses])
atwd_charges = numpy.array([p.charge for p in all_pulses if (p.flags & dataclasses.I3RecoPulse.PulseFlags.ATWD)])
bin_it = numpy.linspace(0,math.ceil(max(charges)),int(math.ceil(max(charges)))+1)
counts, bins, patches = pylab.hist(charges,bin_it,color='r',histtype='step',label='All')
counts1, bins1, patches1 = pylab.hist(atwd_charges,bin_it,color='b',histtype='step',label="Has ATWD")
pylab.ylim(0,max(max(counts),max(counts1))+1)
pylab.xlabel('Pulse Charge')
pylab.legend(loc=1)

<matplotlib.legend.Legend at 0x3d45e50>

For various reasons we may not want to use all of the pulses in an event. We may then want to be able to pass a pulse series that meets our criterion to various algorithms. In this case, we can create a new pulse series that contains only the pulses we want by using an I3RecoPulseSeriesMapMask. Let's make a pulse series containing only pulses with ATWD information. First we have to create a function that returns True or False depending on whether the pulse meets our requirements:

def myPulses(omkey, index, pulse):
    # Keep the pulse only if its ATWD flag bit is set
    return bool(pulse.flags & dataclasses.I3RecoPulse.PulseFlags.ATWD)

Our function takes three arguments: omkey, index and pulse. The omkey is the key for the DOM that contains the pulse, the index is the position of the pulse in the pulse series for that DOM, and the pulse is the I3RecoPulse, which contains information about the time, charge and flags of the pulse, as we learned before. Now, to create our new pulse series, we define the mask:

my_mask = dataclasses.I3RecoPulseSeriesMapMask(frame,'OfflinePulses',myPulses)

Now that we created our I3RecoPulseSeriesMapMask, we can access information about the pulses in the same way as before:

mask_pulses = my_mask.apply(frame)
tot_mask_charge = sum([p.charge for i,j in mask_pulses for p in j])
print(tot_mask_charge)

8.84839096665

Calculating the charge of our new pulse series, we see that we get the same result as when we required ATWD information in our full pulse series.
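
Conceptually, a predicate-based mask visits every pulse, calls the function with (omkey, index, pulse) and keeps the pulse if the result is true. The sketch below imitates that selection on plain Python containers. It is not how I3RecoPulseSeriesMapMask is implemented: `select_pulses`, `ToyPulse` and the ATWD bit value are hypothetical, and dropping DOMs that end up with no pulses is a choice made for this sketch.

```python
from __future__ import print_function
from collections import namedtuple

ToyPulse = namedtuple("ToyPulse", ["charge", "flags"])
ATWD = 2  # illustrative bit value, not taken from dataclasses


def select_pulses(pulse_map, predicate):
    """Keep the pulses for which predicate(omkey, index, pulse) is true."""
    selected = {}
    for omkey, series in pulse_map.items():
        kept = [p for index, p in enumerate(series)
                if predicate(omkey, index, p)]
        if kept:  # DOMs left without pulses are dropped in this sketch
            selected[omkey] = kept
    return selected


def toy_atwd_only(omkey, index, pulse):
    return bool(pulse.flags & ATWD)


# Charges copied from two DOMs in the printout; flags use the toy bit values
toy_map = {
    (85, 47): [ToyPulse(0.887308, 4)],
    (84, 57): [ToyPulse(2.05384, 7), ToyPulse(0.18448, 7)],
}
toy_selected = select_pulses(toy_map, toy_atwd_only)
print(sorted(toy_selected.keys()))
```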

Exercise: Create a mask that contains only DOMs where the total charge is > 10

Exercise: Create a mask that contains only DOMs on the inner DeepCore strings (inner DC strings are strings [26,27,35,36,37,45,46,79,80,81,82,83,84,85,86])

Now that you have created a new pulse object, you will probably want to add this object to the frame. This is achieved with the frame 'Put' method, which requires the name of your object and the object variable as inputs:

frame.Put('ATWDPulseMask',my_mask)

or

frame['OtherATWDPulseMask']=my_mask

We can now add our modified frame to the output file:

testfile.push(frame)
testfile.close()

Other useful information that you will often want is the position of the DOM that contains the pulses you are interested in. I3RecoPulses do not explicitly contain information about the position of the DOM. However, the OMKey of the DOM can be used to determine the position from the I3Geometry. We will now use the second file we loaded before:

g_frame = geofile.pop_frame()
g_frame.keys()

['I3Geometry']

We saw before that the GCD file contains three main objects: I3Geometry, I3Calibration and I3DetectorStatus. The frame we just popped holds the I3Geometry, which is the object we need for the DOM positions:

geometry = g_frame["I3Geometry"]

The I3Geometry contains information about the DOM positions in something called an OMGeo. We can see the mapping of OMKeys to DOM information by looping over the OMGeos, similar to how we looped over the pulse series map before (the loop is commented out here because it prints one entry per DOM):

#for i, j in geometry.omgeo:
#    print(i, j)

help(icecube.dataclasses.I3OMGeo)

Help on class I3OMGeo in module icecube.dataclasses:

class I3OMGeo(Boost.Python.instance)
 |  Method resolution order:
 |      I3OMGeo
 |      Boost.Python.instance
 |      __builtin__.object
 |  
 |  Methods defined here:
 |  
 |  __copy__(...)
 |      __copy__( (object)arg1) -> object :
 |          Make a shallow copy using the copy constructor
 |      
 |          C++ signature :
 |              boost::python::api::object __copy__(boost::python::api::object)
 |  
 |  __deepcopy__(...)
 |      __deepcopy__( (object)arg1, (dict)arg2) -> object :
 |          Make a deep copy using the copy constructor
 |      
 |          C++ signature :
 |              boost::python::api::object __deepcopy__(boost::python::api::object,boost::python::dict)
 |  
 |  __getstate__(...)
 |      __getstate__( (object)arg1) -> tuple :
 |      
 |          C++ signature :
 |              boost::python::tuple __getstate__(boost::python::api::object)
 |  
 |  __init__(...)
 |      __init__( (object)arg1) -> None :
 |      
 |          C++ signature :
 |              void __init__(_object*)
 |      
 |      __init__( (object)arg1, (I3OMGeo)arg2) -> None :
 |      
 |          C++ signature :
 |              void __init__(_object*,I3OMGeo)
 |  
 |  __reduce__ = <unnamed Boost.Python function>(...)
 |  
 |  __setattr__(...)
 |      __setattr__( (object)arg1, (object)arg2, (object)arg3) -> None :
 |      
 |          C++ signature :
 |              void __setattr__(boost::python::api::object,boost::python::api::object,boost::python::api::object)
 |  
 |  __setstate__(...)
 |      __setstate__( (object)arg1, (tuple)arg2) -> None :
 |      
 |          C++ signature :
 |              void __setstate__(boost::python::api::object,boost::python::tuple)
 |  
 |  ----------------------------------------------------------------------
 |  Data descriptors defined here:
 |  
 |  area
 |  
 |  direction
 |  
 |  omtype
 |  
 |  orientation
 |  
 |  position
 |  
 |  ----------------------------------------------------------------------
 |  Data and other attributes defined here:
 |  
 |  AMANDA = icecube.dataclasses.OMType.AMANDA
 |  
 |  IceCube = icecube.dataclasses.OMType.IceCube
 |  
 |  IceTop = icecube.dataclasses.OMType.IceTop
 |  
 |  OMType = <class 'icecube.dataclasses.OMType'>
 |  
 |  
 |  UnknownType = icecube.dataclasses.OMType.UnknownType
 |  
 |  __getstate_manages_dict__ = True
 |  
 |  __instance_size__ = 32
 |  
 |  __safe_for_unpickling__ = True
 |  
 |  ----------------------------------------------------------------------
 |  Data descriptors inherited from Boost.Python.instance:
 |  
 |  __dict__
 |  
 |  __weakref__
 |  
 |  ----------------------------------------------------------------------
 |  Data and other attributes inherited from Boost.Python.instance:
 |  
 |  __new__ = <built-in method __new__ of Boost.Python.class object>
 |      T.__new__(S, ...) -> a new object with type S, a subtype of T

From this we see that the OMGeo has many properties. The property we want is 'position':

dom_pos = [j.position for i,j in geometry.omgeo]

The entries of dom_pos are all objects of type 'I3Position', which contains the x, y and z components of the position. You can access the individual components as follows:

sep_pos = [(p.x,p.y,p.z) for p in dom_pos]
print(sep_pos[:10])

[(-256.1400146484375, -521.0800170898438, 496.0299987792969), (-256.1400146484375, -521.0800170898438, 479.010009765625), (-256.1400146484375, -521.0800170898438, 461.989990234375), (-256.1400146484375, -521.0800170898438, 444.9700012207031), (-256.1400146484375, -521.0800170898438, 427.95001220703125), (-256.1400146484375, -521.0800170898438, 410.92999267578125), (-256.1400146484375, -521.0800170898438, 393.9100036621094), (-256.1400146484375, -521.0800170898438, 376.8800048828125), (-256.1400146484375, -521.0800170898438, 359.8599853515625), (-256.1400146484375, -521.0800170898438, 342.8399963378906)]
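
In practice the positions are usually needed only for the DOMs that appear in a pulse series, which means looking up each OMKey in the geometry. The sketch below shows that lookup with plain dicts as hypothetical stand-ins. The coordinates are rounded from the printout above, but the (string, om) keys attached to them and the hit charges are illustrative. With the real objects the lookup would presumably read geometry.omgeo[omkey].position, which is an assumption here, since the tutorial only iterates over omgeo.

```python
from __future__ import print_function

# Hypothetical geometry: (string, om) -> (x, y, z); key pairing is illustrative
toy_geometry = {
    (1, 1): (-256.14, -521.08, 496.03),
    (1, 2): (-256.14, -521.08, 479.01),
    (1, 3): (-256.14, -521.08, 461.99),
}

# Hypothetical pulse map: (string, om) -> list of pulse charges
toy_hits = {(1, 1): [0.5], (1, 3): [1.2, 0.3]}

# z coordinate of every DOM that has at least one pulse
hit_z = dict((key, toy_geometry[key][2]) for key in toy_hits)
print(sorted(hit_z.items()))
```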

Exercise: In the IceCube detector there is a region where the ice is much less clear due to the accumulation of particulates called the 'dust layer'. This dust layer is approximately in the region from 0 to -150 m in the IceCube coordinate system. Create two pulse series, one with pulses above the dust layer and one with pulses below the dust layer. Plot the total charge in each pulse series for all events.

Exercise: a) Previously you created a pulse series that contained only DOMs from the inner DeepCore strings. Create a new pulse discriminator that requires that these pulses be below the dust layer.

b) A quantity that is frequently used in analyses is the center of gravity (COG), which is the charge-weighted mean position and time of the pulses in the selection. Plot the COG in x, y, z and t for the DeepCore pulse series you created, for all events.
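
For reference, a charge-weighted mean of a quantity v is sum(q_i * v_i) / sum(q_i), applied separately to x, y, z and t. The sketch below shows only this arithmetic on two made-up pulses; the function name and all numbers are hypothetical, and collecting the charges, positions and times from the frame and the geometry is left to the exercise.

```python
from __future__ import print_function


def center_of_gravity(charges, coords):
    """Charge-weighted mean of each column of coords, e.g. (x, y, z, t)."""
    total = float(sum(charges))
    n_dim = len(coords[0])
    return tuple(sum(q * c[d] for q, c in zip(charges, coords)) / total
                 for d in range(n_dim))


# Two made-up pulses: the second carries three times the charge of the first
toy_charges = [1.0, 3.0]
toy_coords = [(0.0, 0.0, -200.0, 10000.0),
              (4.0, 0.0, -300.0, 10100.0)]
cog = center_of_gravity(toy_charges, toy_coords)
print(cog)
```

The result lies three quarters of the way from the first pulse to the second, as expected for a 1:3 charge ratio.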

The distance between two DOMs is easy to find. I3Positions act like vectors and can be added and subtracted, so the distance is simply the magnitude of the difference:

first_dist = dom_pos[0]-dom_pos[1]
print(dom_pos[0],dom_pos[1])

I3Position(-256.14,-521.08,496.03) I3Position(-256.14,-521.08,479.01)

first_dist_mag = first_dist.magnitude
print(first_dist_mag)

17.0199890137
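
The magnitude is the usual Euclidean norm, sqrt(dx^2 + dy^2 + dz^2). As a cross-check, the sketch below recomputes the distance from the two rounded positions printed above using plain tuples; the helper name `distance` is hypothetical.

```python
from __future__ import print_function
import math


def distance(a, b):
    """Euclidean distance between two points given as (x, y, z) tuples."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


dom0 = (-256.14, -521.08, 496.03)
dom1 = (-256.14, -521.08, 479.01)
d01 = distance(dom0, dom1)
print(round(d01, 2))
```

The tutorial output differs from 17.02 in the sixth digit because the stored coordinates are not exactly the rounded values, as the long decimals printed earlier show.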

Exercise: Create a pulse series that contains all pulses that were not selected by your inner DeepCore pulse selector, call it 'NonDCPulses'.

Muons travel through the ice at approximately the speed of light. A muon travelling through the detector and into DeepCore will have pulses inside the DeepCore volume that are causally related to pulses outside of it.

For each pulse in NonDCPulses calculate the distance and the time between the pulses and the COG of the DeepCore pulses you selected before. From this, calculate the velocity of a particle travelling between those pulses, distance/time. Plot the distribution of velocities. Count the number of pulses in NonDCPulses with a velocity in the range 0.25-0.4 m/ns.

Create an output file that contains events that have fewer than 3 pulses in NonDCPulses.

Exercise: This is a simple method for vetoing through-going muons that pass through DeepCore. Think of a way to improve this algorithm using information about the pulses that we learned in this tutorial.

Exercise: Create a new method of vetoing events using information we learned in this tutorial.