sDST Archive Creation Notes

The plan is to create sDST archive files for all IC86 and later runs. For earlier runs, the level2 data will be deemed to be the archive.

Full PFRaw files arrive on tape. Translating them into sDST takes 100 ± 10 minutes per file on the farm, on average. The reduction factor is about 4.3. The nominal reduction factor is about 8, but only 44% of events are in PFFilt, so the sDST archive includes almost twice as many events.
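As a rough check on these figures, the arithmetic can be sketched in Python. The timing and reduction factors are the ones quoted above; the function names and the file count in the usage note are hypothetical, and the ± 10 minutes is applied as a simple bracket on the total rather than as a proper statistical error.

```python
# Rough arithmetic on the figures quoted in the notes (a sketch, not a benchmark).
MINUTES_PER_FILE = 100.0     # average farm time per PFRaw file
MINUTES_SPREAD = 10.0        # quoted +/- on that average
NOMINAL_REDUCTION = 8.0      # nominal reduction factor
OBSERVED_REDUCTION = 4.3     # reduction factor seen for the sDST archive


def farm_hours(n_files):
    """Return (low, central, high) farm hours to translate n_files files."""
    low = n_files * (MINUTES_PER_FILE - MINUTES_SPREAD) / 60.0
    central = n_files * MINUTES_PER_FILE / 60.0
    high = n_files * (MINUTES_PER_FILE + MINUTES_SPREAD) / 60.0
    return low, central, high


def implied_size_ratio():
    """Ratio of the nominal to the observed reduction factor."""
    return NOMINAL_REDUCTION / OBSERVED_REDUCTION
```

For a hypothetical tape holding 600 files, farm_hours(600) gives a central value of 1000 farm hours. implied_size_ratio() is about 1.86, which is in line with "almost twice as many events" if the size per event is similar.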

By request there is no effort made to avoid duplication: all events are processed whether or not they were beamed North earlier.

Some subruns are much shorter than others, and some runs are bad. I do not attempt to exclude bad runs! "Bad" is somewhat flexible, and since this is to serve as an archive I judge it appropriate to keep everything. If someone changes his mind later, there's no need to respin tapes.

As of 8-July-2013, I still need to run full processing on a full run, to compare with the processing done on the PFFilt. There should be no difference for most analyses, but this has not been demonstrated.

So far, the processing is done with scripts run by hand rather than by the complete machine.

As jbellinger, on onyx:

cd ~jbellinger/ARCHIVE/dumptape
eval `/net/local/software/setup.sh`
python TDCspawn.py `pwd`/debugout.log >& shortrun.log &
# This loads tapes into drives and does some bookkeeping.
vi byhand.com
# change the tape numbers (e.g.  TAPE_00104 to TAPE_00107, Tape_00104 to Tape_00107, etc)
./byhand.com
# Wait about 5-6 hours.  MAKE SURE THE DUMP IS DONE BEFORE PROCEEDING.
# This creates Tape_0xxxx.log files which should contain all the file
# names
# When the dump is done, look at (e.g.) /data/exp/stagetape/TAPE_00107/getunique.com
# This should exist and not be empty.  Its purpose is to rename files from
# tempstore to goodfiles such that duplicate files are left behind.
cd /data/exp/stagetape/TAPE_00107
more ./getunique.com
# This should have already been executed, so there should be files in
# goodfiles corresponding to the entries in getunique.com
# Set the group ownership so that sdstarchive can write there!
sudo chown -R jbellinger:sdst /data/exp/stagetape/TAPE_00107
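The by-hand checks described in the comments above (getunique.com exists and is not empty, and goodfiles has been populated) could be partly automated. The following is a minimal sketch only: the function name is hypothetical, and it assumes that getunique.com and a goodfiles subdirectory sit directly under the staged tape directory. It does not compare the entries in getunique.com against the files, because the format of that script is not described here.

```python
import os


def check_staged_tape(tape_dir):
    """Return a list of problems found in a staged tape directory.

    An empty list means the basic checks passed.  The layout assumed here
    (getunique.com and goodfiles/ directly under tape_dir) is a guess
    based on the notes, not a documented convention.
    """
    problems = []

    script = os.path.join(tape_dir, "getunique.com")
    if not os.path.isfile(script):
        problems.append("getunique.com is missing")
    elif os.path.getsize(script) == 0:
        problems.append("getunique.com is empty")

    good = os.path.join(tape_dir, "goodfiles")
    if not os.path.isdir(good):
        problems.append("goodfiles directory is missing")
    elif not os.listdir(good):
        problems.append("goodfiles directory is empty")

    return problems
```

For example, check_staged_tape("/data/exp/stagetape/TAPE_00107") should return an empty list before the tape is handed over for processing. It cannot tell whether the dump itself has finished, so that check remains a by-hand one.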

As sdstarchive, on npx4:

cd /data/user/sdstarchive/ARCHIVE/CandC
# Determine which tape dumps need to be processed.  Make sure this
# happens AFTER the tape is done dumping!  This is a by-hand check.
# Execute the job creation file, here for TAPE_00107
./drivesubmit.sh TAPE_00107
condor_submit job.TAPE_00107.submit
# Wait for a day or so until the jobs are done.

Checking for problems in jobs is fairly easy:

cd /data/user/sdstarchive/ARCHIVE/TAPE_00107/log
grep 'return value' * | grep -v 'value 0'
# For anything that turns up (e.g. in log.88) you can do
more ../output/log.88
# One of the first things in the output file is the list of files
# processed.  The logging is not extensive, though.
# I have a script in /data/user/sdstarchive/ARCHIVE/CandC that
# checks for bad return codes:
cd /data/user/sdstarchive/ARCHIVE/CandC
./countLogs.sh TAPE_00103
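The grep check above can also be sketched in Python, which makes it easier to collect the failing log names for follow-up. This is not the countLogs.sh script, whose contents are not shown here. The function name is hypothetical, and the sketch assumes only that each job log reports its exit status as an integer right after the words "return value", as the grep pattern implies.

```python
import os
import re

# Matches e.g. "return value 0" or "return value 137" anywhere in a log.
RETURN_VALUE = re.compile(r"return value (-?\d+)")


def failed_jobs(log_dir):
    """Map log file name -> list of non-zero return values found in it."""
    failures = {}
    for name in sorted(os.listdir(log_dir)):
        path = os.path.join(log_dir, name)
        if not os.path.isfile(path):
            continue
        with open(path, errors="replace") as handle:
            text = handle.read()
        codes = [int(m.group(1)) for m in RETURN_VALUE.finditer(text)]
        bad = [code for code in codes if code != 0]
        if bad:
            failures[name] = bad
    return failures
```

Calling failed_jobs("/data/user/sdstarchive/ARCHIVE/TAPE_00107/log") would return a dictionary such as {"log.88": [1]} if only that job failed; the matching file under ../output can then be inspected as described above.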

Some files give checksum errors. Curiously enough, most of them seem to process well enough until the end or almost the end.

Tape ID  Run name                       Unique good  Failed runs                                           Missing / Also at
00101    tape_2012_300_RAW_copy_1       558          Run00120288_Subrun00000000_00000000                   299
00102    tape_2012_301_RAW_copy_1       591          Run00120288_Subrun00000000_00000001                   301,302; Fixed in 305
                                                     Run00120304_Subrun00000000_00000101
00103    tape_2012_302_RAW_copy_2       395          Run00120308_Subrun00000000_00000057                   Fixed in 306
00104    tape_2012_304_RAW_copy_2       565          0
00105    tape_2012_303_RAW_copy_1       4            0
00106    tape_2012_305_RAW_copy_1       394          Run00120307_Subrun00000000_00000153                   Fixed from 302
00107    tape_2012_307_RAW_copy_1       28           0
00108    tape_2012_306_RAW_copy_2       198          0
00109    tape_2012_308_RAW_copy_2       546          0
00110    tape_2012_320_RAW_copy_1       572          Run00120345_Subrun00000000_00000127                   Fixed in 324
00111    tape_2012_309_RAW_copy_1       587          Run00120316_Subrun00000000_00000034                   Fixed in 311
00112    tape_2012_321_RAW_copy_2       493          0
00113    tape_2012_323_RAW_copy_2       593          Run00120366_Subrun00000000_00000026 corrupt on tape   Fixed from 326
00114    tape_2012_322_RAW_copy_1       396          0
00115    tape_2012_324_RAW_copy_1       556          Run00120366_Subrun00000000_00000049                   Fixed from 325
00116    tape_2012_326_RAW_copy_1       200          Run00120368_Subrun00000000_00000165                   Fixed from 328
00117    tape_2012_325_RAW_copy_2       135          Run00120368_Subrun00000000_00000058                   Fixed from 329
00118    tape_2012_327_FILTERED_copy_1  0            0
00119    tape_2012_340_RAW_copy_1       505          Run00120388_Subrun00000000_00000043                   Fixed from 342
00120    tape_2012_329_RAW_copy_1       417          0
00121    tape_2012_341_RAW_copy_2       589          0
00122    tape_2012_342_RAW_copy_2       0            0
00123    tape_2012_328_RAW_copy_2       2            0
00124    tape_2012_343_RAW_copy_1       0            0
00125    tape_2012_342_RAW_copy_2       407          0                                                     BEGIN FULL EHE
00126    tape_2012_328_RAW_copy_2       393          0
00127    tape_2012_343_RAW_copy_1       202          0
00128    tape_2012_348_RAW_copy_1       589          0
00129    tape_2012_347_RAW_copy_2       588          0
00130    tape_2012_349_RAW_copy_1       204          0
00131    tape_2012_311_RAW_copy_2       400          0
00132    tape_2012_310_FILTERED_copy_1  0            0
00133    tape_2012_312_RAW_copy_1       576          0
00134    tape_2012_314_RAW_copy_1       195          0
00135    tape_2012_313_RAW_copy_2       66           0
00136    tape_2012_315_RAW_copy_2       387          0
00137    tape_2012_317_RAW_copy_2       ?
00138    tape_2012_316_RAW_copy_1       ?
00139    tape_2012_318_RAW_copy_1       ?
00140    tape_2012_330_RAW_copy_2       ?
00141    tape_2012_319_RAW_copy_2       ?
00142    tape_2012_331_RAW_copy_1       ?
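
The tape names listed above follow a pattern such as tape_2012_300_RAW_copy_1, and notes like "Fixed in 305" appear to refer to the three-digit number in that name. A small parser makes such cross-references easier to follow. This is a sketch only: the function name is hypothetical, and the field meanings (year, sequence number, data kind, copy number) are inferred from the names themselves rather than taken from any documentation.

```python
import re

# tape_<year>_<sequence>_<kind>_copy_<copy>, e.g. tape_2012_300_RAW_copy_1
TAPE_NAME = re.compile(r"^tape_(\d{4})_(\d+)_([A-Z]+)_copy_(\d+)$")


def parse_tape_name(name):
    """Split a tape name into (year, sequence, kind, copy).

    The interpretation of the four fields is an assumption.
    """
    match = TAPE_NAME.match(name)
    if match is None:
        raise ValueError("unrecognised tape name: %r" % name)
    year, sequence, kind, copy = match.groups()
    return int(year), int(sequence), kind, int(copy)
```

For example, parse_tape_name("tape_2012_305_RAW_copy_1") returns (2012, 305, "RAW", 1), so a note pointing at 305 can be matched to the tape whose sequence field is 305.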