Jobsub ID 301499.0@justin-prod-sched01.dune.hep.ac.uk
Jobsub ID | 301499.0@justin-prod-sched01.dune.hep.ac.uk |
Workflow Testing | Yes |
Workflow ID | 500 |
Stage ID | 1 |
User name | amcnab@fnal.gov |
HTCondor Group | group_dune |
Requested | Processors | 1 |
GPU | No |
RSS bytes | 1073741824 (1024 MiB) |
Wall seconds limit | 3600 (1 hour) |
Submitted time | 2024-11-22 23:20:59 |
Site | CA_SFU |
Entry | DUNE_CA_SFU_lcg-ce3 |
Last heartbeat | 2024-11-23 00:09:44 |
From worker node | Hostname | cdr1314.int.cedar.computecanada.ca |
cpuinfo | Intel(R) Xeon(R) Platinum 8160 CPU @ 2.10GHz |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 1073741824 (1024 MiB) |
Wall seconds limit | 84598 (23 hours) |
GPU | |
Inner Apptainer? | True |
Job state | finished |
Allocator name | justin-allocator-int.dune.hep.ac.uk |
Started | 2024-11-22 23:22:30 |
Input files | |
Jobscript | Exit code | 0 |
Real time | 39m (2354s) |
CPU time | 0m (31s = 1%) |
Outputting started | 2024-11-23 00:01:45 |
Output files | |
Finished | 2024-11-23 00:09:44 |
Saved logs | justin-logs:301499.0-justin-prod-sched01.dune.hep.ac.uk.logs.tgz |
Jobscript log (last 10,000 characters)
ing if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs
DEBUG:root:gfal.Default: closing protocol connection
DEBUG:root:[{'hostname': 'webdav.grid.surfsara.nl', 'scheme': 'davs', 'port': 2880, 'prefix': '/pnfs/grid.sara.nl/data/dune/disk/RSE', 'impl': 'rucio.rse.protocols.gfal.Default', 'domains': {'lan': {'read': 2, 'write': 1, 'delete': 1}, 'wan': {'read': 2, 'write': 1, 'delete': 1, 'third_party_copy_read': 1, 'third_party_copy_write': 1}}, 'extended_attributes': None}, {'hostname': 'penguin12.grid.surfsara.nl', 'scheme': 'root', 'port': 21094, 'prefix': '/pnfs/grid.sara.nl/data/dune/disk/RSE', 'impl': 'rucio.rse.protocols.gfal.Default', 'domains': {'lan': {'read': 1, 'write': 1, 'delete': 2}, 'wan': {'read': 1, 'write': 1, 'delete': 2, 'third_party_copy_read': 10, 'third_party_copy_write': 10}}, 'extended_attributes': None}]
INFO:root:Trying upload with root to SURFSARA
DEBUG:root:Processing upload with the domain: wan
DEBUG:root:gfal.Default: connecting to storage
DEBUG:root:The PFN created from the LFN: root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs
DEBUG:root:gfal.Default: checking if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs
DEBUG:root:gfal.Default: checking if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs.rucio.upload
DEBUG:root:put: Attempt 1
DEBUG:root:gfal.Default: uploading file from awt-1732317753-rvRmedEQqs to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs.rucio.upload
INFO:root:Successful upload of temporary file. root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs.rucio.upload
DEBUG:root:skip_upload_stat=False
DEBUG:root:stat: pfn=root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs.rucio.upload
DEBUG:root:gfal.Default: getting stats of file root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs.rucio.upload
DEBUG:root:Filesize: Expected=26 Found=26
DEBUG:root:Checksum: Expected=63c907a2 Found=63c907a2
DEBUG:root:Renaming file root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs.rucio.upload to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs
DEBUG:root:gfal.Default: renaming file from root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs.rucio.upload to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/4b/28/awt-1732317753-rvRmedEQqs
DEBUG:root:gfal.Default: closing protocol connection
DEBUG:root:Upload done.
INFO:root:Successfully uploaded file awt-1732317753-rvRmedEQqs
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v35_4_0/NULL/lib/python3.9/site-packages/urllib3/connectionpool.py:1061: InsecureRequestWarning: Unverified HTTPS request is being made to host 'dune-rucio.fnal.gov'. Adding certificate verification is strongly advised. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#ssl-warnings
warnings.warn(
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /traces/ HTTP/1.1" 404 207
DEBUG:dogpile.lock:value creation lock <dogpile.cache.region.CacheRegion._LockWrapper object at 0x14e00cfac040> acquired
DEBUG:dogpile.lock:Calling creation function for previously expired value
DEBUG:dogpile.cache.region:Cache value generated in 0.000 seconds for key(s): "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Released creation lock
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "PUT /replicas HTTP/1.1" 200 0
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /dids/testpro/awt-uploads-202447/dids HTTP/1.1" 503 299
2024-11-22 15:57:36,180 WARNING Waiting 0.25s due to reason: server returned 503
WARNING:baseclient:Waiting 0.25s due to reason: server returned 503
DEBUG:urllib3.connectionpool:Resetting dropped connection: dune-rucio.fnal.gov
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /dids/testpro/awt-uploads-202447/dids HTTP/1.1" 201 7
DEBUG:dogpile.lock:value creation lock <dogpile.cache.region.CacheRegion._LockWrapper object at 0x14e00cf75400> acquired
DEBUG:dogpile.lock:Calling creation function for previously expired value
DEBUG:dogpile.cache.region:Cache value generated in 0.000 seconds for key(s): "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Released creation lock
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /replicas/list HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-uploads-202447/files HTTP/1.1" 504 247
2024-11-22 15:59:24,655 WARNING Waiting 0.25s due to reason: server returned 504
WARNING:baseclient:Waiting 0.25s due to reason: server returned 504
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (2): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-uploads-202447/files HTTP/1.1" 504 247
2024-11-22 16:00:25,224 WARNING Waiting 0.5s due to reason: server returned 504
WARNING:baseclient:Waiting 0.5s due to reason: server returned 504
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (3): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-uploads-202447/files HTTP/1.1" 200 None
--- Upload try 1/1
--- Rucio upload 1/1 returns 0
--- Replica check try 1/1
--- Dataset awt-uploads-202447 check try 1/1
--- Upload, replicas, and datasets checks passed
'justin-rucio-upload --rse SURFSARA --protocol davs --scope testpro --dataset awt-uploads-202447 awt-1732317753-rvRmedEQqs --timeout 1200' returns 0
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1486589336/CN=173231775015
issuer : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1486589336
identity : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1486589336
type : RFC compliant proxy
strength : 2048 bits
path : /home/awt-proxy.pem
timeleft : 167:20:46
key usage : Digital Signature, Key Encipherment, Key Agreement
=== VO dune extension information ===
VO : dune
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk
issuer : /DC=org/DC=incommon/C=US/ST=Illinois/O=Fermi Research Alliance/CN=voms1.fnal.gov
attribute : /dune/Role=Production/Capability=NULL
attribute : /dune/Role=NULL/Capability=NULL
timeleft : 147:42:19
uri : voms1.fnal.gov:15042
===== Results =====
Download/upload commands:
xrdcp --force --nopbar --verbose $read_pfn downloaded.txt
echo '{"namespace":"testpro","name":"FILENAME","size":0}' >tmp.json
metacat file declare --json -f tmp.json "dune:all"
justin-rucio-upload --rse $rse_name --protocol $write_protocol --scope testpro --dataset awt-uploads-202447 --timeout 1200 FILENAME
Use the wrapper job link on the page for the job on the justIN Dashboard to find the full log file, with errors from these commands
Each line: $JUSTIN_SITE_NAME $rse_name $download_retval $upload_retval $read_pfn $write_protocol
==awt== CA_SFU DUNE_CERN_EOS 0 0 root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_ES_PIC 0 0 root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_FR_CCIN2P3_DISK 0 0 root://ccxrootdegee.in2p3.fr:1094/pnfs/in2p3.fr/data/dune/disk/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_UK_GLASGOW 0 0 root://cephc02.gla.scotgrid.ac.uk:1094//cephfs/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_UK_LANCASTER_CEPH 0 0 root://xgate.hec.lancs.ac.uk:1094//cephfs/grid/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_UK_MANCHESTER_CEPH 0 0 root://meitner.tier2.hep.manchester.ac.uk:1094//cephfs/experiments/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_US_BNL_SDCC 0 0 root://dcdndoor.sdcc.bnl.gov:1094//pnfs/sdcc.bnl.gov/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_US_FNAL_DISK_STAGE 0 0 root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU NIKHEF 0 0 root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU PRAGUE 0 0 root://golias100.farm.particle.cz:1094/dpm/farm.particle.cz/home/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU QMUL 51 0 root://xrootd01.escqmul.ac.uk:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU RAL-PP 0 0 root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU RAL_ECHO 0 0 root://xrootd.echo.stfc.ac.uk:1094/dune:/protodune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU SURFSARA 0 0 root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
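
The result lines above follow the field order given in the "Each line:" description, so they can be split on whitespace and checked for non-zero return values. The following is a minimal sketch in Python, not part of justIN itself: the names AwtResult, parse_awt_line and failed_results are hypothetical, and it assumes that no field contains whitespace, which holds for every line shown here. It does not interpret what a particular non-zero return value means.

```python
from typing import List, NamedTuple, Optional


class AwtResult(NamedTuple):
    """One '==awt==' result line (hypothetical helper type, not a justIN API)."""
    site: str
    rse: str
    download_retval: int
    upload_retval: int
    read_pfn: str
    write_protocol: str


def parse_awt_line(line: str) -> Optional[AwtResult]:
    """Parse one result line; return None for lines that do not match the format."""
    fields = line.split()
    # Expect the '==awt==' marker followed by exactly six fields.
    if len(fields) != 7 or fields[0] != "==awt==":
        return None
    try:
        return AwtResult(fields[1], fields[2], int(fields[3]),
                         int(fields[4]), fields[5], fields[6])
    except ValueError:
        # Return values were not integers, so treat the line as unparseable.
        return None


def failed_results(lines: List[str]) -> List[AwtResult]:
    """Return the parsed results whose download or upload return value is non-zero."""
    parsed = (parse_awt_line(line) for line in lines)
    return [r for r in parsed
            if r is not None and (r.download_retval != 0 or r.upload_retval != 0)]


# Two lines copied from the results above.
sample = [
    "==awt== CA_SFU QMUL 51 0 root://xrootd01.escqmul.ac.uk:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs",
    "==awt== CA_SFU SURFSARA 0 0 root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs",
]

for r in failed_results(sample):
    print(f"{r.rse}: download={r.download_retval} upload={r.upload_retval}")
# prints "QMUL: download=51 upload=0"
```

Applied to the full listing, this flags only the QMUL line, which is the one entry with a non-zero download return value.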