Jobsub ID 305780.0@justin-prod-sched01.dune.hep.ac.uk
Jobsub ID | 305780.0@justin-prod-sched01.dune.hep.ac.uk |
Workflow Testing | Yes |
Workflow ID | 500 |
Stage ID | 1 |
User name | amcnab@fnal.gov |
HTCondor Group | group_dune |
Requested | Processors | 1 |
Requested | GPU | No |
Requested | RSS bytes | 1073741824 (1024 MiB) |
Requested | Wall seconds limit | 3600 (1 hour) |
Submitted time | 2024-12-02 17:32:25 |
Site | US_FNAL-T1 |
Entry | CMSHTPC_T1_US_FNAL_condce_opp1_whole |
Last heartbeat | 2024-12-02 17:39:50 |
From worker node | Hostname | dunegli-37184-0-cmswn4008.fnal.gov |
From worker node | cpuinfo | AMD EPYC 7543 32-Core Processor |
From worker node | OS release | Scientific Linux release 7.9 (Nitrogen) |
From worker node | Processors | 1 |
From worker node | RSS bytes | 1073741824 (1024 MiB) |
From worker node | Wall seconds limit | 171000 (47 hours) |
From worker node | GPU | |
From worker node | Inner Apptainer? | True |
Job state | finished |
Allocator name | justin-allocator-int.dune.hep.ac.uk |
Started | 2024-12-02 17:33:19 |
Input files | |
Jobscript | Exit code | 0 |
Jobscript | Real time | 6m (382s) |
Jobscript | CPU time | 0m (14s = 3%) |
Outputting started | 2024-12-02 17:39:42 |
Output files | |
Finished | 2024-12-02 17:39:50 |
Saved logs | justin-logs:305780.0-justin-prod-sched01.dune.hep.ac.uk.logs.tgz |
List job events | Wrapper job log |
Jobscript log (last 10,000 characters)
:https://dune-rucio.fnal.gov:443 "POST /dids/testpro/awt-uploads-202449 HTTP/1.1" 409 104
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
INFO:root:Dataset testpro:awt-uploads-202449 already exists - no rule will be created
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-1733160802-Dp8UJvMSbz/meta?plugin=DID_COLUMN HTTP/1.1" 404 129
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:root:File DID does not exist
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /replicas HTTP/1.1" 201 7
INFO:root:Successfully added replica in Rucio catalogue at SURFSARA
DEBUG:rucio.rse.protocols.protocol:PFN2LFN function will not be fetched from the policy package
DEBUG:root:gfal.Default: connecting to storage
DEBUG:root:gfal.Default: checking if file exists None
DEBUG:root:Checking if root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz exists
DEBUG:root:gfal.Default: checking if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz
DEBUG:root:gfal.Default: closing protocol connection
DEBUG:root:[{'hostname': 'webdav.grid.surfsara.nl', 'scheme': 'davs', 'port': 2880, 'prefix': '/pnfs/grid.sara.nl/data/dune/disk/RSE', 'impl': 'rucio.rse.protocols.gfal.Default', 'domains': {'lan': {'read': 2, 'write': 1, 'delete': 1}, 'wan': {'read': 2, 'write': 1, 'delete': 1, 'third_party_copy_read': 1, 'third_party_copy_write': 1}}, 'extended_attributes': None}, {'hostname': 'penguin12.grid.surfsara.nl', 'scheme': 'root', 'port': 21094, 'prefix': '/pnfs/grid.sara.nl/data/dune/disk/RSE', 'impl': 'rucio.rse.protocols.gfal.Default', 'domains': {'lan': {'read': 1, 'write': 1, 'delete': 2}, 'wan': {'read': 1, 'write': 1, 'delete': 2, 'third_party_copy_read': 10, 'third_party_copy_write': 10}}, 'extended_attributes': None}]
INFO:root:Trying upload with root to SURFSARA
DEBUG:root:Processing upload with the domain: wan
DEBUG:root:gfal.Default: connecting to storage
DEBUG:root:The PFN created from the LFN: root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz
DEBUG:root:gfal.Default: checking if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz
DEBUG:root:gfal.Default: checking if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz.rucio.upload
DEBUG:root:put: Attempt 1
DEBUG:root:gfal.Default: uploading file from awt-1733160802-Dp8UJvMSbz to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz.rucio.upload
INFO:root:Successful upload of temporary file. root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz.rucio.upload
DEBUG:root:skip_upload_stat=False
DEBUG:root:stat: pfn=root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz.rucio.upload
DEBUG:root:gfal.Default: getting stats of file root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz.rucio.upload
DEBUG:root:Filesize: Expected=26 Found=26
DEBUG:root:Checksum: Expected=5f39072d Found=5f39072d
DEBUG:root:Renaming file root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz.rucio.upload to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz
DEBUG:root:gfal.Default: renaming file from root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz.rucio.upload to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/ba/0f/awt-1733160802-Dp8UJvMSbz
DEBUG:root:gfal.Default: closing protocol connection
DEBUG:root:Upload done.
INFO:root:Successfully uploaded file awt-1733160802-Dp8UJvMSbz
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v35_4_0/NULL/lib/python3.9/site-packages/urllib3/connectionpool.py:1061: InsecureRequestWarning: Unverified HTTPS request is being made to host 'dune-rucio.fnal.gov'. Adding certificate verification is strongly advised. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#ssl-warnings
warnings.warn(
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /traces/ HTTP/1.1" 404 207
DEBUG:dogpile.lock:value creation lock <dogpile.cache.region.CacheRegion._LockWrapper object at 0x1542855ad250> acquired
DEBUG:dogpile.lock:Calling creation function for previously expired value
DEBUG:dogpile.cache.region:Cache value generated in 0.000 seconds for key(s): "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Released creation lock
DEBUG:urllib3.connectionpool:Resetting dropped connection: dune-rucio.fnal.gov
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "PUT /replicas HTTP/1.1" 200 0
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /dids/testpro/awt-uploads-202449/dids HTTP/1.1" 201 7
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /replicas/list HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-uploads-202449/files HTTP/1.1" 200 None
--- Upload try 1/1
--- Rucio upload 1/1 returns 0
--- Replica check try 1/1
--- Dataset awt-uploads-202449 check try 1/1
--- Upload, replicas, and datasets checks passed
'justin-rucio-upload --rse SURFSARA --protocol davs --scope testpro --dataset awt-uploads-202449 awt-1733160802-Dp8UJvMSbz --timeout 1200' returns 0
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1653189242/CN=173316079987
issuer : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1653189242
identity : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1653189242
type : RFC compliant proxy
strength : 2048 bits
path : /home/awt-proxy.pem
timeleft : 167:53:37
key usage : Digital Signature, Key Encipherment, Key Agreement
=== VO dune extension information ===
VO : dune
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk
issuer : /DC=org/DC=incommon/C=US/ST=Illinois/O=Fermi Research Alliance/CN=voms1.fnal.gov
attribute : /dune/Role=Production/Capability=NULL
attribute : /dune/Role=NULL/Capability=NULL
timeleft : 153:31:21
uri : voms1.fnal.gov:15042
===== Results =====
Download/upload commands:
xrdcp --force --nopbar --verbose $read_pfn downloaded.txt
echo '{"namespace":"testpro","name":"FILENAME","size":0}' >tmp.json
metacat file declare --json -f tmp.json "dune:all"
justin-rucio-upload --rse $rse_name --protocol $write_protocol --scope testpro --dataset awt-uploads-202449 --timeout 1200 FILENAME
To find the full log file, including errors from these commands, use the wrapper job link on this job's page on the justIN Dashboard.
Each line: $JUSTIN_SITE_NAME $rse_name $download_retval $upload_retval $read_pfn $write_protocol
==awt== US_FNAL-T1 DUNE_CERN_EOS 0 0 root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_ES_PIC 0 0 root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_FR_CCIN2P3_DISK 0 0 root://ccxrootdegee.in2p3.fr:1094/pnfs/in2p3.fr/data/dune/disk/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_UK_GLASGOW 0 0 root://cephc02.gla.scotgrid.ac.uk:1094//cephfs/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_UK_LANCASTER_CEPH 0 0 root://xgate.hec.lancs.ac.uk:1094//cephfs/grid/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_UK_MANCHESTER_CEPH 0 0 root://meitner.tier2.hep.manchester.ac.uk:1094//cephfs/experiments/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_US_BNL_SDCC 0 0 root://dcdndoor.sdcc.bnl.gov:1094//pnfs/sdcc.bnl.gov/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_US_FNAL_DISK_STAGE 0 0 root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 NIKHEF 0 0 root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 PRAGUE 0 0 root://golias100.farm.particle.cz:1094/dpm/farm.particle.cz/home/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 QMUL 51 0 root://xrootd01.escqmul.ac.uk:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 RAL-PP 0 0 root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 RAL_ECHO 0 0 root://xrootd.echo.stfc.ac.uk:1094/dune:/protodune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 SURFSARA 0 0 root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
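
The result lines above follow the whitespace-separated layout given in the "Each line:" description, so they can be checked mechanically. The following is a minimal sketch, assuming each result line has exactly seven tokens (the ==awt== marker plus the six documented fields) and that a return value of 0 means success. The names AwtResult, parse_awt_line and failed_rses are hypothetical helpers for illustration and are not part of justIN.

```python
from typing import List, NamedTuple, Optional


class AwtResult(NamedTuple):
    """One ==awt== result line (hypothetical container, not a justIN type)."""
    site: str
    rse: str
    download_retval: int
    upload_retval: int
    read_pfn: str
    write_protocol: str


def parse_awt_line(line: str) -> Optional[AwtResult]:
    """Parse one result line; return None for anything that does not match."""
    fields = line.split()
    # Assumption: marker plus the six fields named in the "Each line:" description.
    if len(fields) != 7 or fields[0] != "==awt==":
        return None
    _, site, rse, download, upload, pfn, protocol = fields
    try:
        return AwtResult(site, rse, int(download), int(upload), pfn, protocol)
    except ValueError:
        # Return values were not integers, so treat the line as unparseable.
        return None


def failed_rses(lines: List[str]) -> List[str]:
    """Names of RSEs whose download or upload return value is non-zero."""
    failed = []
    for line in lines:
        result = parse_awt_line(line)
        if result is None:
            continue
        if result.download_retval != 0 or result.upload_retval != 0:
            failed.append(result.rse)
    return failed


# Two lines copied from the results above.
sample = [
    "==awt== US_FNAL-T1 QMUL 51 0 root://xrootd01.escqmul.ac.uk:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs",
    "==awt== US_FNAL-T1 SURFSARA 0 0 root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs",
]
print(failed_rses(sample))
```

Applied to the full listing, this would single out QMUL, the only RSE with a non-zero download return value (51) in this job; every upload return value is 0.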