This repository has been archived by the owner on Jan 30, 2024. It is now read-only.

Rucio Traces

Paul Nilsson edited this page Mar 30, 2021 · 17 revisions

Note: this page is VO-specific and applies to users of Rucio.

The Pilot sends detailed information about file transfers to Rucio. Here is a list of the different fields contained in the trace report.

| Field name | Type | Meaning |
|---|---|---|
| appid | | PanDA job ID |
| catStart | | Time stamp of when Rucio was queried for replica information |
| clientState | | State at the time the trace is sent; e.g. INIT_REPORT, STAGEIN_FAILED, NO_REPLICA, STAGEIN_NOTALLOWED, STAGEIN_ATTEMPT_FAILED, STAGEOUT_ATTEMPT_FAILED, DONE |
| dataset | | Dataset name (prodDBlock from the job definition, or destinationDblock) |
| guid | | (currently not set by the Pilot) |
| eventType | | Type of trace. The Pilot currently sends the following event types (ordered by frequency; for user jobs, `_a` is appended to the type):<br>1. `get_sm_a`: stage-in of inputs for user analysis jobs<br>2. `get_sm`: stage-in of inputs for non-analysis jobs<br>3. `put_sm`: stage-out of outputs for non-analysis jobs<br>4. `get_es`: stage-in of inputs for eventservice jobs<br>5. `download`: EXPLANATION MISSING<br>6. `put_sm_logs`: stage-out of logs for non-analysis jobs<br>7. `put_sm_a`: stage-out of outputs for analysis jobs<br>8. `put_sm_logs_a`: stage-out of logs for analysis jobs<br>9. `upload`: EXPLANATION MISSING<br>10. `put_es`: stage-out of outputs for eventservice jobs<br>11. `put_sm_logs_os`: stage-out of logs to objectstore (special transfer defined in CRIC per PanDA queue) |
| eventVersion | String | Pilot version |
| filename | String | The logical file name (LFN) |
| filesize | | The file size |
| guid | String | A unique file identifier. The guid sent with the traces has the '-' signs removed |
| hostname | String | Host name as returned by `socket.gethostbyaddr(socket.gethostname())[0]` |
| ip | String | IP address of the local host |
| localSite | String | Local site name as defined by the `RUCIO_LOCAL_SITE_ID` environment variable. If this is not set, the DDM endpoint is used |
| protocol | String | Name of the copy tool used by the Pilot; e.g. xrdcp |
| remoteSite | String | DDM endpoint |
| scope | String | Replica scope |
| stateReason | String | Error message or explanation; e.g. BAD_COPYTOOL, OK, 'skip stagein file' |
| suspicious | String | Currently not reported by the Pilot (always set to '0') |
| taskid | String | PanDA task ID |
| timeEnd | String | Time stamp of when the transfer, replica lookup, etc. finished or failed in the Pilot |
| timeStart | String | Start time of the trace report |
| transferStart | String | Currently the same time stamp as relativeStart |
| url | String | TURL in case of direct access for a given replica |
| usr | String | The hash of the user DN (usrdn field); `hashlib.md5(job.prodUserID).hexdigest()` |
| usrdn | String | The distinguished name (DN) of the user |
| uuid | String | The hash of the JobDefinitionID (from the job definition) |
| validateStart | String | Time stamp of when the Pilot performs checksum verification during stage-in and stage-out |
| version | String | Currently not used by the Pilot |
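
Several of the fields above are derived rather than copied from the job definition: the guid has its '-' signs removed, usr is an MD5 hash of the user DN, and user jobs get an `_a` suffix on the event type. The following is a minimal sketch of how such values could be computed. It is not the Pilot's actual code: the function names (`strip_guid`, `hash_user_dn`, `event_type`, `local_hostname`, `derived_trace_fields`) are hypothetical, the UTF-8 encoding of the DN is an assumption needed for Python 3, and the fallback in `local_hostname` is an addition not described on this page.

```python
import hashlib
import socket


def strip_guid(guid):
    """Return the guid with the '-' signs removed, as sent with the traces."""
    return guid.replace("-", "")


def hash_user_dn(user_dn):
    """Return the MD5 hex digest of the user DN (the 'usr' field)."""
    # hashlib.md5 requires bytes in Python 3; UTF-8 is an assumption here.
    return hashlib.md5(user_dn.encode("utf-8")).hexdigest()


def event_type(base, is_user_job):
    """Append '_a' to the base event type for user jobs."""
    return base + "_a" if is_user_job else base


def local_hostname():
    """Host name via reverse lookup, as described for the 'hostname' field."""
    try:
        return socket.gethostbyaddr(socket.gethostname())[0]
    except OSError:
        # Assumed fallback (not from the page): use the plain host name
        # if the reverse lookup fails.
        return socket.gethostname()


def derived_trace_fields(guid, user_dn, base_event_type, is_user_job):
    """Collect the derived values into a dictionary keyed by field name."""
    return {
        "guid": strip_guid(guid),
        "usr": hash_user_dn(user_dn),
        "usrdn": user_dn,
        "eventType": event_type(base_event_type, is_user_job),
        "suspicious": "0",  # always '0' according to the table
    }
```

For example, `derived_trace_fields("0a1b2c3d-1111-2222-3333-444455556666", "/DC=org/CN=Some User", "get_sm", True)` would yield a guid without dashes and the event type `get_sm_a`. The DN and guid in this call are made-up placeholders.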