This repository has been archived by the owner on Jan 30, 2024. It is now read-only.

Error codes

Paul Nilsson edited this page Mar 5, 2021 · 17 revisions

When it detects a fatal problem, the Pilot assigns an error code and informs the server. In addition to the numerical code, it reports the error meaning and more detailed error diagnostics. The current range of error codes is listed in the [Pilot 2 wiki](https://twiki.cern.ch/twiki/bin/view/PanDA/Pilot2ErrorCodes).
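As a rough illustration of the three pieces of information described above (numerical code, meaning, diagnostics), the following sketch bundles them into one record. The `ErrorReport` class, its field names, and the diagnostics text are hypothetical and are not taken from the Pilot source; only the code 1099 and its meaning come from this page.

```python
from dataclasses import dataclass, asdict


@dataclass
class ErrorReport:
    """Hypothetical container for what is reported to the server."""
    code: int          # numerical error code, e.g. 1099
    meaning: str       # short, fixed description of the code
    diagnostics: str   # free-form details about this particular failure


report = ErrorReport(
    code=1099,
    meaning="Failed to stage-in file",
    diagnostics="copytool exited with a non-zero status",  # made-up example text
)

# A plain dictionary is one possible shape for the message sent to the server.
payload = asdict(report)
```

Keeping the fixed meaning separate from the free-form diagnostics lets the same code carry different details for each failure.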

| Error code | Acronym | Meaning | Notes |
| --- | --- | --- | --- |
| 1008 | GENERALERROR | General pilot error, consult batch log | |
| 1098 | NOLOCALSPACE | Not enough local space | Set e.g. by job monitoring, and also when a copytool command fails with "No space left on device" in its output |
| 1099 | STAGEINFAILED | Failed to stage-in file | |
| 1100 | REPLICANOTFOUND | The Rucio API function `list_replicas()` did not return any replicas. Check log for details. | |
| 1103 | NOSUCHFILE | No such file or directory | Error thrown by the `open_file()` function. Also set when a copytool fails with "No such file or directory" in its output |
| 1104 | USERDIRTOOLARGE | User work directory too large | Set if the user work directory exceeds the maximum allowed size, as defined by `schedconfig.maxwdir` (default: 14 GB) |
| 1106 | STDOUTTOOBIG | Payload log or stdout file too big | Set if stdout exceeds the maximum allowed size of 2 GB, as defined in the Pilot's default config file |
| 1110 | SETUPFAILURE | Failed during payload setup | |
| 1115 | NFSSQLITE | NFS SQLite locking problems | The Pilot identifies this error by searching (grep) the payload stdout for the strings "prepare 5 database is locked" and "Error SQLiteStatement" |
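Several of the notes above describe errors that are identified by searching command output or payload stdout for a known string. A minimal sketch of that kind of matching is shown below. The function and constant names are hypothetical and this is not the Pilot's actual implementation. Treating either of the two NFSSQLITE strings as sufficient is also an assumption, since the table does not say whether both must be present.

```python
from typing import Optional

# Codes taken from the table above; all names below are hypothetical.
NOLOCALSPACE = 1098
NOSUCHFILE = 1103
NFSSQLITE = 1115

# Strings looked for in copytool command output, checked in this order.
COPYTOOL_PATTERNS = (
    ("No space left on device", NOLOCALSPACE),
    ("No such file or directory", NOSUCHFILE),
)

# Strings looked for in the payload stdout.
# Assumption: finding either one is enough to flag the error.
NFSSQLITE_PATTERNS = (
    "prepare 5 database is locked",
    "Error SQLiteStatement",
)


def classify_copytool_output(output: str) -> Optional[int]:
    """Return the first matching error code for copytool output, or None."""
    for pattern, code in COPYTOOL_PATTERNS:
        if pattern in output:
            return code
    return None


def classify_payload_stdout(stdout: str) -> Optional[int]:
    """Return NFSSQLITE if a known locking message appears, else None."""
    if any(pattern in stdout for pattern in NFSSQLITE_PATTERNS):
        return NFSSQLITE
    return None
```

For example, `classify_copytool_output("cp: write error: No space left on device")` returns 1098, while output that contains none of the listed strings returns `None`, leaving the caller to fall back to a more general code.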