Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-enable OSG 23 development images (SOFTWARE-5673) #162

Conversation

brianhlin
Copy link
Member

No description provided.

@matyasselmeci
Copy link
Contributor

LGTM; once pelican-osdf-compat is in the htcondor 23.x daily repos (and therefore osg-23-upcoming-development repos) then you should rerun that failed job and merge.

@matyasselmeci
Copy link
Contributor

Brian, if this PR is still relevant, can you rerun the jobs and merge if they pass?

@brianhlin brianhlin force-pushed the SOFTWARE-5673.reenable-osg23-dev branch from f2c14d7 to 3634503 Compare May 21, 2024 17:45
@brianhlin
Copy link
Member Author

I couldn't find a button to rerun tests for CI this old so I pushed a dummy commit then removed it and force pushed

@brianhlin
Copy link
Member Author

Hrm, I'm not sure I understand the failures:

  1. Testing singularity detection inside the backfill container.
INFO   Determining extra Singularity capabilities...
INFO   Sourcing /pilot/osgvo-pilot-ufwmtr/add_config_line.source
INFO  Error testing for SIF support; output:
INFO  FATAL:   container creation failed: image driver mount failure: image driver squashfuse_ll instance exited with error: squashfuse_ll exited: fuse: device not found, try 'modprobe fuse' first
  1. Similarly in the startd
<149>1 2024-05-21T17:49:05.320Z b8df5ed7ea4c condor_starter 4543 - [cat="D_ALWAYS:2" slot="testing" GLIDEIN_ResourceName="'None'" GLIDEIN_Site="'None'"] Attempting to run: '/usr/bin/apptainer exec -C /usr/libexec/condor/exit_37.sif /exit_37'.
<149>1 2024-05-21T17:49:05.364Z b8df5ed7ea4c condor_starter 4543 - [cat="D_ALWAYS" slot="testing" GLIDEIN_ResourceName="'None'" GLIDEIN_Site="'None'"] '/usr/bin/apptainer exec -C /usr/libexec/condor/exit_37.sif /exit_37' did not exit successfully (code 65280); stderr is :
<149>1 2024-05-21T17:49:05.364Z b8df5ed7ea4c condor_starter 4543 - [cat="D_ALWAYS" slot="testing" GLIDEIN_ResourceName="'None'" GLIDEIN_Site="'None'"] [singularity stderr]: FATAL:   container creation failed: image driver mount failure: image driver squashfuse_ll instance exited with error: squashfuse_ll exited: fuse: device not found, try 'modprobe fuse' first
<29>1 2024-05-21T17:49:05.364Z b8df5ed7ea4c supervisord 1 - [level="INFO" GLIDEIN_ResourceName="'None'" GLIDEIN_Site="'None'"] reaped unknown pid 4607 (terminated by SIGKILL)

@matyasselmeci
Copy link
Contributor

Possibly #171 fixes this. Brian, can you rebase on master and try again?

@brianhlin brianhlin force-pushed the SOFTWARE-5673.reenable-osg23-dev branch from 3634503 to a8e81d7 Compare July 10, 2024 13:19
@matyasselmeci
Copy link
Contributor

I see the problem. I was expecting 23.10.0 to fix the issue and wrote the version comparison accordingly (>=23.8.0, <23.10.0), but the daily is technically called 23.10.0 and doesn't yet fix it.

@brianhlin brianhlin force-pushed the SOFTWARE-5673.reenable-osg23-dev branch from a8e81d7 to 3d46662 Compare July 10, 2024 21:19
@matyasselmeci matyasselmeci merged commit 44bb85f into opensciencegrid:master Jul 10, 2024
11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants