-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Runner pod always comes back in Terminating state #37
Comments
I am seeing the same error as well (I think the pod object just did not get recreated?)
|
@lanecm Thank you for the report! |
@summerwind Here is what I did,
Running into same issue |
@summerwind -- Similar to what @chenrui333 did: I deleted the runnerdeployment using:
I've also tried deleting the controller as well. Let me know if I can provide any additional information! |
@summerwind -- Just wanted to checkin: Any updates on this issue? Or any additional information I can provide or help with? |
Sorry for the delayed response. I'll see if it reproduces in my environment. |
I couldn't reproduce the problem in my environment as follows. Manifest
Starting runnersAll runners have been successfully started.
Deleting runnersConfirmed that all runners have been stopped.
@lanecm Can I see your pod status in the terminating state with the following command?
|
Hi @summerwind -- Yes, output:
Thank you for investigating! Please let me know how I can help. |
@lanecm Thank you for the information.
|
@summerwind -- Output for each command:
|
@summerwind -- Unfortunately, it's bit tricky to get the pod logs, but it's stuck in a loop:
Not sure the best way to get the logs? |
Thanks! How about to use
|
Hey @summerwind, I've encountered the same issue in our EKS cluster. The runner pod is stuck in terminated state and there is an event saying
Runner has service account attached with I managed to workaround it hardcoding volumes same way as mutating webhook like this: apiVersion: actions.summerwind.dev/v1alpha1
kind: RunnerDeployment
metadata:
name: ${var.github_organization}-runner
namespace: actions-runner-system
spec:
replicas: 3
template:
spec:
image: summerwind/actions-runner-dind:latest
dockerdWithinRunnerContainer: true
organization: ${var.github_organization}
env:
- name: AWS_REGION
value: ${var.region}
- name: AWS_DEFAULT_REGION
value: ${var.region}
- name: AWS_ROLE_ARN
value: ${aws_iam_role.actions_runner.arn}
- name: AWS_WEB_IDENTITY_TOKEN_FILE
value: /var/run/secrets/eks.amazonaws.com/serviceaccount/token
securityContext:
fsGroup: 65534
volumeMounts:
- mountPath: /var/run/secrets/eks.amazonaws.com/serviceaccount
name: aws-iam-token
readOnly: true
volumes:
- name: aws-iam-token
projected:
defaultMode: 420
sources:
- serviceAccountToken:
audience: sts.amazonaws.com
expirationSeconds: 86400
path: token |
@svrakitin It's been addressed in #200 and the PR #226 and should be non-issue today! Thanks for reporting. |
Closing as resolved. |
I deleted a runner pod to pick IRSA changes, but the pod always comes back in Terminating state:
In the manager logs, I see the following error:
Any ideas on what I'm doing wrong? Thank you!
The text was updated successfully, but these errors were encountered: