Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug: deploy/agent deleted on k8s auto upgrade on k3s #4132

Open
dlarson04 opened this issue Aug 12, 2024 · 1 comment
Open

Bug: deploy/agent deleted on k8s auto upgrade on k3s #4132

dlarson04 opened this issue Aug 12, 2024 · 1 comment
Assignees

Comments

@dlarson04
Copy link
Contributor

Describe the bug.

Intermittendly, the OH agent pod is not running after an auto upgrade ... but the NMP status changes to success. Seeing this in the cronjob log

2024-08-11 12:21:09 cronjob under namesapce: edgecluster-ns03
2024-08-11 12:21:09 DEBUG: get_status_path() start
2024-08-11 12:21:09 VERBOSE: Checking if /var/horizon/nmp directory exists...
2024-08-11 12:21:09 VERBOSE: Checking if /var/horizon/nmp/{org} directory exists...
2024-08-11 12:21:09 VERBOSE: Searching NMP subdirectories...
2024-08-11 12:21:09 VERBOSE: Getting latest upgrade job status file...
2024-08-11 12:21:09 STATUS_PATH is /var/horizon/nmp/myorg/Mesh-NMP/status.json
2024-08-11 12:21:09 VERBOSE: Found job: /var/horizon/nmp/myorg/Mesh-NMP/status.json
2024-08-11 12:21:09 DEBUG: get_status_path() end
2024-08-11 12:21:10 DEBUG: Pod status: Running
Pending
2024-08-11 12:21:14 DEBUG: Deployment status: deployment
deployment
Running
cat: /var/horizon/nmp/myorg/Mesh-NMP/status.json: No such file or directory
2024-08-11 12:21:15 DEBUG: Cron Job status:
2024-08-11 12:21:15 Checking if agent is running and deployment is successful...
2024-08-11 12:21:15 Agent is not running. Checking if rollback was already attempted...
2024-08-11 12:21:15 DEBUG: Checking if agent upgrade was initiated...
2024-08-11 12:21:15 Starting rollback process...
2024-08-11 12:21:15 VERBOSE: Setting the status to "rollback started"...
jq: error: Could not open file /var/horizon/nmp/myorg/Mesh-NMP/status.json: No such file or directory
/usr/local/bin/auto-upgrade-cronjob.sh: line 531: /var/horizon/nmp/myorg/Mesh-NMP/status.json: No such file or directory

Describe the steps to reproduce the behavior.

No response

Expected behavior.

install edge cluster agent on k3s
Trigger an auto upgrade
Intermittent failure

Screenshots.

No response

Operating Environment

Linux

Additional Information

No response

@dlarson04
Copy link
Contributor Author

k8s-upgrade-fail.tar.gz

LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 13, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 13, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 14, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 15, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 15, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 15, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 16, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 16, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 16, 2024
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 16, 2024
LiilyZhang added a commit that referenced this issue Aug 16, 2024
Issue #4132 - Bug: deploy/agent deleted on k8s auto upgra…
LiilyZhang added a commit to LiilyZhang/anax that referenced this issue Aug 16, 2024
LiilyZhang added a commit that referenced this issue Aug 17, 2024
Issue #4132 - Bug: deploy/agent deleted on k8s auto upgra…
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants