Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FKNMS InfluxDB container crashes every few days. #52

Open
dotis opened this issue Jul 31, 2023 · 3 comments
Open

FKNMS InfluxDB container crashes every few days. #52

dotis opened this issue Jul 31, 2023 · 3 comments
Assignees
Labels

Comments

@dotis
Copy link
Collaborator

dotis commented Jul 31, 2023

I triggered DAG runs for FKNMS and FGBNMS dashes and now FK is not working, although it did work yesterday. FGB seems fine. I wonder if this is the same docker issue?

@7yl4r
Copy link
Member

7yl4r commented Aug 2, 2023

Looks like it failed for 2 days then went back to working. The error message does look related to #53 (InfluxDB outage):

[2023-07-31 13:41:56,351] {subprocess.py:78} INFO - requests.exceptions.ConnectionError: HTTPConnectionPool(host='35.211.75.134', port=8086): Max retries exceeded with url: /query?q=SHOW+DATABASES&db=fwc_coral_disease (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f196406e820>: Failed to establish a new connection: [Errno 111] Connection refused'))

Closing as resolved by #53

@7yl4r 7yl4r closed this as completed Aug 2, 2023
@dotis
Copy link
Collaborator Author

dotis commented Aug 8, 2023

Need to re-open this. DAGs not running and getting bad gateway error w/no time series data for FK. FGB board seems fine.

@dotis dotis reopened this Aug 8, 2023
@7yl4r
Copy link
Member

7yl4r commented Aug 11, 2023

This "bad gateway" error means that the gateway to InfluxDB isn't working, and that is almost always because the influxDB container has died.

tylarmurray@fknms-dashboard-04:~$ docker container ls -a
CONTAINER ID   IMAGE                                      COMMAND                  CREATED        STATUS                      PORTS                                                 NAMES
[...]
1353c06aedc5   influxdb:1.8                               "/entrypoint.sh infl…"   6 weeks ago    Exited (137) 5 days ago                                                           influxdb
[...]

Same as before in #53 there is again nothing in the logs.
I can restart this easily whenever it goes down, but we need to identify why the container is crashing so I can stop it.
There may be logs within the container somewhere that I can dig into.
I will try that next time; for now I am pouring all spare energy into the POSE proposal.

@7yl4r 7yl4r changed the title FKNMS DAG runs failing FKNMS InfluxDB container crashes every few days. Aug 11, 2023
@7yl4r 7yl4r added bug Something isn't working tool-influxdb client-fknms labels Aug 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants