-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
goofys needs remounting #671
Comments
The logs give more clues about
The mount process was killed as the system ran out of memory. In perspective of using this production, I agree this should definitely be reported asap. Unless someone suggests a different option, I'll look into adding a monitoring endpoint. |
Yes, monitoring would be great, thanks. In the meantime, how to I unmount/remount to get up and running again? |
In general
is the way to mount the bucket |
I previously had the bucket mounted with:
But now I think I need to unmount before remounting. If I try to run that mounting command I get:
Not sure where to see |
Agreed, and this might be a side-effect of the way the process was terminated. Does the following work?
|
|
A few thoughts: |
Good thoughts @sbesson!
No |
Seen again today on
Fixed as above |
Failed again on idr-testing..
|
Will try adding
|
Since goofys mount has been stable through all the memo file generation and check_pixels.py testing, this is looking good. Still an open question of whether we need any additional monitoring of the goofys mount or if other monitoring would pick this up (e.g. if images can't be viewed, error logs etc)? |
Externally, it should be possible to extend https://github.com/IDR/upptime to monitor specific endpoints testing images available via Internally, the best strategy would be to make use Prometheus e.g. by defining custom alert rules like https://github.com/IDR/deployment/blob/368051656c541d315c3cc77b59762f964abf972a/ansible/idr-ftp-monitoring.yml#L23. |
To compare goofys and geesefs (similar to benchmark test at https://github.com/yandex-cloud/geesefs/blob/master/bench/README.md#goofys-tests), lets compare speed of Let's pick a couple of different idr0011 plates each time... Check a single plane at a time On
Install geesefs
Test mount a bucket...
Now replace goofys mount...
Now repeat testing above... new plate...
Conclusion - performance is comparable between goofys and geesefs. |
👍 on similar performance. A follow up question might be whether or not it handles large numbers, though. |
@joshmoore Good point.... I tried the So, for each of the readonly servers on idr-testing, I installed geesefs and replaced Then tested with idr-testing... I still managed to get the server quite unhappy, but it felt like it lasted several times longer with |
Switched idr-testing back to using goofys...
|
All the BioStudies s3 data imported to
idr0125-pilot
is currently givingResourceError
when trying to view images.This is due to a failure of the goofys-mounted BioStudies s3 bucket.
See kahing/goofys#208
Advice is to unmount and re-mount.
Tried un-mounting as at kahing/goofys#77
As
omero-server
user:This is currently a blocker on my NGFF update work on
idr0125-pilot
(can use idr0138-pilot in the mean time) but it also raises questions on how to detect and fix this once we start using it on production IDR server.cc @sbesson @joshmoore
The text was updated successfully, but these errors were encountered: