Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Meta: Export cluster resource usage and health metrics #742

Open
4 tasks
dynamic-entropy opened this issue Mar 6, 2024 · 2 comments
Open
4 tasks

Meta: Export cluster resource usage and health metrics #742

dynamic-entropy opened this issue Mar 6, 2024 · 2 comments
Assignees

Comments

@dynamic-entropy
Copy link
Contributor

Enhancement Description

Track resource usage metrics such as CPU and memory

Export resource and health metrics from our cluster

  • Application (daemons, server ... )
  • Infrastructure (Prometheus, fluent bit ... )

In monit

  • Create dashboards for monitoring resources
  • Setup alerts for various components

Use Case

Often, errors and misbehaving processes go unnoticed until they have caused inconvenience and are manually reported by someone. This is slow and the delay sometimes causes us to have a recovery period for the system; which is not desired.

Possible Solution

No response

Related Issues

No response

@ericvaandering
Copy link
Member

Look at what we already have from kube-eagle

@ericvaandering
Copy link
Member

@Panos512 will talk with IT and/or @dynamic-entropy will talk with @arooshap and @vkuznet to see what the generic/supported way of doing this will be

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants