Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BACKLOG] Monitoring Systems (metrics collection and visualization) #459

Open
1 of 6 tasks
bombnp opened this issue Jan 10, 2023 · 0 comments
Open
1 of 6 tasks

[BACKLOG] Monitoring Systems (metrics collection and visualization) #459

bombnp opened this issue Jan 10, 2023 · 0 comments
Labels
new feature New feature

Comments

@bombnp
Copy link
Member

bombnp commented Jan 10, 2023

Problem

There's currently no monitoring dashboard for our system in the following categories:

  1. Resource usage (CPU, Memory, Disk, I/O) -> can utilize open-source metric exporters
  2. Performance (Latency, Error rates) -> requires custom logging?

We want to identify areas to optimize resources since our resources are starting to run out.

Task Description

Create a monitoring system consisting of metrics collection (through exporters and Prometheus) and visualization (through Grafana?). Visualize metrics by resources, nodes, pods, or other api objects as needed.

Additional Context

For now, I've enabled Prometheus + node-exporter + kube-state-metrics stacks integration from Lens(in lens-metrics namespace), which can be used to visualize usage of specific nodes/pods, but multiple at the same time. It's likely you'd be using the same stacks, but some metrics must be installed ourselves.

Related Teams

  • Frontend
  • Backend
  • Data
  • Design
  • Infra
  • QA

Task Advisors

@bombnp

@bombnp bombnp added the new feature New feature label Jan 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new feature New feature
Projects
Status: Todo
Development

No branches or pull requests

1 participant