feat!(metrics): simplify power metrics by consolidating zone-based metrics #2105
Force-pushed from cfcc7c4 to 8141311
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@            Coverage Diff             @@
##           reboot    #2105      +/-   ##
==========================================
- Coverage   92.22%   92.10%   -0.13%
==========================================
  Files          30       30
  Lines        2226     2140      -86
==========================================
- Hits         2053     1971      -82
- Misses        131      134       +3
+ Partials       42       35       -7
…trics

The new approach simplifies metric naming by replacing dynamically generated zone-based metrics with static CPU (device) specific metrics that carry zone labels. This reduces cardinality, improves queryability, and minimizes runtime complexity.

Changes:
- Replaced dynamic zone-based metric descriptors with static CPU (device) specific descriptors. E.g. `kepler_node_package_joules_total` -> `kepler_node_cpu_joules_total{zone="package"}`
- Consolidated metrics under `kepler_<level>_cpu_<unit>` with zone labels.
- Removed the `kepler_node_energy_zone` metric as it is no longer required.
- Added helper functions for descriptor creation.
- Used consistent label names for "zone" and "container_id" to make writing queries easier.

Signed-off-by: Sunil Thaha <sthaha@redhat.com>
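To make the `kepler_<level>_cpu_<unit>` naming scheme concrete, here is a minimal sketch of what a descriptor-creation helper could look like. It uses only the standard library; the `desc` type and the `cpuDesc` function are hypothetical stand-ins invented for illustration and are not the helpers added in this PR.

```go
package main

import (
	"fmt"
	"strings"
)

// desc is a stdlib-only stand-in for a metric descriptor.
// Hypothetical type for illustration; not Kepler's or Prometheus' type.
type desc struct {
	Name   string
	Labels []string
}

// cpuDesc builds a static descriptor named kepler_<level>_cpu_<unit>.
// "zone" is always the first label; any extra labels follow it.
func cpuDesc(level, unit string, extra ...string) desc {
	name := strings.Join([]string{"kepler", level, "cpu", unit}, "_")
	return desc{Name: name, Labels: append([]string{"zone"}, extra...)}
}

func main() {
	fmt.Println(cpuDesc("node", "joules_total"))
	fmt.Println(cpuDesc("container", "watts", "container_id"))
}
```

Because the zone is a label value rather than part of the name, the set of descriptors is fixed at compile time and does not depend on which zones a machine exposes.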
Force-pushed from 8141311 to 817785f
Signed-off-by: Sunil Thaha <sthaha@redhat.com>

  "instant": false,
- "legendFormat": "Kepler Reboot Node (${zones})",
+ "legendFormat": "Kepler Reboot Node (${zone}) (ΔJ/s)",
Greek letter 'delta'?

- "kepler_node_package_joules_total",
- "kepler_node_package_watts",
- "kepler_node_dram_joules_total",
- "kepler_node_dram_watts",
- "kepler_node_energy_zone",
+ "kepler_node_cpu_joules_total",
+ "kepler_node_cpu_watts",
Do we need more tests for zone name assertions?
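One possible shape for such an assertion, sketched with the standard library only: scan the text exposition of a metric and report which expected zones never appear as a `zone` label. The function name `missingZones`, the sample values, and the zone names used here are assumptions for illustration, not code from this PR.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// zoneRe extracts the value of a zone="..." label from an exposition line.
var zoneRe = regexp.MustCompile(`\bzone="([^"]*)"`)

// missingZones returns the zones in want that never appear as a zone label
// on samples of the given metric. Hypothetical helper for illustration.
func missingZones(exposition, metric string, want []string) []string {
	seen := map[string]bool{}
	for _, line := range strings.Split(exposition, "\n") {
		// Only consider labelled samples of the metric under test.
		if !strings.HasPrefix(line, metric+"{") {
			continue
		}
		if m := zoneRe.FindStringSubmatch(line); m != nil {
			seen[m[1]] = true
		}
	}
	var missing []string
	for _, z := range want {
		if !seen[z] {
			missing = append(missing, z)
		}
	}
	return missing
}

func main() {
	// Sample exposition text with made-up values.
	out := `kepler_node_cpu_watts{zone="package"} 12.5
kepler_node_cpu_watts{zone="dram"} 3.1`
	fmt.Println(missingZones(out, "kepler_node_cpu_watts", []string{"package", "dram", "core"}))
}
```

A test could then fail whenever the returned slice is non-empty for the zones the fake meter is configured with.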
Merged commit 3d01253 into sustainable-computing-io:reboot
The new approach simplifies metric naming by replacing dynamically generated zone-based metrics with static CPU (device) specific metrics that carry zone labels.

Pros:
- No `label_replace` hacks are needed to query by selected zones, as zone is now a label.
- Validates that the sum of process watts in a container equals the container's power use for each available zone.
- `kepler_node_cpu_core|package_`

Changes:
- Replaced dynamic zone-based metric descriptors with static CPU (device) specific descriptors. E.g. `kepler_node_package_joules_total` -> `kepler_node_cpu_joules_total{zone="package"}`
- Consolidated metrics under `kepler_<level>_cpu_<unit>` with zone labels.
- Removed the `kepler_node_energy_zone` metric as it is no longer required.
- Added helper functions for descriptor creation.
- Used consistent label names for "zone" and "container_id" to make writing queries easier.
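Since this is a breaking change, existing dashboards and alerts that reference the old per-zone names need updating. The sketch below shows the renaming rule from the Changes list as a small standard-library function. The `consolidate` helper and the zone list passed to it are assumptions for illustration; Kepler does not ship such a helper.

```go
package main

import (
	"fmt"
	"strings"
)

// consolidate maps a legacy per-zone metric name such as
// "kepler_node_package_joules_total" to the consolidated metric name and
// the zone label value. Hypothetical helper for illustration only.
func consolidate(legacy string, zones []string) (name, zone string, ok bool) {
	for _, z := range zones {
		marker := "_" + z + "_"
		i := strings.Index(legacy, marker)
		if i < 0 {
			continue
		}
		// Replace the zone segment with the static "cpu" device segment.
		return legacy[:i] + "_cpu_" + legacy[i+len(marker):], z, true
	}
	return "", "", false
}

func main() {
	zones := []string{"package", "dram", "core"} // assumed zone names
	legacy := []string{"kepler_node_package_joules_total", "kepler_node_dram_watts"}
	for _, m := range legacy {
		if name, zone, ok := consolidate(m, zones); ok {
			fmt.Printf("%s -> %s{zone=%q}\n", m, name, zone)
		}
	}
}
```

Names that contain no known zone segment, such as the removed `kepler_node_energy_zone`, are reported as not convertible rather than guessed at.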