Skip to content
This repository was archived by the owner on Jun 6, 2024. It is now read-only.
This repository was archived by the owner on Jun 6, 2024. It is now read-only.

Add alert for GPU perf issue #5342

@Binyang2014

Description

@Binyang2014

We already has alert for this issue, but not cover all situations.

  • Add case: GPU perf in P0 status, but application clock not correct
  • Add auto-fix tools. When detected this issue, we can launch a privileged pod and run command to fix this.

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions