SREP-3733: Add ClusterPullSecretInvalidSRE PrometheusRule for pull secret health#2667
Conversation
|
@MateSaary: This pull request references SREP-3733 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
1 similar comment
|
@MateSaary: This pull request references SREP-3733 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
No actionable comments were generated in the recent review. 🎉 ℹ️ Recent review info⚙️ Run configurationConfiguration used: Organization UI Review profile: CHILL Plan: Pro Run ID: 📒 Files selected for processing (4)
✅ Files skipped from review due to trivial changes (1)
WalkthroughAdds a new PrometheusRule named Changes
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~20 minutes 🚥 Pre-merge checks | ✅ 5✅ Passed checks (5 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|
@MateSaary: This pull request references SREP-3733 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/hold ...pending merge of openshift/osd-metrics-exporter#284 |
There was a problem hiding this comment.
🧹 Nitpick comments (1)
deploy/sre-prometheus/100-pull-secret-health.PrometheusRule.yaml (1)
19-21: Consider adding alinklabel for SOP documentation.The similar pull secret alerts in
100-ocm-agent-operator.PrometheusRule.yamlinclude alinklabel pointing to SOP documentation for responding to the alert. Adding this would help SREs quickly access runbook procedures when responding to this alert.📖 Proposed fix to add SOP link
labels: severity: warning namespace: openshift-monitoring + link: "https://github.com/openshift/ops-sop/blob/master/v4/alerts/OCMAgentResponseFailureServiceLogsSRE.md#verify-cluster-pull-secrets"Note: If a dedicated SOP for this specific alert exists or will be created, use that link instead.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@deploy/sre-prometheus/100-pull-secret-health.PrometheusRule.yaml` around lines 19 - 21, Add a `link` label to the alert metadata so SREs can quickly access the runbook; locate the labels block (currently containing `severity: warning` and `namespace: openshift-monitoring`) and add a `link: "<SOP_URL>"` entry (use the existing SOP URL used in 100-ocm-agent-operator.PrometheusRule.yaml or the specific runbook URL for this alert) alongside `severity` and `namespace` to ensure the alert includes the SOP link.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Nitpick comments:
In `@deploy/sre-prometheus/100-pull-secret-health.PrometheusRule.yaml`:
- Around line 19-21: Add a `link` label to the alert metadata so SREs can
quickly access the runbook; locate the labels block (currently containing
`severity: warning` and `namespace: openshift-monitoring`) and add a `link:
"<SOP_URL>"` entry (use the existing SOP URL used in
100-ocm-agent-operator.PrometheusRule.yaml or the specific runbook URL for this
alert) alongside `severity` and `namespace` to ensure the alert includes the SOP
link.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Repository: openshift/coderabbit/.coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID: 4987b8f4-6731-4709-a07d-69439f7cad91
📒 Files selected for processing (1)
deploy/sre-prometheus/100-pull-secret-health.PrometheusRule.yaml
44293bd to
a6b37a9
Compare
|
@MateSaary: This pull request references SREP-3733 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
a6b37a9 to
db6eed1
Compare
|
/test checklinks-pr |
|
/hold cancel |
|
@MateSaary: all tests passed! Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: bergmannf, MateSaary The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
feature
What this PR does / why we need it?
Adds a
ClusterPullSecretInvalidSREPrometheusRule that alerts when thepull_secret_validmetric (openshift/osd-metrics-exporter#284) reports the cluster pull secret is invalid. The alert message includes thereasonlabel for actionable context (e.g.MissingRegistry,MalformedJSON,EmptyCredential).Which Jira/Github issue(s) this PR fixes?
Fixes SREP-3733
Special notes for your reviewer:
Pre-checks (if applicable):
Tested latest changes against a cluster
Included documentation changes with PR
If this is a new object that is not intended for the FedRAMP environment (if unsure, please reach out to team FedRAMP), please exclude it with: