OCPBUGS-78990: Bump 1.35.3 to master by jubittajohn · Pull Request #2633 · openshift/kubernetes

jubittajohn · 2026-03-20T21:49:05Z

Bump release-4.22/master from v1.35.2 from v1.35.3

This PR fixes the flaky TestApplyCRDuringCRDFinalization test that was failing intermittently on slower systems (s390x architecture, race detector builds). The root cause was a race condition where the test would attempt to apply a CR immediately after requesting CRD deletion, without waiting for the CRD to actually enter the terminating state. The fix explicitly waits for the CRD to have the Terminating condition set to True before attempting the apply.

NewSimpleClientset was marked as deprecated when NewClientset was introduced. This has caused some confusion: - Not all packages have NewClientset (kubernetes#135980). - Tests that work with NewSimpleClientset fail when switched to NewClientset (kubernetes#136327) because of missing CRD support (kubernetes#126850). It doesn't seem burdensome to keep NewSimpleClientset around forever. Some unit tests may even prefer to use it when they don't need server-side apply (less overhead). Therefore there is no need to deprecate it. This avoids churn in the eco system because contributors no longer create PRs "because the linter complains about the usage of a deprecated function".

…update logic Co-authored-by: Pohly <patrick.ohly@intel.com>

…-pick-of-#135567-upstream-release-1.35 [release1.35]Automated cherry pick of kubernetes#135567: Fix flaky TestApplyCRDuringCRDFinalization test

…binding slice first

1.35: add dockerized go cache chmod to `make clean`

we only use the rules in the master branch since we don't need rules.yaml, we don't have two places to match, so we can drop the golang version entirely from this file bump .go-version alone will be sufficient* on release branches after kubernetes#136954 * ignoring e2e images like agnhost, which will require follow-up PRs ...

…k-of-#137253-upstream-release-1.35 Automated cherry pick of kubernetes#137253: DRA: start scheduler after creating binding/non-binding slicesin Basicflow

The test uses an invalid image to induce a pull error. The previous image name 'some-image-that-doesnt-exist' causes slow DNS/registry resolution on some environments (especially metal), leading to 30s timeouts. Using 'localhost/some-image-that-does-not-exist' makes the pull fail instantly since there is no registry on localhost, avoiding flaky timeouts.

…-of-#136455-origin-release-1.35 Automated cherry pick of kubernetes#136455: fake client-go: un-deprecate NewSimpleClientset

If /var/lib/kubelet is MS_SHARED mountpoint, all the mountpoints under /var/lib/kubelet will have duplicate one. When `kubeadm reset -f` is executed, it will try to umount one path twice. However, they are in the peer group. Once we umount one path, the duplicate one will be umounted as well. So, in this case, we should ignore EINVAL error. Signed-off-by: Wei Fu <fuweid89@gmail.com> (cherry picked from commit 2634261) Signed-off-by: Wei Fu <fuweid89@gmail.com>

…-pick-of-#135611-upstream-release-1.35 [release-1.35]Automated cherry pick of kubernetes#135611: Fix flake TestDeviceTaintRule test

…-of-#137251-upstream-release-1.35 Automated cherry pick of kubernetes#137251: kubeadm: do not add learner member to etcd client endpoints

drop publishing rules from dependencies.yaml on release branch

…y-pick-of-#137252-upstream-release-1.35 Automated cherry pick of kubernetes#137252: Use localhost Image Reference in PodObservedGenerationTracking E2E Test

[release-1.35] cmd/kubeadm: ignore EINVAL error during unmount

Kubernetes official release v1.35.3

openshift-ci-robot · 2026-03-20T21:49:14Z

@jubittajohn: This pull request references Jira Issue OCPBUGS-78990, which is valid. The bug has been moved to the POST state.

3 validation(s) were run on this bug

bug is open, matching expected state (open)
bug target version (4.22.0) matches configured target version for branch (4.22.0)
bug is in the state New, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @gangwgr

The bug has been updated to refer to the pull request using the external bug tracker.

Details

In response to this:

Bump release-4.22/master from v1.35.2 from v1.35.3

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

openshift-ci-robot · 2026-03-20T21:49:17Z

@jubittajohn: the contents of this pull request could not be automatically validated.

The following commits could not be validated and must be approved by a top-level approver:

02115b6|Update CHANGELOG/CHANGELOG-1.35.md for v1.35.2: does not specify an upstream backport in the commit message
0a9d14b|Fix flaky TestApplyCRDuringCRDFinalization test: does not specify an upstream backport in the commit message
1c0c911|kubeadm: do not add learner member to etcd client endpoints: does not specify an upstream backport in the commit message
2371267|add dockerized go cache to make clean: does not specify an upstream backport in the commit message
39a9d96|UPSTREAM: : hack/update-vendor.sh, make update and update image: does not specify an upstream backport in the commit message
4e48345|Merge pull request #136929 from BenTheElder/1.35-clean-fix: does not specify an upstream backport in the commit message
60ebb8a|Merge pull request #137366 from tsj-30/automated-cherry-pick-of-#137253-upstream-release-1.35: does not specify an upstream backport in the commit message
61003ad|drop publishing rules from dependencies.yaml on release branch: does not specify an upstream backport in the commit message
659d489|dra: stabilize DeviceBindingConditions BasicFlow by creating without-binding slice first: does not specify an upstream backport in the commit message
6c1cd99|Release commit for Kubernetes v1.35.3: does not specify an upstream backport in the commit message
77d8b63|Merge pull request #137257 from ahrtr/automated-cherry-pick-of-#137251-upstream-release-1.35: does not specify an upstream backport in the commit message
946b7f7|Merge pull request #136387 from vikasbolla/automated-cherry-pick-of-#135567-upstream-release-1.35: does not specify an upstream backport in the commit message
a0e5f1a|Merge pull request #136903 from pohly/automated-cherry-pick-of-#136455-origin-release-1.35: does not specify an upstream backport in the commit message
af6488c|Use localhost image reference in PodObservedGenerationTracking test: does not specify an upstream backport in the commit message
b57e146|cmd/kubeadm: ignore EINVAL error during unmount: does not specify an upstream backport in the commit message
b6f632b|Merge pull request #137569 from fuweid/cp-1374942-135: does not specify an upstream backport in the commit message
b9be60a|start scheduler after creating binding/non-binding slices: does not specify an upstream backport in the commit message
bd39991|fake client-go: un-deprecate NewSimpleClientset: does not specify an upstream backport in the commit message
d05c506|Fix flake TestDeviceTaintRule test by adjusting event hanlder status update logic Co-authored-by: Pohly patrick.ohly@intel.com: does not specify an upstream backport in the commit message
d305427|Merge tag 'v1.35.3' into release-4.22: does not specify an upstream backport in the commit message
d595aaf|Merge pull request #137047 from vikasbolla/automated-cherry-pick-of-#135611-upstream-release-1.35: does not specify an upstream backport in the commit message
f503429|Merge pull request #137400 from Chandan9112/automated-cherry-pick-of-#137252-upstream-release-1.35: does not specify an upstream backport in the commit message
fe6a745|Merge pull request #137382 from BenTheElder/no-rules: does not specify an upstream backport in the commit message

Comment /validate-backports to re-evaluate validity of the upstream PRs, for example when they are merged upstream.

coderabbitai · 2026-03-20T21:50:16Z

Walkthrough

This pull request includes release documentation updates for v1.35.2, conditional permission fixes in build scripts, etcd endpoint handling changes for learner members, device taint eviction controller enhancements with delayed work scheduling, and widespread API documentation updates to fake clientset and code generator templates regarding field tracking support.

Changes

Cohort / File(s)	Summary
Release and metadata updates `CHANGELOG/CHANGELOG-1.35.md`, `openshift-hack/images/hyperkube/Dockerfile.rhel`	Added v1.35.2 release notes with download artifacts and sha512 hashes; updated container image version label from 1.35.2 to 1.35.3.
Build and dependency configuration `build/common.sh`, `build/dependencies.yaml`	Added conditional `chmod -R +w` for dockerized Go cache before cleanup; removed dedicated golang upstream version dependency entry (1.25.7).
Kubeadm core fixes `cmd/kubeadm/app/cmd/phases/reset/unmount_linux.go`, `cmd/kubeadm/app/util/etcd/etcd.go`	Added `EINVAL` error suppression during kubelet unmounting; modified etcd member client URL appending to skip learner members.
Device taint eviction controller `pkg/controller/devicetainteviction/device_taint_eviction.go`	Enhanced logging verbosity; refactored eviction scheduling to support immediate vs. delayed work; adjusted rule change handling for immediate status updates; modified pod handling to use delayed rule status updates instead of immediate.
Device taint eviction tests `pkg/controller/devicetainteviction/device_taint_eviction_test.go`	Updated mock work queue expectations for delayed items; refactored "evict-pod-rule-later" test scenario with multi-phase time advancement; added queue comparison helpers; changed worker count from 10 to 1 for deterministic ordering.
Fake clientset API documentation `staging/src/k8s.io/apiextensions-apiserver/examples/client-go/pkg/client/clientset/versioned/fake/clientset_generated.go`, `staging/src/k8s.io/apiextensions-apiserver/pkg/client/clientset/clientset/fake/clientset_generated.go`, `staging/src/k8s.io/client-go/kubernetes/fake/clientset_generated.go`, `staging/src/k8s.io/kube-aggregator/pkg/client/clientset_generated/clientset/fake/clientset_generated.go`, `staging/src/k8s.io/metrics/pkg/client/clientset/versioned/fake/clientset_generated.go`	Removed deprecation notices from `NewSimpleClientset`; added documentation to `NewClientset` describing field tracking and server-side apply support with CRD limitation warnings.
Code generator and examples `staging/src/k8s.io/code-generator/cmd/client-gen/generators/fake/generator_fake_for_clientset.go`, `staging/src/k8s.io/code-generator/examples/.../clientset/versioned/fake/clientset_generated.go`	Updated generated documentation templates to clarify field tracking/server-side apply support in `NewClientset` and removed deprecation notices from `NewSimpleClientset` across multiple example generators.
Integration and e2e test updates `staging/src/k8s.io/apiextensions-apiserver/test/integration/finalization_test.go`, `test/e2e/node/pods.go`, `test/integration/dra/binding_conditions_test.go`	Added CRD termination condition polling verification; changed invalid image string for pod spec testing; moved scheduler startup to after resource slice creation in DRA binding test.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 Trivy (0.69.3)

Trivy execution failed: Unknown error

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Tip

CodeRabbit can use OpenGrep to find security vulnerabilities and bugs across 17+ programming languages.

OpenGrep is compatible with Semgrep configurations. Add an opengrep.yml or semgrep.yml configuration file to your project to enable OpenGrep analysis.

openshift-ci · 2026-03-20T21:51:19Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jubittajohn

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~DOWNSTREAM_OWNERS~~ [jubittajohn]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

CHANGELOG/CHANGELOG-1.35.md (1)

679-680: ⚠️ Potential issue | 🟡 Minor

Fix malformed markdown in two changelog bullets.

At Line 679 and Line 680, each bullet is missing a closing ) before the [SIG ...] suffix, which can break rendering/parsing of this section.

Suggested fix

-- Fixes a bug where `MutatingAdmissionPolicy` would fail to apply to objects with duplicate list items (like env vars). ([`#135560`](https://github.com/kubernetes/kubernetes/pull/135560), [`@lalitc375`](https://github.com/lalitc375) [SIG API Machinery]
+- Fixes a bug where `MutatingAdmissionPolicy` would fail to apply to objects with duplicate list items (like env vars). ([`#135560`](https://github.com/kubernetes/kubernetes/pull/135560), [`@lalitc375`](https://github.com/lalitc375)) [SIG API Machinery]
-- K8s.io/client-go: Fixes a regression in 1.34+ which prevented informers from using configured Transformer functions. ([`#135580`](https://github.com/kubernetes/kubernetes/pull/135580), [`@serathius`](https://github.com/serathius) [SIG API Machinery]
+- K8s.io/client-go: Fixes a regression in 1.34+ which prevented informers from using configured Transformer functions. ([`#135580`](https://github.com/kubernetes/kubernetes/pull/135580), [`@serathius`](https://github.com/serathius)) [SIG API Machinery]

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@CHANGELOG/CHANGELOG-1.35.md` around lines 679 - 680, Two changelog bullets
referencing "MutatingAdmissionPolicy" and "K8s.io/client-go" are missing a
closing parenthesis before their trailing "[SIG API Machinery]" suffix; update
each bullet so the PR link + author group is properly closed (add the missing
")" after the author link in the "([`#135560`]..., [`@lalitc375`]...)" and
"([`#135580`]..., [`@serathius`]...)" groups) so the markdown renders correctly.

🧹 Nitpick comments (2)

build/common.sh (1)
343-353: Update stale comment to match the new dockerized cache handling.

Line 347 says dockerized builds don’t need this chmod, but Lines 351-353 now do exactly that. Please align the comment to avoid future confusion.
Suggested diff
-    # We don't need to do this at all for dockerized builds
+    # Dockerized builds can also leave go/cache non-writable on the host.
+    # Handle that path explicitly as well.
As per coding guidelines, "Focus on major issues impacting performance, readability, maintainability and security. Avoid nitpicks and avoid verbosity."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@build/common.sh` around lines 343 - 353, Update the stale comment above the
chmod blocks so it accurately describes current behavior: change the sentence
that claims "We don't need to do this at all for dockerized builds" to reflect
that both local and dockerized cache directories are now explicitly made
writable; reference the existing variables/paths
LOCAL_OUTPUT_ROOT/local/go/cache and LOCAL_OUTPUT_ROOT/dockerized/go/cache and
ensure the comment explains why only these cache paths (not the whole output
root) are chmodded.
pkg/controller/devicetainteviction/device_taint_eviction_test.go (1)
1998-2002: Remove the unconditional fmt.Println from the test loop.

This will spam stdout on every run and makes CI output harder to read. If the trace is still useful, route it through tContext.Logf instead.
Suggested cleanup
 		for _, item := range tContext.mockQueue.State().Later {
-			fmt.Println(item.Item, item.Duration)
 			tContext.mockQueue.CancelAfter(item.Item)
 			tContext.mockQueue.AddAfter(item.Item, item.Duration-state.advance)
 		}
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@pkg/controller/devicetainteviction/device_taint_eviction_test.go` around
lines 1998 - 2002, Remove the unconditional fmt.Println in the test loop that
iterates over tContext.mockQueue.State().Later; instead either delete that print
or replace it with a test-friendly logger call like tContext.Logf, keeping the
existing CancelAfter and AddAfter calls
(tContext.mockQueue.CancelAfter(item.Item) and
tContext.mockQueue.AddAfter(item.Item, item.Duration-state.advance)) unchanged
so test output no longer spams stdout.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@pkg/controller/devicetainteviction/device_taint_eviction.go`:
- Around line 1286-1288: The current approach only enqueues the rule with
tc.workqueue.Add(workItemForRule(newRule)) but does not guarantee its processing
runs before pod eviction work items; to fix, perform the required rule-status
update synchronously (call the controller method that updates rule status
directly, e.g., tc.syncRuleStatus or the same status-update routine used by the
worker) before releasing the mutex/enqueuing pods, or implement explicit
prioritization/serialization (e.g., a dedicated rule-priority queue or a
blocking wait until the rule-status work item is processed) instead of relying
on workqueue order; update both places where workItemForRule(newRule) is used
(the block around the tc.logger.V(5).Info call and the similar section at the
later occurrence) to ensure the status transition happens deterministically
before any pod-eviction work runs.

In
`@staging/src/k8s.io/apiextensions-apiserver/test/integration/finalization_test.go`:
- Around line 188-200: The current wait on the CRD Terminating condition (using
wait.PollUntilContextTimeout + apiextensionshelpers.IsCRDConditionTrue on the
fetched crd) only ensures the prerequisite is set, but the create path can still
race and observe stale state; wrap the Apply/create call (the code that attempts
to create the test CR) in a retry loop (e.g. wait.PollUntilContextTimeout or
wait.PollImmediateUntil) that repeatedly calls the same Apply/create until the
operation fails due to the CRD being terminating (the behavior under test),
rather than performing a single Apply; locate the Apply invocation in this test
and replace the single attempt with a short-interval poll that asserts the Apply
returns the expected blocked/error result before proceeding.

---

Outside diff comments:
In `@CHANGELOG/CHANGELOG-1.35.md`:
- Around line 679-680: Two changelog bullets referencing
"MutatingAdmissionPolicy" and "K8s.io/client-go" are missing a closing
parenthesis before their trailing "[SIG API Machinery]" suffix; update each
bullet so the PR link + author group is properly closed (add the missing ")"
after the author link in the "([`#135560`]..., [`@lalitc375`]...)" and
"([`#135580`]..., [`@serathius`]...)" groups) so the markdown renders correctly.

---

Nitpick comments:
In `@build/common.sh`:
- Around line 343-353: Update the stale comment above the chmod blocks so it
accurately describes current behavior: change the sentence that claims "We don't
need to do this at all for dockerized builds" to reflect that both local and
dockerized cache directories are now explicitly made writable; reference the
existing variables/paths LOCAL_OUTPUT_ROOT/local/go/cache and
LOCAL_OUTPUT_ROOT/dockerized/go/cache and ensure the comment explains why only
these cache paths (not the whole output root) are chmodded.

In `@pkg/controller/devicetainteviction/device_taint_eviction_test.go`:
- Around line 1998-2002: Remove the unconditional fmt.Println in the test loop
that iterates over tContext.mockQueue.State().Later; instead either delete that
print or replace it with a test-friendly logger call like tContext.Logf, keeping
the existing CancelAfter and AddAfter calls
(tContext.mockQueue.CancelAfter(item.Item) and
tContext.mockQueue.AddAfter(item.Item, item.Duration-state.advance)) unchanged
so test output no longer spams stdout.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: e35431c4-4c05-4c7d-b9a0-3346d696384b

📥 Commits

Reviewing files that changed from the base of the PR and between a925ce0 and 39a9d96.

⛔ Files ignored due to path filters (2)

staging/src/k8s.io/sample-apiserver/pkg/generated/clientset/versioned/fake/clientset_generated.go is excluded by !**/generated/**
staging/src/k8s.io/sample-controller/pkg/generated/clientset/versioned/fake/clientset_generated.go is excluded by !**/generated/**

📒 Files selected for processing (22)

CHANGELOG/CHANGELOG-1.35.md
build/common.sh
build/dependencies.yaml
cmd/kubeadm/app/cmd/phases/reset/unmount_linux.go
cmd/kubeadm/app/util/etcd/etcd.go
openshift-hack/images/hyperkube/Dockerfile.rhel
pkg/controller/devicetainteviction/device_taint_eviction.go
pkg/controller/devicetainteviction/device_taint_eviction_test.go
staging/src/k8s.io/apiextensions-apiserver/examples/client-go/pkg/client/clientset/versioned/fake/clientset_generated.go
staging/src/k8s.io/apiextensions-apiserver/pkg/client/clientset/clientset/fake/clientset_generated.go
staging/src/k8s.io/apiextensions-apiserver/test/integration/finalization_test.go
staging/src/k8s.io/client-go/kubernetes/fake/clientset_generated.go
staging/src/k8s.io/code-generator/cmd/client-gen/generators/fake/generator_fake_for_clientset.go
staging/src/k8s.io/code-generator/examples/HyphenGroup/clientset/versioned/fake/clientset_generated.go
staging/src/k8s.io/code-generator/examples/MixedCase/clientset/versioned/fake/clientset_generated.go
staging/src/k8s.io/code-generator/examples/apiserver/clientset/versioned/fake/clientset_generated.go
staging/src/k8s.io/code-generator/examples/crd/clientset/versioned/fake/clientset_generated.go
staging/src/k8s.io/code-generator/examples/single/clientset/versioned/fake/clientset_generated.go
staging/src/k8s.io/kube-aggregator/pkg/client/clientset_generated/clientset/fake/clientset_generated.go
staging/src/k8s.io/metrics/pkg/client/clientset/versioned/fake/clientset_generated.go
test/e2e/node/pods.go
test/integration/dra/binding_conditions_test.go

💤 Files with no reviewable changes (4)

staging/src/k8s.io/kube-aggregator/pkg/client/clientset_generated/clientset/fake/clientset_generated.go
staging/src/k8s.io/metrics/pkg/client/clientset/versioned/fake/clientset_generated.go
staging/src/k8s.io/code-generator/examples/apiserver/clientset/versioned/fake/clientset_generated.go
build/dependencies.yaml

coderabbitai · 2026-03-20T22:00:13Z

pkg/controller/devicetainteviction/device_taint_eviction.go

+		// Update the status at least once, immediately and before evicting any pods.
+		tc.logger.V(5).Info("Adding immediate status update because of new rule", "deviceTaintRule", klog.KObj(newRule))
 		tc.workqueue.Add(workItemForRule(newRule))


⚠️ Potential issue | 🟠 Major

Queueing the rule item first does not guarantee the status update runs first.

These lines only change enqueue order. Once Run() uses multiple workers, a pod work item can still execute before the rule-status item after the mutex is released, so the promised “before evicting any pods” transition remains racy. The test workaround in pkg/controller/devicetainteviction/device_taint_eviction_test.go Lines 2429-2431 confirms the same ordering problem. If that sequencing matters, it needs explicit serialization/prioritization instead of relying on shared-queue order alone.

Also applies to: 1301-1307

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@pkg/controller/devicetainteviction/device_taint_eviction.go` around lines 1286 - 1288, The current approach only enqueues the rule with tc.workqueue.Add(workItemForRule(newRule)) but does not guarantee its processing runs before pod eviction work items; to fix, perform the required rule-status update synchronously (call the controller method that updates rule status directly, e.g., tc.syncRuleStatus or the same status-update routine used by the worker) before releasing the mutex/enqueuing pods, or implement explicit prioritization/serialization (e.g., a dedicated rule-priority queue or a blocking wait until the rule-status work item is processed) instead of relying on workqueue order; update both places where workItemForRule(newRule) is used (the block around the tc.logger.V(5).Info call and the similar section at the later occurrence) to ensure the status transition happens deterministically before any pod-eviction work runs.

coderabbitai · 2026-03-20T22:00:13Z

staging/src/k8s.io/apiextensions-apiserver/test/integration/finalization_test.go

+	// Wait for the CRD to have the Terminating condition set to True.
+	// The handler checks IsCRDConditionTrue(crd, apiextensionsv1.Terminating) to block
+	// CR creation, and this condition is set asynchronously by the CRD finalizer controller
+	// after it observes the DeletionTimestamp. Without this wait, the Apply could succeed
+	// if it races ahead of the controller setting the condition.
+	err = wait.PollUntilContextTimeout(t.Context(), 100*time.Millisecond, wait.ForeverTestTimeout, true, func(ctx context.Context) (bool, error) {
+		crd, err := apiExtensionClient.ApiextensionsV1().CustomResourceDefinitions().Get(ctx, noxuDefinition.Name, metav1.GetOptions{})
+		if err != nil {
+			return false, err
+		}
+		return apiextensionshelpers.IsCRDConditionTrue(crd, apiextensionsv1.Terminating), nil
+	})
+	require.NoError(t, err, "timed out waiting for CRD Terminating condition to be set")


⚠️ Potential issue | 🟠 Major

Single Apply can still race the terminating-state propagation.

Line 193 only waits for a direct CRD Get to see Terminating=True. The create path observes CRD state asynchronously, so Line 208 can still hit the handler before that update is visible there and intermittently succeed. The old retry loop synchronized on the behavior under test; this version only synchronizes on a prerequisite.

Suggested fix

- _, err = noxuResourceClient.Apply(t.Context(), name, instance, metav1.ApplyOptions{DryRun: []string{"All"}, FieldManager: "manager"}) - wantErr := `create not allowed while custom resource definition is terminating` - require.ErrorContains(t, err, wantErr) + wantErr := `create not allowed while custom resource definition is terminating` + var applyErr error + err = wait.PollUntilContextTimeout(t.Context(), 100*time.Millisecond, wait.ForeverTestTimeout, true, func(ctx context.Context) (bool, error) { + _, applyErr = noxuResourceClient.Apply(ctx, name, instance, metav1.ApplyOptions{DryRun: []string{"All"}, FieldManager: "manager"}) + return applyErr != nil, nil + }) + require.NoError(t, err) + require.ErrorContains(t, applyErr, wantErr)

Also applies to: 208-210

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@staging/src/k8s.io/apiextensions-apiserver/test/integration/finalization_test.go` around lines 188 - 200, The current wait on the CRD Terminating condition (using wait.PollUntilContextTimeout + apiextensionshelpers.IsCRDConditionTrue on the fetched crd) only ensures the prerequisite is set, but the create path can still race and observe stale state; wrap the Apply/create call (the code that attempts to create the test CR) in a retry loop (e.g. wait.PollUntilContextTimeout or wait.PollImmediateUntil) that repeatedly calls the same Apply/create until the operation fails due to the CRD being terminating (the behavior under test), rather than performing a single Apply; locate the Apply invocation in this test and replace the single attempt with a short-interval poll that asserts the Apply returns the expected blocked/error result before proceeding.

jubittajohn · 2026-03-23T19:17:38Z

/retest

gangwgr · 2026-03-26T07:23:21Z

/test e2e-aws-ovn-runc

openshift-ci · 2026-03-26T13:40:37Z

@jubittajohn: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/e2e-aws-ovn-runc	`39a9d96`	link	false	`/test e2e-aws-ovn-runc`

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

aviralgarg05 and others added 23 commits January 21, 2026 18:32

add dockerized go cache to make clean

2371267

Fix flake TestDeviceTaintRule test by adjusting event hanlder status …

d05c506

…update logic Co-authored-by: Pohly <patrick.ohly@intel.com>

kubeadm: do not add learner member to etcd client endpoints

1c0c911

Update CHANGELOG/CHANGELOG-1.35.md for v1.35.2

02115b6

Merge pull request kubernetes#136387 from vikasbolla/automated-cherry…

946b7f7

…-pick-of-#135567-upstream-release-1.35 [release1.35]Automated cherry pick of kubernetes#135567: Fix flaky TestApplyCRDuringCRDFinalization test

dra: stabilize DeviceBindingConditions BasicFlow by creating without-…

659d489

…binding slice first

start scheduler after creating binding/non-binding slices

b9be60a

Merge pull request kubernetes#136929 from BenTheElder/1.35-clean-fix

4e48345

1.35: add dockerized go cache chmod to `make clean`

Merge pull request kubernetes#137366 from tsj-30/automated-cherry-pic…

60ebb8a

…k-of-#137253-upstream-release-1.35 Automated cherry pick of kubernetes#137253: DRA: start scheduler after creating binding/non-binding slicesin Basicflow

Merge pull request kubernetes#136903 from pohly/automated-cherry-pick…

a0e5f1a

…-of-#136455-origin-release-1.35 Automated cherry pick of kubernetes#136455: fake client-go: un-deprecate NewSimpleClientset

Merge pull request kubernetes#137047 from vikasbolla/automated-cherry…

d595aaf

…-pick-of-#135611-upstream-release-1.35 [release-1.35]Automated cherry pick of kubernetes#135611: Fix flake TestDeviceTaintRule test

Merge pull request kubernetes#137257 from ahrtr/automated-cherry-pick…

77d8b63

…-of-#137251-upstream-release-1.35 Automated cherry pick of kubernetes#137251: kubeadm: do not add learner member to etcd client endpoints

Merge pull request kubernetes#137382 from BenTheElder/no-rules

fe6a745

drop publishing rules from dependencies.yaml on release branch

Merge pull request kubernetes#137400 from Chandan9112/automated-cherr…

f503429

…y-pick-of-#137252-upstream-release-1.35 Automated cherry pick of kubernetes#137252: Use localhost Image Reference in PodObservedGenerationTracking E2E Test

Merge pull request kubernetes#137569 from fuweid/cp-1374942-135

b6f632b

[release-1.35] cmd/kubeadm: ignore EINVAL error during unmount

Release commit for Kubernetes v1.35.3

6c1cd99

Merge tag 'v1.35.3' into release-4.22

d305427

Kubernetes official release v1.35.3

UPSTREAM: <drop>: hack/update-vendor.sh, make update and update image

39a9d96

openshift-ci bot requested a review from gangwgr March 20, 2026 21:49

openshift-ci bot requested review from bertinatto and jerpeter1 March 20, 2026 21:51

openshift-ci bot added the vendor-update Touching vendor dir or related files label Mar 20, 2026

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 20, 2026

coderabbitai bot reviewed Mar 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OCPBUGS-78990: Bump 1.35.3 to master#2633

OCPBUGS-78990: Bump 1.35.3 to master#2633
jubittajohn wants to merge 23 commits intoopenshift:masterfrom
jubittajohn:rebase-v1.35.3

jubittajohn commented Mar 20, 2026

Uh oh!

openshift-ci-robot commented Mar 20, 2026

Uh oh!

openshift-ci-robot commented Mar 20, 2026

Uh oh!

coderabbitai bot commented Mar 20, 2026 •

edited

Loading

Uh oh!

openshift-ci bot commented Mar 20, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Mar 20, 2026

Uh oh!

coderabbitai bot Mar 20, 2026

Uh oh!

jubittajohn commented Mar 23, 2026

Uh oh!

gangwgr commented Mar 26, 2026

Uh oh!

openshift-ci bot commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

13 participants

Conversation

jubittajohn commented Mar 20, 2026

Uh oh!

openshift-ci-robot commented Mar 20, 2026

Uh oh!

openshift-ci-robot commented Mar 20, 2026

Uh oh!

coderabbitai bot commented Mar 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Uh oh!

openshift-ci bot commented Mar 20, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

jubittajohn commented Mar 23, 2026

Uh oh!

gangwgr commented Mar 26, 2026

Uh oh!

openshift-ci bot commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

13 participants

coderabbitai bot commented Mar 20, 2026 •

edited

Loading