define pod sandbox lifecycle contract by aojea · Pull Request #286 · containerd/nri

aojea · 2026-04-09T19:56:54Z

Define the contract for the PodSandbox hooks for the NRI plugins.

The Sandbox hooks are based on the CRI-API RPCs , since the OCI runtime only specify the container lifecycle.

SergeyKanzhelev · 2026-04-10T22:41:58Z

same way we do conformance for runtimes, some of these contracts may be added to conformance: kubernetes-sigs/cri-tools#2046. I think critest is the right place for it. But it opens an interesting discussion on whether we will also want to do conformance with other things like CNI. NRI is different enough that I would see it as a reasonable conformance requirement

aojea · 2026-04-11T08:37:10Z

other things like CNI.

CNI is not part of CRI, is an implementation detail of the runtimes and has several flaws that cause a lot of problem with current workloads ... during last kubecon during the OCI meeting we also discussed to replace it by a modular solution based on NRI ... I have a draft that once I have time I plan to finish and share, but I suggest to not include CNI as part of conformance of anything

klihub · 2026-04-11T08:44:34Z

+- `Event_STOP_POD_SANDBOX` (1 << 1) for StopPodSandbox
+- `Event_REMOVE_POD_SANDBOX` (1 << 2) for RemovePodSandbox
+
+These events are notified via the StateChange RPC call.


This is a wire protocol detail which is going to change once #274 is merged, at which point we are going to have dedicated messages and RPC calls on the wire for every single pod and container life cycle event, and eventually remove StateChange after a deprecation period.

I think it would be better to refer here to the stub-offered API which (so far has) always had stronger stability and backward compatibility promise than the wire protocol. So could we perhaps rephrase this as "These events are delivered to plugins using the RunPodSandbox, StopPodSandbox and RemovePodSandbox event handlers." ?

klihub · 2026-04-11T09:14:12Z

+Plugins should handle errors gracefully and avoid leaving the pod or system in an inconsistent state. Error recovery strategies:
+
+- **RunPodSandbox errors**: Problematic; may block pod creation depending on failure severity and runtime policy
+- **StopPodSandbox errors**: May not prevent scenario termination depending on runtime policy


I think on the teardown path, actually both for pods and containers, we should not allow a plugin to try and prevent the operation with an error. If we agree, then we should clearly state here that, for StopPodSandbox and RemovePodSandbox, a plugin failing with an error will not prevent the operation from proceeding.

The current implementation has incorrect/inconstent behavior in this regard when multiple plugins are involved in the sense that for some of these (or corresponding container) teardown lifecycle events, a failure in a plugin will incorrectly prevent the event from being delivered to subsequent plugins, although it will not prevent the CRI-level operation from proceeding. There is a fix coming in for this, but it is waiting for #274 from get merged first (which is waiting for #277 to get merged first).

@klihub can you please suggest the better wording for this?

I do not feel I'm able to translate that correctly to words :)

ok, let me take a stab at this, I think now I got what you mean

see new commit rephrasing it

klihub · 2026-04-11T13:41:36Z

@aojea Thank you, this looks great ! I only have a few comments.

aojea · 2026-04-13T11:58:17Z

@aojea Thank you, this looks great ! I only have a few comments.

@klihub addressed comments on a new commit to simplify the review

klihub · 2026-04-13T18:04:31Z

@aojea Thank you, this looks great ! I only have a few comments.

@klihub addressed comments on a new commit to simplify the review

@aojea Thanks ! This now LGTM. We can squash the commits before/during merging.

Define the contract for the PodSandbox hooks for the NRI plugins. The Sandbox hooks are based on the CRI-API RPCs , since the OCI runtime only specify the container lifecycle. Signed-off-by: Antonio Ojea <aojea@google.com>

aojea · 2026-04-13T19:11:13Z

squashed

klihub

LGTM.

aojea mentioned this pull request Apr 9, 2026

Incompatibility with CRI-O kubernetes-sigs/dranet#137

Open

aojea force-pushed the pod_sandbox_hooks branch from f1d28c3 to cb41fe2 Compare April 9, 2026 20:10

SergeyKanzhelev mentioned this pull request Apr 10, 2026

critest for NRI integration kubernetes-sigs/cri-tools#2046

Open

klihub reviewed Apr 11, 2026

View reviewed changes

define pod sandbox lifecycle contract

3c820b0

Define the contract for the PodSandbox hooks for the NRI plugins. The Sandbox hooks are based on the CRI-API RPCs , since the OCI runtime only specify the container lifecycle. Signed-off-by: Antonio Ojea <aojea@google.com>

aojea force-pushed the pod_sandbox_hooks branch from 609d764 to 3c820b0 Compare April 13, 2026 19:10

klihub self-requested a review April 14, 2026 06:52

klihub approved these changes Apr 14, 2026

View reviewed changes

klihub requested a review from chrishenzie April 14, 2026 06:53

Conversation

aojea commented Apr 9, 2026

Uh oh!

SergeyKanzhelev commented Apr 10, 2026

Uh oh!

aojea commented Apr 11, 2026

Uh oh!

klihub Apr 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aojea Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

klihub Apr 11, 2026

Choose a reason for hiding this comment

Uh oh!

aojea Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

aojea Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

aojea Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

klihub commented Apr 11, 2026

Uh oh!

aojea commented Apr 13, 2026

Uh oh!

klihub commented Apr 13, 2026

Uh oh!

aojea commented Apr 13, 2026

Uh oh!

klihub left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

klihub Apr 11, 2026 •

edited

Loading