On March 19, 2026, Kubernetes 1.33.10 dropped with a feature that's been eight years in the making: in-place pod vertical resize went to beta. If you run stateful workloads — databases, message brokers, caches, search indexes, anything with warm state — this is the biggest capacity-planning improvement since the cluster autoscaler.
Here's why it matters, how it works under the hood, the four-state machine you need to understand before you rely on it, and a safe rollout plan for existing clusters.
1. The problem it solves
Every Kubernetes operator has told this story. A Postgres pod is running with 8GB of memory. Traffic grows. It needs 16GB. The "correct" Kubernetes way is to update spec.containers[].resources.limits.memory and let the pod restart.
Except that means:
- The Postgres instance loses its warm buffer cache — potentially tens of GB of hot data.
- Active connections get dropped.
- Replication state has to re-sync.
- You're taking an intentional, planned outage to increase capacity.
So in practice, teams pick one of three bad options: (a) over-provision from day one (expensive), (b) schedule memory resizes at 3 AM (still an outage), or (c) build complex blue/green resize workflows (pure operational overhead).
In-place resize kills all three paths. You update the spec. The kubelet changes the cgroup limits. No restart, no eviction.
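With the 1.33 `resize` subresource, the whole operation is a single patch. A hypothetical example — pod name, container name, and sizes are illustrative, and this only works against a live cluster with the feature enabled:

```shell
# Grow a running Postgres pod's memory from 8Gi to 16Gi without a restart.
# Assumes kubectl and a control plane at 1.33+ with in-place resize enabled.
kubectl patch pod postgres-0 --subresource resize --type merge -p '
{
  "spec": {
    "containers": [{
      "name": "postgres",
      "resources": {
        "requests": {"memory": "16Gi"},
        "limits":   {"memory": "16Gi"}
      }
    }]
  }
}'
```

Note the `--subresource resize` flag: in 1.33 the resources fields are mutated through a dedicated subresource rather than a plain pod update, which lets RBAC distinguish "can resize" from "can edit pods."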
2. How it actually works
The flow itself is simple:
- You update `pod.spec.containers[*].resources` (via a `resize` subresource in 1.33).
- The kubelet on that node sees the update and validates that the resize is feasible.
- The kubelet writes new cgroup limits: `memory.max` and `cpu.max` under cgroup v2.
- `pod.status.containerStatuses[*].resources` updates to reflect the values actually applied.
Under the hood this leverages the Linux cgroup v2 API, which has allowed live modification of memory and CPU ceilings for years — Kubernetes just finally plumbed it through. The container runtime sees a new cgroup limit, the kernel enforces it on the next allocation, and your running process sees its budget grow mid-life.
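You can watch this happen from the node itself. A sketch of what to look for — the slice path below is illustrative, since the real path depends on your cgroup driver and pod UID:

```shell
# On the node, inside the pod's cgroup directory, after a resize
# to 16Gi of memory and 4 CPUs:
cat /sys/fs/cgroup/kubepods.slice/<pod-slice>/memory.max
# 17179869184          (16Gi in bytes)

cat /sys/fs/cgroup/kubepods.slice/<pod-slice>/cpu.max
# 400000 100000        (quota/period in microseconds: 4 CPUs)
```

These are the same two files the kubelet rewrites during an in-place resize; no container-runtime restart is involved.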
3. The resize state machine
Not every resize succeeds. The kubelet can be in one of four states after a request:
- Proposed — kubelet has seen the request and hasn't decided yet.
- InProgress — resize is being applied; cgroups are being updated.
- Deferred — resize can't happen right now, usually because shrinking memory would overcommit the pod's current working set. The kubelet retries automatically.
- Infeasible — resize can never happen on this node (e.g., requesting more CPU than the node has allocatable). You'll need to reschedule.
The important subtlety: memory cannot be shrunk below current usage. If your pod is using 12GB and you set the limit to 10GB, Kubernetes says "Deferred" and waits for the pod's RSS to drop below 10GB. It will not kill the process to make it fit — that would defeat the entire point.
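The memory-limit decision is easy to hold in your head. A toy sketch of the rules above — this is an illustration for intuition, not the kubelet's actual code:

```shell
# resize_state REQUESTED_LIMIT CURRENT_USAGE NODE_ALLOCATABLE (all in MiB)
# Echoes the terminal state a memory-limit resize request would reach.
resize_state() {
  req=$1; usage=$2; alloc=$3
  if [ "$req" -gt "$alloc" ]; then
    echo "Infeasible"       # can never fit on this node; reschedule
  elif [ "$req" -lt "$usage" ]; then
    echo "Deferred"         # won't shrink below live usage; kubelet retries
  else
    echo "InProgress"       # cgroup limits can be rewritten now
  fi
}

resize_state 10240 12288 65536    # shrink below usage   -> Deferred
resize_state 131072 12288 65536   # more than the node   -> Infeasible
resize_state 16384 12288 65536    # feasible grow        -> InProgress
```

CPU behaves more forgivingly than memory here: a CPU quota can be lowered under a busy process (it just gets throttled), which is why the usage check in the sketch is specifically the memory story.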
4. A safe rollout plan
Here's how we recommend teams adopt in-place resize in production:
Step 1: Enable the feature gate. In 1.33 this is beta but not on by default in every distribution. Set --feature-gates=InPlacePodVerticalScaling=true on kube-apiserver and kubelet. EKS, GKE, and AKS all expose this as a cluster setting from 1.33+.
Step 2: Start with a single non-critical workload. Pick a cache, a worker pool, or a staging database. Run kubectl patch to adjust resources and watch kubectl get pod -o yaml for the resize conditions.
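For step 2, the loop looks something like this. Pod and container names are placeholders; in 1.33 the pending/in-progress states are reported as pod conditions (`PodResizePending` / `PodResizeInProgress`) rather than a dedicated status field:

```shell
# Nudge CPU on a non-critical workload, then watch the resize land.
kubectl patch pod cache-0 --subresource resize --type merge \
  -p '{"spec":{"containers":[{"name":"redis","resources":{"limits":{"cpu":"2"},"requests":{"cpu":"2"}}}]}}'

# The pending / in-progress state surfaces as pod conditions...
kubectl get pod cache-0 -o jsonpath='{.status.conditions}'

# ...and the values the kubelet actually applied surface here:
kubectl get pod cache-0 -o jsonpath='{.status.containerStatuses[0].resources}'
```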
Step 3: Instrument what changed. The Prometheus metric kube_pod_container_resource_limits will reflect the new values, but you also want container_memory_usage_bytes to confirm the process actually started using the new room.
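Two queries cover both sides of step 3. The pod and container labels are placeholders; this assumes kube-state-metrics and cAdvisor scrapes are in place:

```promql
# Spec side: the limit the resize wrote
kube_pod_container_resource_limits{pod="cache-0", resource="memory", unit="byte"}

# Runtime side: whether the process is actually growing into the new room
container_memory_usage_bytes{pod="cache-0", container="redis"}
```

If the first series jumps and the second never follows, you've likely hit the runtime caveat covered in section 5.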
Step 4: Integrate with VPA. Vertical Pod Autoscaler's InPlaceOrRecreate mode (still alpha in 1.33, beta targeted for 1.34) is the long-term destination. It uses in-place resize where possible and falls back to recreate where necessary.
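A minimal VPA object targeting that mode might look like the following. Names are placeholders, and it requires a VPA build that ships the alpha `InPlaceOrRecreate` mode:

```yaml
apiVersion: autoscaling.k8s.io/v1
kind: VerticalPodAutoscaler
metadata:
  name: worker-vpa
spec:
  targetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: worker
  updatePolicy:
    updateMode: "InPlaceOrRecreate"   # resize in place when possible, recreate otherwise
```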
Step 5: Don't trust memory shrink. Ever. Build your automation around "grow in-place, shrink on deploy." Memory is sticky, and Deferred states can last hours.
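That "grow in-place, shrink on deploy" rule is simple enough to encode directly in your automation. A toy sketch, with illustrative names and MiB units:

```shell
# plan_resize CURRENT_LIMIT DESIRED_LIMIT (memory, in MiB)
# Echoes which mechanism automation should use for the change.
plan_resize() {
  current_mem=$1; desired_mem=$2
  if [ "$desired_mem" -gt "$current_mem" ]; then
    echo "grow-in-place"      # safe: cgroup limit raised live
  elif [ "$desired_mem" -lt "$current_mem" ]; then
    echo "shrink-on-deploy"   # roll the reduction with the next restart
  else
    echo "no-op"
  fi
}

plan_resize 8192 16384    # -> grow-in-place
plan_resize 16384 12288   # -> shrink-on-deploy
```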
5. What it still doesn't fix
A realistic list of things in-place resize does not solve:
- Changing storage (PVC) sizes — that's volume expansion, a separate feature.
- Adding or removing containers in a pod — still requires a restart.
- Changing node affinity, tolerations, or nodeSelector — still requires reschedule.
- QoS class transitions (Guaranteed ↔ Burstable) — the kubelet will refuse.
- Init container resizes — ignored.
And the old-school caveat: if your process doesn't observe cgroup changes at runtime — older JVMs that only read container limits at startup, some CPython builds, Go binaries built before Go 1.25 made GOMAXPROCS cgroup-aware — you'll grow the box but the application will still act like it has the old budget. Test this per runtime, not per cluster.
The takeaway
In-place resize is one of those features where "yeah, we finally have it" is the entire review. It removes a long-standing Kubernetes tax on stateful workloads and makes VPA viable for a new class of services that couldn't tolerate the restart.
If you run Postgres, Kafka, Redis, Elasticsearch, or anything with warm state on Kubernetes and you haven't mapped out how in-place resize changes your capacity planning — that's a 30-minute conversation worth having in your next architecture review.
Planning a 1.33 upgrade and want a second set of eyes? Book a free 30-minute Kubernetes review — we'll look at your stateful workloads and tell you where in-place resize changes the plan.
Related: DevOps Engineering services · Site Reliability Engineering · Three Kubernetes migration mistakes