Troubleshooting · Intermediate · ⏱ 30 minutes · K8s 1.28+

Validate SR-IOV Operator Health Across Multiple Worker Nodes

Run a full checklist to confirm SR-IOV discovery, VF creation, scheduler resources, and pod attachment on multiple nodes.

By Luca Berton • 📖 5 min read

💡 Quick Answer: Validate daemon pods, node states, VF counts, and allocatable resources on each worker; then schedule one SR-IOV test pod per node and confirm secondary interface attachment.

Run this checklist on every SR-IOV worker. The examples below use two nodes named worker-a and worker-b.

1) Config Daemon on Each Worker

oc get pods -n openshift-sriov-network-operator -o wide | grep sriov-network-config-daemon
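The grep above lists the daemon pods but leaves the per-node comparison to the reader. As a minimal sketch, the helper below checks that each named worker has a config daemon pod in the Running phase. The function name check_config_daemon is hypothetical, and the sketch assumes the daemon pods keep the sriov-network-config-daemon name prefix used in the command above.

```shell
#!/usr/bin/env bash
# Sketch: confirm a Running sriov-network-config-daemon pod exists on each node.
# NS is the namespace used throughout this recipe; override it if yours differs.
NS="${NS:-openshift-sriov-network-operator}"

check_config_daemon() {
  # usage: check_config_daemon <node> [<node> ...]
  local pods node rc=0
  # One line per pod: <name> TAB <phase> TAB <node>
  pods=$(oc get pods -n "$NS" \
    -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.phase}{"\t"}{.spec.nodeName}{"\n"}{end}')
  for node in "$@"; do
    if printf '%s\n' "$pods" | awk -F'\t' -v n="$node" '
         $1 ~ /^sriov-network-config-daemon/ && $2 == "Running" && $3 == n { found = 1 }
         END { exit (found ? 0 : 1) }'; then
      echo "OK   $node"
    else
      echo "FAIL $node"
      rc=1
    fi
  done
  return "$rc"
}
```

Call it as check_config_daemon worker-a worker-b; it returns non-zero if any listed node lacks a Running daemon pod.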

2) NodeState for Each Worker

oc get sriovnetworknodestate -n openshift-sriov-network-operator
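Listing the NodeState objects confirms they exist, but not that the operator finished applying their configuration. The sketch below prints each node's sync status and fails if any is not Succeeded. It assumes the NodeState status carries a syncStatus field, which may differ between operator versions; check_sync_status is a hypothetical name.

```shell
#!/usr/bin/env bash
# Sketch: report the sync status of every SriovNetworkNodeState.
# Assumption: .status.syncStatus exists and reads "Succeeded" when the node is in sync.
NS="${NS:-openshift-sriov-network-operator}"

check_sync_status() {
  # Prints "<node> <syncStatus>" per NodeState; returns 1 if any is not Succeeded.
  oc get sriovnetworknodestate -n "$NS" \
    -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.syncStatus}{"\n"}{end}' |
    awk -F'\t' '
      NF { print $1, ($2 == "" ? "<none>" : $2); if ($2 != "Succeeded") bad = 1 }
      END { exit (bad ? 1 : 0) }'
}
```

A node that stays out of sync for a long time is worth inspecting with oc describe on its NodeState before moving to the next step.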

3) Verify Target Interfaces in NodeState

oc get sriovnetworknodestate worker-a -n openshift-sriov-network-operator -o json | jq '.status.interfaces[] | {name,pciAddress,numVfs}'
oc get sriovnetworknodestate worker-b -n openshift-sriov-network-operator -o json | jq '.status.interfaces[] | {name,pciAddress,numVfs}'
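The jq output above has to be compared against the VF count you configured. A minimal sketch of that comparison follows, using jsonpath instead of jq so it needs no extra tooling. The function name check_vf_count, the interface name ens1f0, and the expected count are all hypothetical placeholders; substitute the values from your own policy.

```shell
#!/usr/bin/env bash
# Sketch: check that one interface on one node reports the expected number of VFs.
NS="${NS:-openshift-sriov-network-operator}"

check_vf_count() {
  # usage: check_vf_count <node> <interface> <expected-numVfs>
  local node=$1 ifname=$2 want=$3
  # One line per interface: <name> TAB <pciAddress> TAB <numVfs>
  # numVfs prints empty when the field is unset, which is treated as 0 below.
  oc get sriovnetworknodestate "$node" -n "$NS" \
    -o jsonpath='{range .status.interfaces[*]}{.name}{"\t"}{.pciAddress}{"\t"}{.numVfs}{"\n"}{end}' |
    awk -F'\t' -v i="$ifname" -v w="$want" '
      $1 == i {
        seen = 1
        printf "%s %s numVfs=%s\n", $1, $2, ($3 == "" ? 0 : $3)
        if ($3 + 0 == w + 0) ok = 1
      }
      END { if (!seen) print i " not found"; exit (ok ? 0 : 1) }'
}
```

For example, check_vf_count worker-a ens1f0 8 succeeds only if ens1f0 on worker-a reports eight VFs.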

4) Verify Scheduler Resources on Both Nodes

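oc get node worker-a -o json | jq '.status.allocatable'
oc get node worker-b -o json | jq '.status.allocatable'

In each output, look for the SR-IOV resource name (for example openshift.io/mellanoxnics) with a non-zero value.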
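To read a single resource's count instead of scanning the whole allocatable map, a jsonpath query can be used. The sketch below is one way to do that; allocatable_count is a hypothetical helper, and the resource name defaults to the openshift.io/mellanoxnics example used later in this recipe.

```shell
#!/usr/bin/env bash
# Sketch: print the allocatable quantity of one extended resource on a node.
# RESOURCE defaults to this recipe's example name; set it to your own resource.
RESOURCE="${RESOURCE:-openshift.io/mellanoxnics}"

allocatable_count() {
  # usage: allocatable_count <node>; prints the quantity, or 0 if the resource is absent
  local node=$1 key count
  # jsonpath treats "." as a path separator, so dots inside the key are escaped:
  # openshift.io/mellanoxnics -> openshift\.io/mellanoxnics
  key=$(printf '%s' "$RESOURCE" | sed 's/\./\\./g')
  count=$(oc get node "$node" -o jsonpath="{.status.allocatable.${key}}")
  echo "${count:-0}"
}
```

A loop such as for n in worker-a worker-b; do echo "$n $(allocatable_count "$n")"; done prints one line per node, and a 0 points back to the NodeState and VF checks in the earlier steps.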

5) Run One Pod Per Node

Pin each test pod to its node with nodeSelector and request one SR-IOV resource in the container's resource requests and limits (for example openshift.io/mellanoxnics: 1).
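As a hedged illustration of such a pod, the function below prints a manifest pinned to one node. Everything not named in this recipe is an assumption: the function name render_test_pod, the pod name, the placeholder IMAGE (any image that ships the ip command will do), and the network attachment name you pass in. It also assumes the node's kubernetes.io/hostname label matches the node name, which is common but not guaranteed.

```shell
#!/usr/bin/env bash
# Sketch: render a test pod manifest pinned to one node and requesting one SR-IOV VF.
# IMAGE is a placeholder, not a real image; RESOURCE is this recipe's example name.
IMAGE="${IMAGE:-registry.example.com/tools/iproute:latest}"
RESOURCE="${RESOURCE:-openshift.io/mellanoxnics}"

render_test_pod() {
  # usage: render_test_pod <node> <network-attachment-name>
  local node=$1 net=$2
  cat <<EOF
apiVersion: v1
kind: Pod
metadata:
  name: sriov-test-${node}
  annotations:
    k8s.v1.cni.cncf.io/networks: ${net}
spec:
  nodeSelector:
    kubernetes.io/hostname: ${node}
  containers:
  - name: test
    image: ${IMAGE}
    command: ["sleep", "3600"]
    resources:
      requests:
        ${RESOURCE}: "1"
      limits:
        ${RESOURCE}: "1"
EOF
}
```

Piping the output to oc apply, for example render_test_pod worker-a <network-name> | oc apply -n <ns> -f -, creates the pod; repeat with worker-b for the second node.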

6) Confirm Secondary Interface in Pod

oc exec -n <ns> <pod-name> -- ip a

You should see an extra interface (for example net1) from the SR-IOV network attachment.
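When checking several pods, reading ip a output by eye gets tedious. A minimal sketch of an automated check follows; has_secondary_interface is a hypothetical helper, and it assumes the container image includes the ip command and that the attachment uses the default-style name net1 unless you pass another.

```shell
#!/usr/bin/env bash
# Sketch: verify that a pod has the expected secondary interface.

has_secondary_interface() {
  # usage: has_secondary_interface <namespace> <pod> [ifname]   (ifname defaults to net1)
  local ns=$1 pod=$2 ifname=${3:-net1}
  # "ip link show <ifname>" exits non-zero when the interface does not exist.
  if oc exec -n "$ns" "$pod" -- ip link show "$ifname" >/dev/null 2>&1; then
    echo "OK   $pod has $ifname"
  else
    echo "FAIL $pod is missing $ifname"
    return 1
  fi
}
```

Run it once per test pod, for example has_secondary_interface <ns> <pod-name>, and treat any FAIL line as a reason to inspect the pod's events.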

Success Criteria

  • Both nodes expose expected SR-IOV allocatable resources.
  • A test pod on each node starts and attaches its secondary SR-IOV interface.
  • No repeated CNI/IPAM errors in pod events.
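
The last criterion can be checked from the pod's Warning events. The sketch below filters them for networking-related keywords; cni_warnings is a hypothetical helper, and the keyword list (cni, ipam, sriov, multus) is an assumption about how such errors are usually worded, not an exhaustive match.

```shell
#!/usr/bin/env bash
# Sketch: surface Warning events for a pod that mention CNI or IPAM problems.

cni_warnings() {
  # usage: cni_warnings <namespace> <pod>
  # Prints matching "<reason> TAB <message>" lines and returns 1, or returns 0 if none match.
  local ns=$1 pod=$2 hits
  hits=$(oc get events -n "$ns" \
           --field-selector "involvedObject.name=${pod},type=Warning" \
           -o jsonpath='{range .items[*]}{.reason}{"\t"}{.message}{"\n"}{end}' |
         grep -Ei 'cni|ipam|sriov|multus' || true)
  if [ -n "$hits" ]; then
    printf '%s\n' "$hits"
    return 1
  fi
  echo "no CNI/IPAM warnings for $pod"
}
```

A single transient warning during pod start can be harmless; the signal to act on is the same message repeating across events.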