Skip to content

ovn-kubernetes, cluster-network-operator: scope hypershift-kubevirt conformance to sig-kubevirt#79039

Open
qinqon wants to merge 1 commit intoopenshift:mainfrom
qinqon:kubevirt-conformance-sig-kubevirt-only
Open

ovn-kubernetes, cluster-network-operator: scope hypershift-kubevirt conformance to sig-kubevirt#79039
qinqon wants to merge 1 commit intoopenshift:mainfrom
qinqon:kubevirt-conformance-sig-kubevirt-only

Conversation

@qinqon
Copy link
Copy Markdown
Contributor

@qinqon qinqon commented May 8, 2026

Summary

Scope the hypershift-kubevirt-conformance AWS workflow runs to the [sig-kubevirt] subset of the conformance suite, instead of running the entire ~1500-test parallel suite, on:

  • openshift/ovn-kubernetes (test e2e-aws-ovn-hypershift-kubevirt) — release-4.18, release-4.19, release-4.20 (job does not exist on later branches)
  • openshift/cluster-network-operator (test e2e-aws-hypershift-ovn-kubevirt) — release-4.18 through release-5.1, plus master

Both tests use the same hypershift-kubevirt-conformance step-registry workflow, so the same env override applies.

Why

These jobs currently inherit the default TEST_SUITE: openshift/conformance/parallel and run all of OpenShift's parallel conformance tests on top of a HyperShift hosted cluster whose worker nodes are KubeVirt VMs nested inside an AWS infra cluster.

This is both wasteful and fragile:

  • The KubeVirt-on-AWS nested topology is sensitive to infra hiccups. Any transient VM/storage stall fails dozens of unrelated tests simultaneously (DNS, oauth, services, sysctl, cpu_partitioning, ...) and turns a single infra glitch into a job-wide red signal.
  • The job names (hypershift-kubevirt) advertise kubevirt-specific coverage, but the only kubevirt-specific tests in the suite are [sig-kubevirt]. Everything else is generic conformance that is already run by many other gates.

What

Add to each affected test stanza:

env:
  TEST_INCLUDES: sig-kubevirt
  TEST_SUITE: openshift/conformance/parallel/minimal

This matches the existing pattern from openshift/hypershift/openshift-hypershift-release-4.18__periodics.yaml, which is the only AWS hypershift-kubevirt-conformance job in the entire repo that already scopes itself correctly.

Files changed (12)

ovn-kubernetes:

  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.18.yaml
  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.19.yaml
  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.20.yaml

cluster-network-operator:

  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-master.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.18.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.19.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.20.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.21.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.22.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.23.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-5.0.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-5.1.yaml

Validation

  • make ci-operator-prowgen — clean (no Prow job spec changes; env is read at runtime from the unresolved-config registry)
  • make sanitize-prow-jobs — passed
  • make ci-operator-checkconfig — passed (Configs reloaded)

Background

Investigating a recent failure of pull-ci-openshift-ovn-kubernetes-release-4.20-e2e-aws-ovn-hypershift-kubevirt showed the root cause was an in-guest kernel hung-task (disk I/O stall on both KubeVirt worker VMs simultaneously) ~80 minutes into the run. Because the full conformance suite was running, the single infra event caused 7 sig-kubevirt and many additional unrelated tests to fail with "no ready, schedulable nodes in the cluster". Scoping the jobs to their intended purpose makes the gate signal both faster and more meaningful.

@openshift-ci openshift-ci Bot requested review from jcaamano and kyrtapz May 8, 2026 10:15
@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented May 8, 2026

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Enterprise

Run ID: d92efbf2-ee89-4afb-946e-8f2f6aadf5ea

📥 Commits

Reviewing files that changed from the base of the PR and between c40c744 and 3669cef.

📒 Files selected for processing (12)
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-master.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.18.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.19.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.20.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.21.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.22.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.23.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-5.0.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-5.1.yaml
  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.18.yaml
  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.19.yaml
  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.20.yaml
✅ Files skipped from review due to trivial changes (2)
  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.20.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.18.yaml
🚧 Files skipped from review as they are similar to previous changes (4)
  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.18.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.19.yaml
  • ci-operator/config/openshift/ovn-kubernetes/openshift-ovn-kubernetes-release-4.19.yaml
  • ci-operator/config/openshift/cluster-network-operator/openshift-cluster-network-operator-release-4.20.yaml

Walkthrough

Multiple CI operator release configs under ci-operator/config/openshift/ovn-kubernetes and ci-operator/config/openshift/cluster-network-operator were updated to add environment variables to e2e-aws-*-hypershift-kubevirt test jobs: TEST_INCLUDES: sig-kubevirt and TEST_SUITE: openshift/conformance/parallel/minimal.

Changes

Test Job Environment Configuration (OVN Kubernetes & Cluster Network Operator releases)

Layer / File(s) Summary
Data / Config
ci-operator/config/openshift/ovn-kubernetes/...-release-4.18.yaml, ...-release-4.19.yaml, ...-release-4.20.yaml, ci-operator/config/openshift/cluster-network-operator/...-master.yaml, ...-release-4.18.yaml, ...-release-4.19.yaml, ...-release-4.20.yaml, ...-release-4.21.yaml, ...-release-4.22.yaml, ...-release-4.23.yaml, ...-release-4.18.yaml, ...-release-4.19.yaml, ...-release-5.0.yaml, ...-release-5.1.yaml
Added or extended env/steps.env blocks for the e2e-aws-*-hypershift-kubevirt test job entries across multiple release and branch YAMLs.
Env Values Added
(same files)
Each affected job now sets TEST_INCLUDES: sig-kubevirt and TEST_SUITE: openshift/conformance/parallel/minimal.
Scope / Context
(same files)
No other job fields, workflows, cluster_profile, or logic were modified; changes are limited to environment variables for these test entries.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title accurately reflects the main change: scoping hypershift-kubevirt conformance tests to sig-kubevirt across ovn-kubernetes and cluster-network-operator configurations.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Comment @coderabbitai help to get the list of available commands and usage tips.

@qinqon
Copy link
Copy Markdown
Contributor Author

qinqon commented May 8, 2026

/pj-rehearse

@openshift-merge-bot
Copy link
Copy Markdown
Contributor

@qinqon: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@qinqon qinqon force-pushed the kubevirt-conformance-sig-kubevirt-only branch from 56af2f1 to c40c744 Compare May 8, 2026 10:35
@qinqon qinqon changed the title ovn-kubernetes: scope hypershift-kubevirt conformance to sig-kubevirt ovn-kubernetes, cluster-network-operator: scope hypershift-kubevirt conformance to sig-kubevirt May 8, 2026
@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented May 8, 2026

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: qinqon
Once this PR has been reviewed and has the lgtm label, please assign tssurya for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

…nformance to sig-kubevirt

The e2e-aws-ovn-hypershift-kubevirt (ovn-kubernetes) and
e2e-aws-hypershift-ovn-kubevirt (cluster-network-operator) tests
inherit the default TEST_SUITE 'openshift/conformance/parallel' and
run the entire upstream OpenShift parallel conformance suite (~1500
tests) on top of a HyperShift hosted cluster whose worker nodes are
KubeVirt VMs nested in AWS.

This is both wasteful and fragile:
- The KubeVirt-on-AWS nested topology is sensitive to infra hiccups;
  any transient VM/storage stall fails dozens of unrelated tests
  simultaneously (DNS, oauth, services, sysctl, cpu_partitioning, ...)
  and turns a single infra glitch into a job-wide red signal.
- The job names ('hypershift-kubevirt') advertise kubevirt-specific
  coverage, but the only kubevirt-specific tests in the suite are
  '[sig-kubevirt]'. Everything else is generic conformance also run
  by many other gates.

Match the existing pattern from
openshift/hypershift/openshift-hypershift-release-4.18__periodics.yaml
which already scopes its kubevirt conformance run to:

  TEST_SUITE: openshift/conformance/parallel/minimal
  TEST_INCLUDES: sig-kubevirt

Both jobs use the same 'hypershift-kubevirt-conformance' workflow,
so the same env override applies. Scope:
- ovn-kubernetes: release-4.18, release-4.19, release-4.20 (job
  does not exist on later branches).
- cluster-network-operator: release-4.18 through release-5.1, plus
  master.

Expected impact: significantly faster runs, far fewer false-failure
test cases when the underlying KubeVirt infra hiccups, and a focused
gate signal aligned with the job's name.

Assisted-By: Claude <noreply@anthropic.com>
Signed-off-by: Enrique Llorente <ellorent@redhat.com>
@qinqon qinqon force-pushed the kubevirt-conformance-sig-kubevirt-only branch from c40c744 to 3669cef Compare May 8, 2026 10:38
@openshift-merge-bot
Copy link
Copy Markdown
Contributor

[REHEARSALNOTIFIER]
@qinqon: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

Test name Repo Type Reason
pull-ci-openshift-cluster-network-operator-release-5.1-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-cluster-network-operator-release-4.19-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-cluster-network-operator-release-4.18-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-cluster-network-operator-release-4.20-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-cluster-network-operator-master-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-cluster-network-operator-release-4.21-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-cluster-network-operator-release-4.22-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-cluster-network-operator-release-4.23-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-cluster-network-operator-release-5.0-e2e-aws-hypershift-ovn-kubevirt openshift/cluster-network-operator presubmit Ci-operator config changed
pull-ci-openshift-ovn-kubernetes-release-4.18-e2e-aws-ovn-hypershift-kubevirt openshift/ovn-kubernetes presubmit Ci-operator config changed
pull-ci-openshift-ovn-kubernetes-release-4.19-e2e-aws-ovn-hypershift-kubevirt openshift/ovn-kubernetes presubmit Ci-operator config changed
pull-ci-openshift-ovn-kubernetes-release-4.20-e2e-aws-ovn-hypershift-kubevirt openshift/ovn-kubernetes presubmit Ci-operator config changed
Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

@qinqon
Copy link
Copy Markdown
Contributor Author

qinqon commented May 8, 2026

/pj-rehearse

@openshift-merge-bot
Copy link
Copy Markdown
Contributor

@qinqon: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented May 8, 2026

@qinqon: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/rehearse/openshift/cluster-network-operator/release-4.21/e2e-aws-hypershift-ovn-kubevirt 3669cef link unknown /pj-rehearse pull-ci-openshift-cluster-network-operator-release-4.21-e2e-aws-hypershift-ovn-kubevirt
ci/rehearse/openshift/cluster-network-operator/release-4.23/e2e-aws-hypershift-ovn-kubevirt 3669cef link unknown /pj-rehearse pull-ci-openshift-cluster-network-operator-release-4.23-e2e-aws-hypershift-ovn-kubevirt
ci/rehearse/openshift/cluster-network-operator/master/e2e-aws-hypershift-ovn-kubevirt 3669cef link unknown /pj-rehearse pull-ci-openshift-cluster-network-operator-master-e2e-aws-hypershift-ovn-kubevirt
ci/rehearse/openshift/cluster-network-operator/release-5.1/e2e-aws-hypershift-ovn-kubevirt 3669cef link unknown /pj-rehearse pull-ci-openshift-cluster-network-operator-release-5.1-e2e-aws-hypershift-ovn-kubevirt
ci/rehearse/openshift/cluster-network-operator/release-4.20/e2e-aws-hypershift-ovn-kubevirt 3669cef link unknown /pj-rehearse pull-ci-openshift-cluster-network-operator-release-4.20-e2e-aws-hypershift-ovn-kubevirt

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant