neuron-ci: pin AZ and fix operator wait ordering for inf2 instances#76528
Conversation
The rosa-cluster-wait-ready-operators step was timing out because inf2.8xlarge worker nodes were not ready before operator health checks ran. Reorder to wait for nodes before operators, and pin ZONES_LIST to us-east-1a to ensure consistent AZ with inf2 capacity. Made-with: Cursor
|
/override ci/prow/step-registry-shellcheck |
|
@ybrodsky-rh: ybrodsky-rh unauthorized: /override is restricted to Repo administrators, approvers in top level OWNERS file, and the following github teams:openshift: openshift-release-oversight openshift-staff-engineers openshift-sustaining-engineers. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
|
/pj-rehearse ack |
|
@ybrodsky-rh: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel. |
|
/pj-rehearse pull-ci-rh-ecosystem-edge-neuron-ci-main-4.20-stable-aws-neuron-operator-e2e |
|
@ybrodsky-rh: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel. |
|
/pj-rehearse test |
|
@ybrodsky-rh: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel. |
|
@ybrodsky-rh: job(s): test either don't exist or were not found to be affected, and cannot be rehearsed |
Variables name and mac are dynamically assigned via sourced yq output, so shellcheck cannot trace the assignment. Suppress SC2154 for the loop that references them. Made-with: Cursor
|
[REHEARSALNOTIFIER]
A total of 53 jobs have been affected by this change. The above listing is non-exhaustive and limited to 25 jobs. A full list of affected jobs can be found here Interacting with pj-rehearseComment: Once you are satisfied with the results of the rehearsals, comment: |
|
/lgtm |
|
/pj-rehearse ack |
|
@ybrodsky-rh: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel. |
|
/pj-rehearse approve |
|
@ybrodsky-rh: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel. |
|
@ybrodsky-rh: job(s): approve either don't exist or were not found to be affected, and cannot be rehearsed |
|
/approve |
|
/pj-rehearse ack |
|
@ybrodsky-rh: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel. |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: droslean, ybrodsky-rh, yevgeny-shnaidman The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
The rosa-cluster-wait-ready-operators step was timing out because inf2.8xlarge worker nodes were not ready before operator health checks ran. Reorder to wait for nodes before operators, and pin ZONES_LIST to us-east-1a to ensure consistent AZ with inf2 capacity.