
Conversation

@ramaraochavali
Contributor

Adds priority to WorkloadEntry. This is especially useful to set up active/passive failover between two DNS-based clusters. When the endpoints at priority 0 become unhealthy, traffic automatically moves to priority 1.

Signed-off-by: Rama Chavali <rama.rao@salesforce.com>
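
For illustration, here is a minimal sketch of the active/passive setup this change targets, assuming the new field is named `priority` on the inline endpoint (the hostnames and port are made up, and the exact field name and placement are what this discussion is about):

```yaml
# Hypothetical sketch of the proposed field; not settled API.
apiVersion: networking.istio.io/v1
kind: ServiceEntry
metadata:
  name: active-passive-example
spec:
  hosts:
  - failover.example.com
  ports:
  - number: 443
    name: https
    protocol: TLS
  location: MESH_EXTERNAL
  resolution: DNS
  endpoints:
  - address: primary.example.com
    priority: 0   # active: receives traffic while healthy
  - address: backup.example.com
    priority: 1   # passive: used only when all priority-0 endpoints are unhealthy
```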
@ramaraochavali ramaraochavali requested a review from a team as a code owner December 26, 2025 04:54
@istio-policy-bot

😊 Welcome @ramaraochavali! This is either your first contribution to the Istio api repo, or it's been a while since you've been here.

You can learn more about the Istio working groups, Code of Conduct, and contribution guidelines by referring to Contributing to Istio.

Thanks for contributing!

Courtesy of your friendly welcome wagon.

@istio-testing istio-testing added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Dec 26, 2025
@zirain zirain added the release-notes-none Indicates a PR that does not require release notes. label Dec 26, 2025
@howardjohn
Member

How does this relate to the many, many other 'priority' mechanisms we have? What if it's priority=3 but in another region with locality LB enabled?

This seems like it could be done today with DR failoverPriority? https://istio.io/latest/docs/reference/config/networking/destination-rule/#LocalityLoadBalancerSetting-failover_priority

@ramaraochavali
Contributor Author

> This seems like it could be done today with DR failoverPriority?

I do not think this can be solved by a DR. This is for endpoints matched by a DNS cluster, so they do not have k8s labels associated with them.

> How does this relate to the many, many other 'priority' mechanisms we have? What if it's priority=3 but in another region with locality LB enabled?

The priority specified here would be taken into consideration for the DNS endpoints, overriding the locality LB settings specified in the DR. As I mentioned, this is mainly useful for WorkloadEntries resolved via DNS, to implement failover across two DNS clusters.

@howardjohn
Member

> I do not think this can be solved by a DR. This is for endpoints matched by a DNS cluster, so they do not have k8s labels associated with them.

What do you mean? It's a WorkloadEntry, it has labels?
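
For reference, a standalone WorkloadEntry does carry labels in its spec; a minimal sketch with hypothetical name, address, and label values:

```yaml
apiVersion: networking.istio.io/v1
kind: WorkloadEntry
metadata:
  name: external-vm            # hypothetical
spec:
  address: 10.0.0.5            # hypothetical
  labels:
    app: legacy-backend
    loadbalancing-priority: "0"   # a label a DR failoverPriority could match on
```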

@ramaraochavali
Contributor Author

It is the same WorkloadEntry that has different DNS addresses as endpoints. It is used in the context of a ServiceEntry as two different endpoints.

@howardjohn
Member

How can one WorkloadEntry have multiple DNS addresses? Maybe showing an example would help me understand.

@ramaraochavali
Contributor Author

Sorry, I meant two WorkloadEntries used in the context of a ServiceEntry as two different DNS endpoints. We could create synthetic labels for each to identify priority, but that sounds like a very indirect way of doing it for external endpoints.
I will write an example and share it.

@ramaraochavali
Contributor Author

Here is the ServiceEntry I want to create, with two external endpoints. My goal is to have the google.com endpoints at priority 0 and the salesforce.com endpoints at priority 1.

```yaml
apiVersion: networking.istio.io/v1
kind: ServiceEntry
metadata:
  name: service-entry-example
  namespace: istio-system
spec:
  hosts:
  - failover.example.com
  ports:
  - number: 27018
    name: http-port
    protocol: HTTP
  location: MESH_EXTERNAL
  resolution: DNS
  endpoints:
  - address: www.google.com
    labels:
      loadbalancing-priority: "0"
  - address: www.salesforce.com
    labels:
      loadbalancing-priority: "1"
```

I want to configure a DR so that when the google.com endpoints become unavailable, traffic goes through the salesforce.com endpoints. Here is the DR I tried. But the priorities won't change based on failoverPriority, because the endpoints are resolved by Envoy.

```yaml
apiVersion: networking.istio.io/v1
kind: DestinationRule
metadata:
  name: failover-example
  namespace: istio-system
spec:
  host: failover.example.com
  trafficPolicy:
    outlierDetection:
      consecutive5xxErrors: 1
      interval: 15s
      baseEjectionTime: 1m
    loadBalancer:
      localityLbSetting:
        enabled: true
        failoverPriority:
        - "loadbalancing-priority=0"
```

Am I missing something here that would make this setup work with the existing priority settings?

@hzxuzhonghu
Member

Even if you set priority in the WLE, you cannot achieve auto failover by priority.

> I want to configure a DR so that when the google.com endpoints become unavailable, traffic goes through the salesforce.com endpoints. Here is the DR I tried. But the priorities won't change based on failoverPriority, because the endpoints are resolved by Envoy.

This is because we do not support DNS cluster failover yet. I haven't tried whether Envoy works in this case:

```go
	if c.LoadAssignment != nil {
		// TODO: enable failoverPriority for `STRICT_DNS` cluster type
		loadbalancer.ApplyLocalityLoadBalancer(c.LoadAssignment, nil, locality, proxyLabels, localityLB, enableFailover)
	}
```

@ramaraochavali
Contributor Author

> This is because we do not support DNS cluster failover yet. I haven't tried whether Envoy works in this case.

It works. I have validated that it does, if we set the correct priority on each LocalityLbEndpoint.
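
For context, here is a hand-written sketch of the kind of Envoy STRICT_DNS cluster that makes this work. The cluster name follows Istio's outbound naming convention, but the whole block is illustrative, not actual Istio output:

```yaml
name: outbound|27018||failover.example.com
type: STRICT_DNS
load_assignment:
  cluster_name: outbound|27018||failover.example.com
  endpoints:
  - priority: 0            # preferred: used while its hosts are healthy
    lb_endpoints:
    - endpoint:
        address:
          socket_address: { address: www.google.com, port_value: 27018 }
  - priority: 1            # failover: used once priority 0 becomes unhealthy
    lb_endpoints:
    - endpoint:
        address:
          socket_address: { address: www.salesforce.com, port_value: 27018 }
```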

@ramaraochavali
Contributor Author

BTW, the DR I created just to validate failoverPriority for DNS does not work today. If we add priority to the WLE and set it directly, we do not need DR changes.

@howardjohn
Member

> BTW, the DR I created just to validate failoverPriority for DNS does not work today. If we add priority to the WLE and set it directly, we do not need DR changes.

IMO this is a bug. There is no reason for it other than that the implementation in the control plane is wrong: we pass nil here (https://github.com/istio/istio/blob/2eae57b36d1740f8e44e54c0efde07bfb9c706c5/pilot/pkg/networking/core/cluster_traffic_policy.go#L329) but depend on it in the locality code.

@ramaraochavali
Contributor Author

ramaraochavali commented Jan 3, 2026

> IMO this is a bug. There is no reason for it other than that the implementation in the control plane is wrong: we pass nil here (https://github.com/istio/istio/blob/2eae57b36d1740f8e44e54c0efde07bfb9c706c5/pilot/pkg/networking/core/cluster_traffic_policy.go#L329) but depend on it in the locality code.

Yes, this can be fixed (though it is not as straightforward as EDS). But the point I am making is that, for DNS-type endpoints defined via Service Entries, creating synthetic labels to specify priority is less intuitive (and more indirect) than specifying the priority directly when defining the WLE. We can argue about whether we need a different way to set priority than relying on failoverPriority in the DR, but IMO it is simpler to specify the priority directly instead of having to manage labels, especially when there are more than two endpoints in the SE.

@hzxuzhonghu
Member

The main concern is that a WorkloadEntry can also be used as a separate object, not only as endpoints inlined in an SE. If we support priority in the spec, it lacks flexibility when used separately. Two options may be a compromise:

1. Introduce a new PriorityEndpoints field to the SE, which would deprecate Endpoints.
2. Do not change any API; just exploit the existing failoverPriority in the DR and fix the missing gap in the control plane.

@ramaraochavali
Contributor Author

> The main concern is that a WorkloadEntry can also be used as a separate object, not only as endpoints inlined in an SE. If we support priority in the spec, it lacks flexibility when used separately.

I agree with this. I think it partly has to do with how Resolution lives on the SE while WorkloadEntry is reused as-is. We could say that if priority is set on a WLE, it is honoured and the rest of the priority calculations are skipped for DNS-type SEs. Another option is to have a repeated priorities field in the SE that maps to the endpoints in the same order (see the sketch below), or to introduce a name in the WLE and have named priorities. None of these are great options, but they may be better than the synthetic-label approach.

If we decide not to add any API option, we can always suggest the DR option after fixing the bug (but I need to check how complicated the config would be if there are more than two endpoints).
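
A hypothetical sketch of the repeated-priorities option; the field name endpointPriorities is made up purely for illustration:

```yaml
# Hypothetical "repeated priorities" option: the list maps 1:1, in order,
# to the endpoints below. The field name is invented for this sketch.
spec:
  resolution: DNS
  endpointPriorities: [0, 1, 1]
  endpoints:
  - address: a.example.com   # would get priority 0
  - address: b.example.com   # would get priority 1
  - address: c.example.com   # would get priority 1
```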

@howardjohn
Member

howardjohn commented Jan 5, 2026

> But the point I am making is that, for DNS-type endpoints defined via Service Entries, creating synthetic labels to specify priority is less intuitive

Why not for all WorkloadEntry? Why not for Pod? Why not for EndpointSlice?

We have a way to do priority; I don't think we should have multiple conflicting ways. How do we explain to users which one to use? What happens when both are set?

@ramaraochavali
Contributor Author

> Why not for all WorkloadEntry? Why not for Pod? Why not for EndpointSlice?

It is mainly because, for those, priority can be determined by k8s labels (location, network, etc.) that are auto-set by controllers we have control over, and priority does not need to be set for each endpoint in the slice (only for groups of endpoints). But for DNS, the priority can be per endpoint and can vary based on many constraints, for example when you use an LLM as a DNS endpoint: cost, or a preferred LLM for certain workloads, and so on (sketched below). It may not always be possible to express these as labels.
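
To sketch the kind of per-endpoint constraint I mean (hypothetical providers, using the proposed priority field):

```yaml
# Hypothetical per-endpoint preference that is awkward to express as labels:
endpoints:
- address: api.preferred-llm.example.com    # preferred provider, e.g. lower cost
  priority: 0
- address: api.fallback-llm.example.com     # used when the preferred one fails
  priority: 1
```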

> How do we explain to users which one to use? What happens when both are set?

We can say that if an explicit priority is set, it is honoured.

Anyway, I am also trying in parallel to fix failoverPriority for DNS endpoints, to see if that helps.

