DataFusion-Federation: Union Flattening Across Executors #22168
Unanswered
vikrantmehta123
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I was experimenting with union pushdown behavior in DataFusion Federation and noticed something I wanted to clarify.
Setup:
t1,t2t3The initial logical plan looks like:
With
optimize_unionsenabled, this is flattened into:This results in three independent SELECTs being executed, with the UNION performed in-memory.
If I disable
optimize_unionsoptimizer rule, Executor 1 instead receives a single pushed-down SQL query:while Executor 2 executes its own SELECT, followed by one final in-memory UNION.
Is flattening unions across executor boundaries here an intentional optimizer choice? Would pushing down UNIONs be better?
Beta Was this translation helpful? Give feedback.
All reactions