Skip to content

[AURON #2248] Support native Hudi scan for MOR read-optimized queries#2249

Draft
weimingdiit wants to merge 2 commits intoapache:masterfrom
weimingdiit:feat/hudi-mor-read-optimized-native-scan
Draft

[AURON #2248] Support native Hudi scan for MOR read-optimized queries#2249
weimingdiit wants to merge 2 commits intoapache:masterfrom
weimingdiit:feat/hudi-mor-read-optimized-native-scan

Conversation

@weimingdiit
Copy link
Copy Markdown
Contributor

Which issue does this PR close?

Closes #2248

Rationale for this change

Hudi MOR read-optimized queries only read base files and do not require log-file merging. Auron can therefore reuse the existing native Parquet/ORC scan path for this query mode.

What changes are included in this PR?

  • Detect Hudi query type from hoodie.datasource.query.type.
  • Also support the compatible hoodie.datasource.view.type option.
  • Allow MOR native scan only for explicit read_optimized queries.
  • Keep snapshot, incremental, realtime, and time travel queries on the Spark fallback path.
  • Add UT coverage for MOR read-optimized support detection and native scan execution plan.

Are there any user-facing changes?

Yes. MOR read-optimized Hudi queries can now use native Parquet/ORC scan when supported.

How was this patch tested?

  • Added unit test coverage for MOR read-optimized support detection.
  • Added correctness and native scan plan validation for a MOR read-optimized Hudi table scan.

Signed-off-by: weimingdiit <weimingdiit@gmail.com>
…ueries

Signed-off-by: weimingdiit <weimingdiit@gmail.com>
@weimingdiit weimingdiit force-pushed the feat/hudi-mor-read-optimized-native-scan branch from 4e67352 to 7337bbc Compare May 9, 2026 05:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Support native Hudi scan for MOR read-optimized queries

1 participant