Skip to content

feat: add sort_array benchmark#3758

Merged
comphead merged 1 commit intoapache:mainfrom
grorge123:sort_array_benchmark
Mar 22, 2026
Merged

feat: add sort_array benchmark#3758
comphead merged 1 commit intoapache:mainfrom
grorge123:sort_array_benchmark

Conversation

@grorge123
Copy link
Contributor

Which issue does this PR close?

Add microbenchmark for sort_array (#3706)

Rationale for this change

Add a new microbenchmark to compare the performance between Comet and Spark.

What changes are included in this PR?

Each benchmark will sort 4,194,304 rows. Each array contains integer and null values.
This PR includes four benchmarks.

  • ascending sort
  • descending sort
  • more large array for ascending sort
  • only take the first value for the sort array (Reduce the memory read impact)

How are these changes tested?

Run Benchmark: SPARK_GENERATE_BENCHMARK_FILES=1 make benchmark-org.apache.spark.sql.benchmark.CometArrayExpressionBenchmark
image

Copy link
Member

@andygrove andygrove left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for adding this @grorge123

@comphead comphead merged commit aee298d into apache:main Mar 22, 2026
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants