GH-3215: Make the order of encodings in column metadata deterministic #3216

Closed
handmadecode wants to merge 1 commit into apache:master from handmadecode:gh-3215-sort-format-encodings

Conversation

@handmadecode
Contributor

Rationale for this change

Two processes running the same version of parquet-java with identical writer configurations can still produce files that differ at the binary level for exactly the same written data.

What changes are included in this PR?

A single line added to org.apache.parquet.format.converter.ParquetMetadataConverter::toFormatEncodings
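The PR text does not show the added line itself, so the following is only a minimal sketch of the general idea, assuming the fix amounts to sorting the encodings before they are written to the column metadata. The class `EncodingOrderSketch`, its nested `Encoding` enum, and the method `toSortedEncodings` are hypothetical stand-ins, not parquet-java code. The sketch relies on the fact that a `HashSet` of enum values has no guaranteed iteration order (enum hash codes are identity-based and can differ between JVM runs), while sorting by the enum's natural order always yields the same sequence.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

final class EncodingOrderSketch {

    // Hypothetical stand-in for the encoding enum; not the real parquet-java type.
    enum Encoding { PLAIN, RLE, BIT_PACKED, PLAIN_DICTIONARY, RLE_DICTIONARY }

    // Copies the set into a list and sorts it by the enum's natural (ordinal) order,
    // so the result does not depend on the set's iteration order.
    static List<Encoding> toSortedEncodings(Set<Encoding> encodings) {
        List<Encoding> result = new ArrayList<>(encodings);
        Collections.sort(result);
        return result;
    }

    public static void main(String[] args) {
        Set<Encoding> encodings = new HashSet<>();
        encodings.add(Encoding.RLE_DICTIONARY);
        encodings.add(Encoding.PLAIN);
        encodings.add(Encoding.RLE);
        // Prints [PLAIN, RLE, RLE_DICTIONARY] regardless of insertion or hash order.
        System.out.println(toSortedEncodings(encodings));
    }
}
```

Sorting at the point where the list is built keeps the change local: callers can keep collecting encodings in any set type, and only the serialized order becomes stable.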

Are these changes tested?

A unit test is provided. I have also tested this change in a custom-built parquet-java with our production code for several months.

Are there any user-facing changes?

No

Closes #3215

…eated by org.apache.parquet.format.converter.ParquetMetadataConverter::toFormatEncodings
@wgtmac
Member

wgtmac commented Feb 27, 2026

Sorry for overlooking this! Could you please rebase it to re-trigger CI?

@mbrannstrom
Contributor

@wgtmac See PR #3413 which is a rebase of PR #3216.

@Fokko
Contributor

Fokko commented Mar 7, 2026

Merged in #3413 (review). Nevertheless, thanks @handmadecode for working on this.

@Fokko closed this Mar 7, 2026

4 participants