feat(iavl): add KV data reader & writer, and mmap wrapper #25645

aaronc · 2025-12-04T18:31:48Z

Description

This PR specifies the IAVLX KV data file format for storing the WAL as well as other key-value data (branch node keys and compacted changeset KV data), and implements the KVDataReader, KVDataWriter and WALReader types. It also adds the convenience FileWriter and Mmap wrapper types.

One design question for reviewers is whether we should proactively limit key and value size. I would suggest a key limit of 2^16-1 (64KB) and a value limit of 2^24-1 (16MB). Currently, this KV data file uses 32-bit offsets which limits its size to 4gb before we have to roll over. When initially writing changesets, we should probably roll over around 1 or 2gb and then compact up to 4gb. If, however, while writing a version we ran out of space, the node would crash non-deterministically. This is unlikely to happen if we roll over at 1 or 2gb unless someone introduces some really large unexpected KV data. Setting a limit to key and value size would be consensus breaking (unlikely to ever get triggered in practice), but would make such pathological scenarios cause nodes to fail more deterministically based on validation rather than just running out of disk space. We could also explore larger offsets of 40-64bits, but the larger the kv.dat file is, the more extra disk space we need when doing compaction. And also really large key/value data should probably be considered pathological anyway. Any thoughts on all of this?

…avlx-init

…-init2

…-part6

iavl/internal/kvdata_writer.go

+	"fmt"
+	"math"
+	"os"
+	"unsafe"


…-part6

codecov · 2025-12-05T15:28:23Z

Codecov Report

❌ Patch coverage is 85.07463% with 50 lines in your changes missing coverage. Please review.
✅ Project coverage is 70.48%. Comparing base (18e85de) to head (938c3c3).

Files with missing lines	Patch %	Lines
iavl/internal/kvdata_reader.go	82.97%	24 Missing ⚠️
iavl/internal/kvdata_writer.go	86.71%	19 Missing ⚠️
iavl/internal/file_writer.go	71.42%	4 Missing ⚠️
iavl/internal/mmap.go	93.33%	2 Missing ⚠️
iavl/internal/changeset_info.go	85.71%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #25645      +/-   ##
==========================================
+ Coverage   70.40%   70.48%   +0.08%     
==========================================
  Files         830      834       +4     
  Lines       54050    54380     +330     
==========================================
+ Hits        38052    38332     +280     
- Misses      15998    16048      +50

Files with missing lines	Coverage Δ
iavl/internal/leaf_layout.go	`66.66% <ø> (ø)`
iavl/internal/mem_node.go	`94.23% <ø> (ø)`
iavl/internal/changeset_info.go	`84.21% <85.71%> (-4.03%)`	⬇️
iavl/internal/mmap.go	`93.33% <93.33%> (ø)`
iavl/internal/file_writer.go	`71.42% <71.42%> (ø)`
iavl/internal/kvdata_writer.go	`86.71% <86.71%> (ø)`
iavl/internal/kvdata_reader.go	`82.97% <82.97%> (ø)`

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

aaronc · 2025-12-05T18:39:46Z

iavl/internal/changeset_info.go

+	if unsafe.Sizeof(ChangesetInfo{}) != sizeChangesetInfo {
+		panic(fmt.Sprintf("invalid ChangesetInfo size: got %d, want %d", unsafe.Sizeof(ChangesetInfo{}), sizeChangesetInfo))
+	}
+}


This was missing in the previous PR

aaronc · 2025-12-05T18:45:36Z

iavl/internal/leaf_layout.go


+	// ValueOffset is the offset the value data for this node in the key value data file.
+	// The same size considerations apply here as for KeyOffset.
+	ValueOffset uint32


In order to efficiently cache keys, we need to allow key and value bytes to be non-contiguous in the data file. Adding a separate value offset allows us to put key and value data wherever we want to. Hopefully, the additional 4 bytes per leaf node is offset by more key caching in the kv data file.

aaronc · 2025-12-05T18:49:12Z

iavl/internal/mmap.go

+	"io"
+	"os"
+)
+import "github.com/edsrzf/mmap-go"


For now I am using this off-the-shelf mmap wrapper which has the highest number of known importers on pkg.go.dev: https://pkg.go.dev/github.com/edsrzf/mmap-go?tab=importedby

In the future, it may be worth considering creating our own mmap wrapper. On linux, it may be possible to apply an optimization where we can resize the mmap without unmapping memory: https://stackoverflow.com/questions/74243583/memory-map-file-with-growing-size

github-actions · 2025-12-05T19:12:19Z

@aaronc your pull request is missing a changelog!

aljo242 · 2025-12-05T20:31:09Z

@aaronc a few more linter compaints

aaronc added 30 commits December 1, 2025 16:19

feat(iavl): initialize disk layout

b27e5d3

change package

3599010

switch size to uint40, update docs and tests

e581282

update docs

abf0976

Merge branch 'main' into aaronc/iavlx-init

7fcba13

reorder code

a716e2f

Merge remote-tracking branch 'origin/aaronc/iavlx-init' into aaronc/i…

ca25571

…avlx-init

documented Uint40 endianness and added fmt.Stringer

cccd14b

feat(iavl): add Node, MemNode, and NodePointer

07d0d5e

switch table tests to key: value struct init

a65f1c6

Merge branch 'aaronc/iavlx-init' into aaronc/iavlx-init2

66d3dee

add basic mem node getter tests

b212f96

adding mutation, hash, verification code, basic tests

e6ccd8d

Merge branch 'main' of github.com:cosmos/cosmos-sdk into aaronc/iavlx…

fab53dc

…-init2

add tests

d19f0e7

Merge branch 'main' of github.com:cosmos/cosmos-sdk into aaronc/iavlx…

95318bb

…-init2

reduce PR size

90bad56

add get tests and update docs

e37410b

add more test explanations

0564bdb

update doc, add missing test

f4607bf

feat(iavl): define KV data format

78faae5

WIP on kv data design

e8cfa93

WIP on kv data design

3ff4da5

WIP on kv data writer

c75dde0

WIP on kv data writer

aea0c86

Merge branch 'main' of github.com:cosmos/cosmos-sdk into aaronc/iavlx…

6d28292

…-part6

update leaf size, add missing ChangesetInfo size check

8a74b7a

add Mmap

9b4a396

implement KVDataReader

b6b140d

fixes to KVDataReader

8f72673

aaronc added 2 commits December 4, 2025 13:29

fixes to KVDataReader

647f932

Merge branch 'main' of github.com:cosmos/cosmos-sdk into aaronc/iavlx…

b0efee3

…-part6

github-advanced-security bot found potential problems Dec 4, 2025

View reviewed changes

iavl/internal/kvdata_writer.go

"fmt"

"math"

"os"

"unsafe"

Check notice

Code scanning / CodeQL

Sensitive package import Note

Certain system packages contain functions which may be a possible source of non-determinism

aaronc added 2 commits December 5, 2025 10:16

Merge branch 'main' of github.com:cosmos/cosmos-sdk into aaronc/iavlx…

b0b1dd1

…-part6

add tests, update go.mod's

f42a7af

aaronc added 5 commits December 5, 2025 13:19

WIP on tests, add WAL mode checks

4156068

WIP on tests

b76da6b

WIP on tests

a34a174

document FileWriter

db113dc

add empty key/value test cases

493430b

aaronc commented Dec 5, 2025

View reviewed changes

add mmap tests, fix empty close bug, minor cleanups

9db9564

aaronc marked this pull request as ready for review December 5, 2025 19:11

lint fix

951c767

fix lint

938c3c3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(iavl): add KV data reader & writer, and mmap wrapper #25645

feat(iavl): add KV data reader & writer, and mmap wrapper #25645

Uh oh!

aaronc commented Dec 4, 2025 •

edited

Loading

Uh oh!

Check notice

codecov bot commented Dec 5, 2025 •

edited

Loading

Uh oh!

aaronc Dec 5, 2025

Uh oh!

aaronc Dec 5, 2025

Uh oh!

aaronc Dec 5, 2025

Uh oh!

github-actions bot commented Dec 5, 2025

Uh oh!

aljo242 commented Dec 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat(iavl): add KV data reader & writer, and mmap wrapper #25645

Are you sure you want to change the base?

feat(iavl): add KV data reader & writer, and mmap wrapper #25645

Uh oh!

Conversation

aaronc commented Dec 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

Check notice

codecov bot commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

aaronc Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

aaronc Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

aaronc Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Dec 5, 2025

Uh oh!

aljo242 commented Dec 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

aaronc commented Dec 4, 2025 •

edited

Loading

codecov bot commented Dec 5, 2025 •

edited

Loading