Split batch on "request_size is too large" errors #3673
tonyhb wants to merge 1 commit into google:master
Conversation
We currently estimate the size of the batch and limit batch sizes to <= 10MB for GCP PubSub (google#3005). This estimate can be wrong (by e.g. ~30KB). In these cases, we don't want data loss. Instead, if the batch is too large, we split the batch in half and send both halves. This obviously assumes the estimate isn't off by a factor of 2, which we've never seen in prod. Ideally, we wouldn't make the original request at all and would instead inspect the size of the protobuf before sending it over the wire, but that requires changes to Google Cloud's apiv1 driver, so it's out of scope for gocloud.dev. Instead, we deal with the failing outbound request and then attempt to recover.
For clarity, this is an example of a production error encountered using v0.40.0 of this library:
This PR should fix that issue.
// estimates can be off, and some production use cases can have ~30KB of data over the 10MB limit
// when sending batches.
//
// in this case, if we ever get "message too large" errors, split the batch in half and send
Do we have any way to test the splitting logic?
BrunoScheufler left a comment
Looks good. Wondering if we can test the splitting logic itself to make sure there's no weird index magic going on and that we don't accidentally drop events because of it.
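One way to address this review comment would be a small check that the split indices never drop or reorder anything, for both odd and even batch sizes. This is a hedged sketch: `splitBatch` is a hypothetical helper mirroring the split-in-half index arithmetic, not a function from the PR.

```go
package main

import "fmt"

// splitBatch mirrors the split-in-half index arithmetic: the first
// half gets msgs[:mid], the second gets msgs[mid:].
func splitBatch(msgs []int) ([]int, []int) {
	mid := len(msgs) / 2
	return msgs[:mid], msgs[mid:]
}

func main() {
	// Exercise odd and even batch sizes, including the degenerate n=1.
	for n := 1; n <= 5; n++ {
		msgs := make([]int, n)
		for i := range msgs {
			msgs[i] = i
		}
		a, b := splitBatch(msgs)
		if len(a)+len(b) != n {
			panic("split dropped messages")
		}
		// Recombine and verify order and contents survive the split.
		combined := append(append([]int{}, a...), b...)
		for i, v := range combined {
			if v != i {
				panic("split reordered or lost a message")
			}
		}
	}
	fmt.Println("ok")
}
```

Because `mid = len/2` and the two slices are `[:mid]` and `[mid:]`, coverage of the index space is exact by construction; the loop above just makes that explicit for small n.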
We can definitely lower the max batch size, and yes, I don't know how I missed v2. Thanks. Want me to decrease the max size in this PR?
Sure. The intent was that this cap would mean we ~never hit the batch size limit, so if you're hitting it, lowering it a bit is probably the right thing to do, and is simpler than retrying after a split. I don't recall for sure, but I believe v2 doesn't have this cap, so it may not be an issue at all if you switch.
edit:
This is specifically for GCP Pub/Sub. If sending the second half fails, we'll attempt to retransmit the entire batch, leading to duplicate messages for the first half. Given GCP Pub/Sub is at-least-once, I actually don't care about that: I'd much rather have duplicate messages than data loss, since we already know we may get dupes.