Skip to content

Antalya 26.3: Fix export task not being killed during s3 outage#1744

Open
zvonand wants to merge 1 commit intoantalya-26.3from
feature/antalya-26.3/pr-1564
Open

Antalya 26.3: Fix export task not being killed during s3 outage#1744
zvonand wants to merge 1 commit intoantalya-26.3from
feature/antalya-26.3/pr-1564

Conversation

@zvonand
Copy link
Copy Markdown
Collaborator

@zvonand zvonand commented May 6, 2026

Changelog category (leave one):

  • Bug Fix (user-visible misbehavior in an official stable release)

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

The drop table operation must signal cancellation to all background tasks and wait until they ack it. This is done checking the is_cancelled flag at each pipeline iteration. If S3 is unreachable and s3_retries_attempt is big (by default, it is 500), the pipeline gets stuck deep in the AWS SDK and never gets a chance to check the signal / flag. Making the task "unkillable".

This PR fixes it in a hackish way by overwriting the query_is_cancelled_predicate, which is checked by the S3 client retry strategy upon ShouldRetry (#1564 by @arthurpassos).

CI/CD Options

Exclude tests:

  • Fast test
  • Integration Tests
  • Stateless tests
  • Stateful tests
  • Performance tests
  • All with ASAN
  • All with TSAN
  • All with MSAN
  • All with UBSAN
  • All with Coverage
  • All with Aarch64
  • All Regression
  • Disable CI Cache

Regression jobs to run:

  • Fast suites (mostly <1h)
  • Aggregate Functions (2h)
  • Alter (1.5h)
  • Benchmark (30m)
  • ClickHouse Keeper (1h)
  • Iceberg (2h)
  • LDAP (1h)
  • Parquet (1.5h)
  • RBAC (1.5h)
  • SSL Server (1h)
  • S3 (2h)
  • S3 Export (2h)
  • Swarms (30m)
  • Tiered Storage (2h)

Cherry-picked from #1564.


Documentation entry for user-facing changes

...

@zvonand zvonand added the releasy Created/managed by RelEasy label May 6, 2026
@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 6, 2026

Workflow [PR], commit [f5d52c7]

@arthurpassos
Copy link
Copy Markdown
Collaborator

oh yeah, that's an important one as well

@zvonand zvonand force-pushed the feature/antalya-26.3/pr-1564 branch 3 times, most recently from b2b724d to 15d7cc1 Compare May 7, 2026 10:42
…t_from_being_cancelled

Fix export task not being killed during s3 outage
@zvonand zvonand force-pushed the feature/antalya-26.3/pr-1564 branch from 15d7cc1 to f5d52c7 Compare May 7, 2026 11:40
Copy link
Copy Markdown
Collaborator

@arthurpassos arthurpassos left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

releasy Created/managed by RelEasy

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants