[metrics] Janitor cleanup for parquet files by mattmkim · Pull Request #6248 · quickwit-oss/quickwit

mattmkim · 2026-03-30T18:59:05Z

Description

This PR can be reviewed commit by commit.

Enables the janitor to clean up parquet files, for metrics indexes. Functionally, should be the same as tantivy split cleanup.

How was this PR tested?

Describe how you tested this PR.

…s helper

… retention config

mattmkim · 2026-03-30T21:13:44Z

quickwit/quickwit-index-management/src/parquet_garbage_collection.rs

+
+/// Deletes a single batch of parquet splits from storage and metastore.
+/// Returns (succeeded, failed).
+async fn delete_parquet_splits_from_storage_and_metastore(


mimics logic in delete_splits_from_storage_and_metastore

mattmkim · 2026-03-31T21:34:30Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 1d08467270

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-31T21:42:11Z

quickwit/quickwit-index-management/src/parquet_garbage_collection.rs

+            Err(err) => {
+                error!(index_uid=%index_uid, error=?err, "failed to list metrics splits");
+                break;


Propagate metastore list failures from parquet GC

When list_metrics_splits fails, this branch only logs and breaks, and delete_marked_parquet_splits later returns Ok(removal_info); run_parquet_garbage_collect therefore reports success even when no cleanup could be performed. In production metastore outages for metrics indexes, janitor success counters/metrics are incremented and operators lose the failure signal, so parquet GC can be silently ineffective for entire runs.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-31T21:42:11Z

quickwit/quickwit-janitor/src/retention_policy_execution.rs

+    let query = ListMetricsSplitsQuery::for_index(index_uid.clone())
+        .with_max_time_range_end(max_retention_timestamp);


Paginate parquet retention scans before marking splits

This retention path queries expired metrics splits without a limit/cursor and then marks all returned split IDs in one request, which scales poorly compared to the paginated GC path added in this commit. On indexes with many expired parquet splits, the response/request payload can become large enough to hit RPC/message-size limits or memory pressure, causing the retention execution to fail and leave old data unmarked.

Useful? React with 👍 / 👎.

mattmkim force-pushed the matthew.kim/parquet-janitor branch from d51894c to 37162f3 Compare March 30, 2026 20:56

mattmkim added 6 commits March 30, 2026 17:11

Add pagination and filtering fields to ListMetricsSplitsQuery

983679c

Extract shared storage deletion logic into reusable delete_split_file…

4c559d3

…s helper

Add parquet garbage collection core with two-phase mark-and-delete

63f8743

Integrate parquet GC and retention policy into janitor actors

aef848f

Add CLI support for parquet GC, IndexService method, and OTEL metrics…

bc3e7af

… retention config

Fix rustfmt formatting

fa5eee3

mattmkim force-pushed the matthew.kim/parquet-janitor branch from 993d043 to fa5eee3 Compare March 30, 2026 21:11

mattmkim changed the title ~~[draft] parquet janitor~~ [metrics] Janitor cleanup for parquet files Mar 30, 2026

mattmkim commented Mar 30, 2026

View reviewed changes

mattmkim marked this pull request as ready for review March 30, 2026 21:27

mattmkim added 3 commits March 31, 2026 09:41

Merge branch 'main' into matthew.kim/parquet-janitor

afb739c

Merge branch 'main' into matthew.kim/parquet-janitor

7629d2e

appease linter

1d08467

chatgpt-codex-connector bot reviewed Mar 31, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[metrics] Janitor cleanup for parquet files#6248

[metrics] Janitor cleanup for parquet files#6248
mattmkim wants to merge 9 commits intomainfrom
matthew.kim/parquet-janitor

mattmkim commented Mar 30, 2026 •

edited

Loading

Uh oh!

mattmkim Mar 30, 2026

Uh oh!

mattmkim commented Mar 31, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 31, 2026

Uh oh!

chatgpt-codex-connector bot Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		let query = ListMetricsSplitsQuery::for_index(index_uid.clone())
		.with_max_time_range_end(max_retention_timestamp);

Conversation

mattmkim commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

How was this PR tested?

Uh oh!

mattmkim Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

mattmkim commented Mar 31, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mattmkim commented Mar 30, 2026 •

edited

Loading