Implement table & tree disk usage statistics #17169
Conversation
Pull request overview
This PR implements comprehensive disk usage statistics collection for both Tree Model (device-based) and Table Model databases in IoTDB. The implementation provides monitoring capabilities at the table/device level and time partition level.
Changes:
- Adds SHOW DISK_USAGE SQL statement for Tree Model with on-demand calculation
- Implements table_disk_usage information schema table for Table Model with persistent cache
- Introduces background task infrastructure for cache maintenance with periodic compaction
- Adds predicate push-down and limit/offset optimization support for information schema tables
Reviewed changes
Copilot reviewed 95 out of 95 changed files in this pull request and generated 33 comments.
| File | Description |
|---|---|
| pom.xml | Updates tsfile version to 2.2.1-260205-SNAPSHOT |
| TsFileID.java | Adds SHALLOW_SIZE constant (contains bug) |
| InformationSchema.java | Adds table_disk_usage schema and push-down support (contains bug) |
| TableDiskUsageCache*.java | Core cache implementation with writer/reader classes |
| ShowDiskUsageNode.java | Plan node for tree model disk usage queries |
| TableDiskUsageInformationSchemaTableScanNode.java | Plan node for table model information schema scans |
| ShowDiskUsageOperator.java | Execution operator for tree model |
| DiskUsageStatisticUtil.java | Base utility class for disk usage calculation |
| IoTDBDescriptor.java | Configuration support (contains bug) |
| Integration tests | Comprehensive tests for both tree and table models |
    public static boolean supportsPushDownLimitOffset(String tableName) {
      return columnsThatSupportPushDownPredicate.containsKey(tableName);
The method supportsPushDownLimitOffset checks if a key exists in columnsThatSupportPushDownPredicate instead of checking tablesThatSupportPushDownLimitOffset. This will cause incorrect behavior for tables that support limit/offset push-down but are not in the predicate push-down map. The condition should be tablesThatSupportPushDownLimitOffset.contains(tableName).
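A minimal sketch of the fix described above, using the field names from the snippet under review; how the set is populated is not shown here and happens elsewhere in InformationSchema:

```java
// Sketch only: field name taken from the snippet above; registration of table
// names into this set is done elsewhere in InformationSchema.
private static final Set<String> tablesThatSupportPushDownLimitOffset = new HashSet<>();

public static boolean supportsPushDownLimitOffset(String tableName) {
  // Consult the limit/offset set, not the predicate push-down map.
  return tablesThatSupportPushDownLimitOffset.contains(tableName);
}
```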
| "max_sub_task_num_for_information_table_scan", | ||
| Integer.toString(conf.getMaxSubTaskNumForInformationTableScan()))); | ||
| if (maxSubTaskNumForInformationTableScan > 0) { | ||
| conf.setMaxRowsInCteBuffer(maxSubTaskNumForInformationTableScan); |
Configuration property mismatch: The method sets maxRowsInCteBuffer instead of maxSubTaskNumForInformationTableScan. This will cause the wrong configuration property to be updated when loading hot-modified properties for max_sub_task_num_for_information_table_scan. The correct call should be conf.setMaxSubTaskNumForInformationTableScan(maxSubTaskNumForInformationTableScan).
Suggested change:
    -      conf.setMaxRowsInCteBuffer(maxSubTaskNumForInformationTableScan);
    +      conf.setMaxSubTaskNumForInformationTableScan(maxSubTaskNumForInformationTableScan);
              + ramBytesUsedOfTsFileIDOffsetMap();
    }

    // tsFileIDOffsetInValueFileMap should be null af first
Typo in comment: "af" should be "at". The comment should read "tsFileIDOffsetInValueFileMap should be null at first".
    public class TsFileID {

      public static final long SHALLOW_SIZE = TsFileID.SHALLOW_SIZE;
Circular reference detected: TsFileID.SHALLOW_SIZE is defined as TsFileID.SHALLOW_SIZE. This leaves the constant at its default value (likely 0) and produces incorrect memory estimation. The field should reference RamUsageEstimator.shallowSizeOfInstance(TsFileID.class) instead.
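A hedged sketch of the non-circular initialization the comment asks for; the RamUsageEstimator import shown is an assumption about which copy of that utility the project uses:

```java
import org.apache.lucene.util.RamUsageEstimator; // assumption: the project may ship its own copy of this utility

public class TsFileID {
  // Computed once from the class layout, instead of reading the constant
  // that is still being initialized (which would yield 0).
  public static final long SHALLOW_SIZE =
      RamUsageEstimator.shallowSizeOfInstance(TsFileID.class);
}
```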
    private static final Map<String, TsTable> schemaTables = new HashMap<>();
    private static final Map<String, Set<String>> columnsThatSupportPushDownPredicate =
        new HashMap<>();
    private static final Set<String> tablesThatSupportPushDownLimitOffset = new HashSet<>();
The contents of this container are never accessed.
    }

    @After
    public void tearDown() throws IOException, StorageEngineException {
This method overrides AbstractCompactionTest.tearDown; it is advisable to add an Override annotation.
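For illustration, the annotated form the comment asks for (the body is elided; only the annotation changes):

```java
@Override
@After
public void tearDown() throws IOException, StorageEngineException {
  // existing cleanup logic is unchanged; only the @Override annotation is added
}
```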
    private TsFileManager mockTsFileManager;

    @Before
    public void setUp()
This method overrides AbstractCompactionTest.setUp; it is advisable to add an Override annotation.
    }

    @After
    public void tearDown() throws IOException, StorageEngineException {
This method overrides AbstractCompactionTest.tearDown; it is advisable to add an Override annotation.
    private TsFileManager mockTsFileManager;

    @Before
    public void setUp()
This method overrides AbstractCompactionTest.setUp; it is advisable to add an Override annotation.
        DataRegionTableSizeQueryContext dataRegionContext, long startTime, long maxRunTime)
        throws IOException;

    void close();
This method overrides AutoCloseable.close; it is advisable to add an Override annotation.
Description
This PR implements disk usage statistics collection at the table level (table model) and at the device level (tree model). It adds the necessary data structures, background tasks, and read APIs to compute and expose disk usage metrics used by monitoring, admission control, and operational tooling.
Tree Model (No Cache)
Implements ShowDiskUsageNode and ShowDiskUsageOperator.
• Disk usage is calculated by scanning the relevant TsFiles at query time.
• Supports:
  • Path pattern matching
  • Time partition filtering
  • Existing SQL semantics (SHOW DISK_USAGE)
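As a hedged illustration of the on-demand tree-model path, the sketch below issues the statement over JDBC. The driver class, connection URL, credentials, and the root.** path-pattern clause are assumptions; this description only confirms the SHOW DISK_USAGE statement itself.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class ShowDiskUsageExample {
  public static void main(String[] args) throws Exception {
    // Assumed driver/URL/credentials for a local IoTDB instance.
    Class.forName("org.apache.iotdb.jdbc.IoTDBDriver");
    try (Connection conn =
            DriverManager.getConnection("jdbc:iotdb://127.0.0.1:6667/", "root", "root");
        Statement stmt = conn.createStatement();
        // The path-pattern clause is hypothetical; disk usage is computed by
        // scanning the relevant TsFiles at query time (no cache for the tree model).
        ResultSet rs = stmt.executeQuery("SHOW DISK_USAGE root.**")) {
      while (rs.next()) {
        // The column layout of the result set is not specified in this description.
        System.out.println(rs.getString(1) + "\t" + rs.getString(2));
      }
    }
  }
}
```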
Table Model (With Cache)
The Table Model introduces a dedicated disk usage cache:
• TableDiskUsageCache manages all cache operations.
• A single-threaded background worker processes write, read, and maintenance tasks via an operation queue (see the sketch after the Cached Data list).
• Cache state persists across restarts.
Cached Data
• TsFile-level table size statistics
• Object file size deltas, recorded incrementally
• Periodic snapshot + delta compaction
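The single-threaded worker called out above could be organized roughly as sketched below; the class, interface, and thread name are illustrative and not the actual TableDiskUsageCache API.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// Illustrative sketch of one worker thread draining an operation queue;
// not the actual TableDiskUsageCache implementation.
public class DiskUsageCacheWorkerSketch {
  /** One queued cache operation: a write, a read, or a maintenance step. */
  interface CacheOperation {
    void run() throws Exception;
  }

  private final BlockingQueue<CacheOperation> queue = new LinkedBlockingQueue<>();
  private volatile boolean running = true;

  public void submit(CacheOperation op) {
    queue.add(op);
  }

  public void start() {
    Thread worker =
        new Thread(
            () -> {
              while (running) {
                try {
                  // Writes, reads, and compaction/maintenance all run on this one
                  // thread, so cache state never needs fine-grained locking.
                  queue.take().run();
                } catch (InterruptedException e) {
                  Thread.currentThread().interrupt();
                  return;
                } catch (Exception e) {
                  // A real implementation would log and keep the worker alive.
                }
              }
            },
            "table-disk-usage-cache-worker");
    worker.setDaemon(true);
    worker.start();
  }

  public void stop() {
    running = false;
  }
}
```

Serializing all cache operations onto one thread is a plausible way to let writes, reads, incremental delta recording, and the periodic snapshot + delta compaction share state without locking, at the cost of some queueing latency.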
Query Integration
• Exposes statistics via information_schema.table_disk_usage.
• Supports:
  • Predicate push-down (except on aggregated size columns)
  • Limit / offset
  • Parallel region-level scanning
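A hedged sketch of reading the new information schema table over JDBC; the filter column name in the WHERE clause is hypothetical, and the connection details repeat the assumptions of the earlier sketch.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class TableDiskUsageQueryExample {
  public static void main(String[] args) throws Exception {
    // Assumed URL/credentials; selecting a table-model dialect on the connection
    // may be required, which is not covered by this description.
    try (Connection conn =
            DriverManager.getConnection("jdbc:iotdb://127.0.0.1:6667/", "root", "root");
        Statement stmt = conn.createStatement();
        ResultSet rs =
            stmt.executeQuery(
                // "database" is a hypothetical filter column; the PR states that
                // predicates (except on aggregated size columns) and LIMIT/OFFSET
                // are pushed down into the information schema scan.
                "SELECT * FROM information_schema.table_disk_usage "
                    + "WHERE database = 'db1' LIMIT 10")) {
      while (rs.next()) {
        System.out.println(rs.getString(1));
      }
    }
  }
}
```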