Feature Details
Design and implement a stateful feature store that (a) caches per-ticker feature artifacts (lags, masks, embeddings indices, scalers’ states, selected lags, etc.), and (b) supports incremental updates as new observations arrive, without recomputing the full history. The store should be durable (disk-backed), versioned, and safe to read concurrently during training.
Core goals:
-
$O(\Delta t)$ append-only updates for rolling windows and statistics.
- Deterministic, reproducible snapshots for training/checkpoint alignment.
- Clean invalidation when upstream raw data changes (cache busting via checksum/version).
Affected Modules
As stated in the parent issue.
Implementation Checklist
Limitations
As stated in the parent issue.
Feature Details
Design and implement a stateful feature store that (a) caches per-ticker feature artifacts (lags, masks, embeddings indices, scalers’ states, selected lags, etc.), and (b) supports incremental updates as new observations arrive, without recomputing the full history. The store should be durable (disk-backed), versioned, and safe to read concurrently during training.
Core goals:
Affected Modules
As stated in the parent issue.
Implementation Checklist
• Cold start → fit state → update with Δt > 0 → outputs match full recompute on the same window.
• Concurrency: writer during reader; readers see consistent snapshots.
• Invalidation: modify raw data → state invalidates and recomputes.
• Serialization parity across platforms (endian, dtype, version).
Limitations
As stated in the parent issue.