Summary
Add a prep recipe engine for inspectable, reversible data-cleaning and reshaping operations that AI can suggest but not execute opaquely.
Problem
If datasight adds AI-assisted cleaning, it needs a constrained execution model. Free-form AI SQL or code is hard to trust, explain, preview, and export.
Proposed scope
- Define a versioned prep recipe format.
- Support an initial set of operations such as:
- unpivot / melt
- cast / rename
- split composite columns
- fill missing timestamps
- resample grain
- interpolate numeric values
- forward-fill selected fields
- Provide preview and apply flows.
- Allow AI to propose recipes or recipe fragments, with the system validating and executing them.
CLI sketch
datasight prep suggest
datasight prep apply recipe.yaml
Acceptance criteria
- There is a concrete recipe schema with versioning.
- Recipes can be previewed before apply.
- The first supported operations cover untidy reshaping and time-series gap handling.
- AI-generated prep suggestions compile to the same recipe model used by deterministic workflows.
- Tests cover recipe validation and execution for core operations.
Notes
- This is a foundational issue for guided data cleaning and untidy-data remediation.
- Keeping this layer inspectable is more important than making the first version broad.
Summary
Add a prep recipe engine for inspectable, reversible data-cleaning and reshaping operations that AI can suggest but not execute opaquely.
Problem
If datasight adds AI-assisted cleaning, it needs a constrained execution model. Free-form AI SQL or code is hard to trust, explain, preview, and export.
Proposed scope
CLI sketch
Acceptance criteria
Notes