Skip to content

Conversation

@shiyasmohd
Copy link
Contributor

@shiyasmohd shiyasmohd commented Jan 16, 2026

  • Extract amp-providers-registry crate from amp-dataset-store
  • Update admin-api to use DatasetsRegistry directly for registry operations
  • Introduce Dataset trait
    • Introduces a Dataset trait in datasets-common
    • Add DatasetWithFunctions trait for UDF supported datasets
    • Moves JsUdf from common to js-runtime
    • Adds udf module re-exports in datasets-common
    • Updates all extractors to implement the new Dataset trait
    • Updates imports across the codebase

Closes #1558

@shiyasmohd shiyasmohd changed the title refactor(dataset-store): extract amp-providers-registry crate refactor(dataset-store): extract amp-providers-registry crate Jan 16, 2026
@shiyasmohd shiyasmohd self-assigned this Jan 16, 2026
@shiyasmohd shiyasmohd requested a review from LNSD January 16, 2026 08:05
@shiyasmohd shiyasmohd force-pushed the shiyasmohd/amp-providers-registry branch 3 times, most recently from 347801d to 275a524 Compare January 18, 2026 10:37
@shiyasmohd shiyasmohd force-pushed the shiyasmohd/amp-providers-registry branch from 275a524 to 2e76ee7 Compare January 18, 2026 19:17
@shiyasmohd shiyasmohd marked this pull request as draft January 18, 2026 19:18
@shiyasmohd shiyasmohd marked this pull request as ready for review January 19, 2026 06:26
@shiyasmohd
Copy link
Contributor Author

@LNSD PR is ready for review now.

Copy link
Contributor

@LNSD LNSD left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a huge PR. It is too much risk to merge it as is.

Can you split it into two separate PRs?

  • PR 1: the new providers-registry crate + the raw dataset kind type in datasets-common
  • PR 2: the dataset trait and all the associated changes.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we remove this Dataset type?

Comment on lines +72 to +75
pub fn dataset(
reference: HashReference,
manifest: Manifest,
) -> impl datasets_common::dataset::Dataset {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The dataset() fn should return the extractor-specific dataset type, not the impl trait

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

refactor(dataset-store): extract amp-providers-registry crate

3 participants