Add measures pages, Stats API and dataset analysis improvements#379
Open
HelderMendes wants to merge 6 commits intoopenml:masterfrom
Open
Add measures pages, Stats API and dataset analysis improvements#379HelderMendes wants to merge 6 commits intoopenml:masterfrom
HelderMendes wants to merge 6 commits intoopenml:masterfrom
Conversation
…ature distribution logic - Add pagination to Features tab (50 per page) with grid/list view toggle - Add pagination to Distribution tab feature selector (50 per page) - Make correlation heatmap theme-aware (dark/light mode font + grid colors) - Use transparent background for correlation colorscale midpoint - Show nominal distributions from metadata for large datasets, "coming soon" for numeric - Default feature selection: target (if nominal) + up to 5 numeric features - Remove global "coming soon" block from correlation (always show) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Enhance dataset analysis: pagination, theme-aware correlation, per-feature distribution logic#
…tems doc - Fix parquet-wasm: call .intoIPCStream() before tableFromIPC() (fixes metadata.map error) - Bump MAX_PARQUET_SIZE from 5MB to 10MB (allows datasets like 1590) - Add dark/light mode styling to distribution plots (font, grid colors) - Add loading state and fallback message to correlation heatmap - Add OPEN_ITEMS.md documenting outstanding items for team discussion Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
The Stats API was incorrectly disabled for huge datasets (>5GB) by passing !isHugeDataset as the enabled parameter. The Stats API was specifically designed to handle large datasets server-side, so it should always be enabled. This fixes the issue where large datasets were falling back to old architecture (parquet download or Dash iframe) instead of using the new efficient Stats API. Co-Authored-By: Claude <noreply@anthropic.com> enable Stats API for all dataset sizes (initial implementation)#
- Add useDatasetStats hook for fetching pre-computed statistics - Add Next.js API proxy route at /api/datasets/[id]/stats - Required for Vercel deployment and Stats API functionality Co-Authored-By: Claude <noreply@anthropic.com>
Contributor
Author
|
5 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Changes
app-next/src/app/[locale]/(explore)/measures/— new measures pagesapp-next/src/components/measure/— new measure components (header, search, stats, analysis)app-next/src/app/api/datasets/[id]/stats/route.ts— Stats API routeapp-next/src/hooks/useDatasetStats.ts— Stats API hookapp-next/src/components/dataset/data-analysis-section.tsx— analysis improvementsapp-next/src/components/benchmark/andcollection/— navigation and section componentsNotes
Test plan
This is Work in progress