Skip to content

Consider moving datasets out of the core python library #40

@tpgillam

Description

@tpgillam

I very much appreciate that moocore has a narrow set of functionality with minimal dependencies.

I'm wondering if the _datasets module, and corresponding data directory, could or should be moved into the examples, and hence not bundled in the wheel. On MacOS at least, this would remove between 1/4 and 1/3 of the installed size-on-disk. For most practical uses of this package that I can imagine, end-users will be solving their own problems rather than examples.

Also, moocore's dependency tree is currently:

├── moocore v0.1.10
│   ├── cffi v1.17.1
│   │   └── pycparser v2.22
│   ├── numpy v2.3.2
│   └── platformdirs v4.3.8

The platformdirs package is used in exactly one place, for caching datasets in _datasets. So this dependency can also be removed if removing datasets.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions