Skip to content

FEAT - adding a heuristic for parsing units in string columns#1726

Draft
gabrielapgomezji wants to merge 2 commits intoskrub-data:mainfrom
gabrielapgomezji:1542-Transformer-parse
Draft

FEAT - adding a heuristic for parsing units in string columns#1726
gabrielapgomezji wants to merge 2 commits intoskrub-data:mainfrom
gabrielapgomezji:1542-Transformer-parse

Conversation

@gabrielapgomezji
Copy link
Copy Markdown
Contributor

@gabrielapgomezji gabrielapgomezji commented Nov 3, 2025

Fixes #1542

@gabrielapgomezji gabrielapgomezji changed the title pushing changes FEAT - adding a heuristic for parsing units in string columns Nov 3, 2025
@gabrielapgomezji gabrielapgomezji marked this pull request as draft November 3, 2025 13:18
@rcap107
Copy link
Copy Markdown
Member

rcap107 commented Nov 3, 2025

This issue was discussed IRL during the WiMLDS sprint at Probabl on oct 28th

Copy link
Copy Markdown
Member

@rcap107 rcap107 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Blocked by #1772 (parsing the numerical part will be much simpler after that PR is merged)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add a transformer that uses heuristics to parse string columns that contain mixed strings like 10$/100 kg

3 participants