Skip to content

Conversation

@johncalesp
Copy link
Contributor

In this PR I intent to add the brand field to the accuracy evaluation.
Since brand can be any string, I opted to use another package to perform the evaluation.
The library rapidfuzz helps compare strings and provide a numeric value based on a threshold. In comparisson using sklearn looks for exact string matches and If we have 1,000 different brands, sklearn treats this as a classification problem with 1,000 classes (multi-classification problem).

The evaluation now will look like this:

image

@johncalesp johncalesp requested a review from a team as a code owner December 6, 2025 01:06
@github-actions
Copy link
Contributor

github-actions bot commented Dec 6, 2025

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

@wangshangsam
Copy link
Contributor

@hanyunfan Could you help to take a look and merge this PR, please? Thanks so much!

@mrmhodak mrmhodak merged commit 95dfd4c into mlcommons:master Dec 9, 2025
13 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Dec 9, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants