Add sort before `groupby` by denini08 · Pull Request #31 · Toilal/rebulk

denini08 · 2026-01-24T23:45:34Z

Hello!

While reading the code and running some tests, I noticed a small detail that could cause unexpected behavior when the input list is not ordered. The itertools.groupby() function assumes the data is already sorted by the key (docs), but in _group_by_match_index this was not guaranteed.

So I added a sorted() call before the groupby() to make the behavior safe and consistent, regardless of the input order.

Interestingly, the rest of the codebase already follows this same pattern:

rules.py:300 - Uses sorted() before groupby()
match.py:62 - Maintains proper ordering
match.py:71 - Maintains proper ordering

I also added a small test that explicitly checks the behavior with unsorted input data, just to make sure this case is always covered.

Thank youfor the project!
I hope this improvement is helpful.

sort before groupby

9f93bce

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sort before `groupby`#31

Add sort before `groupby`#31
denini08 wants to merge 1 commit intoToilal:developfrom
denini08:develop

denini08 commented Jan 24, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

denini08 commented Jan 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

denini08 commented Jan 24, 2026 •

edited

Loading