Skip to content

fix(findsources): handle repos with many tags#217

Open
16Martin wants to merge 3 commits into
sw360:mainfrom
16Martin:main
Open

fix(findsources): handle repos with many tags#217
16Martin wants to merge 3 commits into
sw360:mainfrom
16Martin:main

Conversation

@16Martin
Copy link
Copy Markdown
Collaborator

fixes #206

The root cause are our current approaches for generating GitHub web and API calls. This patch does not attempt to fix this.

The ultimate current breaking point is the code block between the lines 263 and 287. It had/-s many issues which unfortunately cancel each other out just enough to appear reasonably stable. (#205 (comment))

In its current state, this code block's effectiveness is enhanced by its bugs. Resolving the bugs lowers the effectiveness and brings back the original performance problems. Until a sufficient amount of fixes are in place, all conceivable fixes essentially substitute one reason for timeout issues with another reason for timeout issues. Improvement vs regression is a function over API limits, network throughput and input dataset.

I therefore propose this bigger change which unfortunately bleeds into comparable_version.py.

It filters out more error messages, handles API results better and handles large scale repos better.

@16Martin
Copy link
Copy Markdown
Collaborator Author

This is v2.

The initially proposed version is already so large, it didn't make much sense anymore. This version is now now a more local implementation of get_new_prefixes and my most recent rewrite of the problematic code block, that is lines 263 and 289 on upstream main.

* rewrite of the block that tries to guess tag prefixes instead of
  iterating over pages
* introduce different gen_new_prefixes to keep fix local
@tngraf
Copy link
Copy Markdown
Collaborator

tngraf commented May 1, 2026

I fixed typos and flake8 issues.

@16Martin Please run mypy (or RunChecks.ps1) and fix all remaining issues.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Error when fetching a large GitHub API resource that offers no pagination

2 participants