Skip to content

[ENG-11010] Registrations continue to get stuck archiving#11728

Open
antkryt wants to merge 1 commit intoCenterForOpenScience:feature/pbs-26-9from
antkryt:fix/ENG-10575
Open

[ENG-11010] Registrations continue to get stuck archiving#11728
antkryt wants to merge 1 commit intoCenterForOpenScience:feature/pbs-26-9from
antkryt:fix/ENG-10575

Conversation

@antkryt
Copy link
Copy Markdown
Contributor

@antkryt antkryt commented May 6, 2026

Ticket

Purpose

Retry if archive job is missing

Changes

Side Effects

QE Notes

CE Notes

Documentation

Comment thread website/archiver/tasks.py
sentry.log_exception(archiver_state_exc)
raise archiver_state_exc
job_pk = kwargs.get('job_pk')
job = self.load_archive_job(job_pk, retry_if_missing=False, task_id=task_id, kwargs=kwargs)
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just to clarify:
why should it be retry_if_missing=False but not somehow depend on max_retries and retry attempts left

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This function is running after the Celery task failure, so we just try to get archive job (it either available or not). General idea looks like this:
ArchiveTask tries to get the ArchiveJob with retries (currently ArchiveTasks are retried ONLY if ArchiveJob is missing, because autoretry_for is None) -> after max retries we raise error, so ArchiveTask.on_failure is called and try to get ArchiveJob to create exception context later

@mkovalua
Copy link
Copy Markdown
Contributor

mkovalua commented May 8, 2026

LGTM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants