Skip to content

feat(rag): parallelize MinIO payload fetches for citations and VLM#414

Closed
smasurekar wants to merge 1 commit intorelease-v2.4.0from
dev/smasurekar/2.4.0/minio-optim
Closed

feat(rag): parallelize MinIO payload fetches for citations and VLM#414
smasurekar wants to merge 1 commit intorelease-v2.4.0from
dev/smasurekar/2.4.0/minio-optim

Conversation

@smasurekar
Copy link
Copy Markdown
Collaborator

Parallelize MinIO payload fetches for citations and VLM

Parallelizes MinIO get_payload calls used when building citations and VLM citation images. Adds a shared utility fetch_minio_payloads_for_documents() in utils/common.py that fetches all required payloads in parallel (capped at 10 workers via MINIO_FETCH_MAX_WORKERS). Response generator (prepare_citations) and VLM (citation image assembly) now use this utility instead of sequential per-doc MinIO calls, reducing latency when many image/structured docs are present.

Checklist

  • I am familiar with the Contributing Guidelines.
  • All commits are signed-off (git commit -s) and GPG signed (git commit -S).
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • If adjusting docker-compose.yaml environment variables have you ensured those are mimicked in the Helm values.yaml file.

@smasurekar smasurekar added the enhancement New feature or request label Mar 10, 2026
@kumar-punit kumar-punit self-requested a review March 10, 2026 05:36
Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com>
@smasurekar smasurekar force-pushed the dev/smasurekar/2.4.0/minio-optim branch from a870598 to 1a82192 Compare March 10, 2026 06:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants