mem: reduce PaddleOCR rec_batch_num from 6 to 1#4295

Open
KRRT7 wants to merge 6 commits into Unstructured-IO:main from KRRT7:mem/paddle-rec-batch-num

Conversation

@KRRT7
Collaborator

KRRT7 commented Mar 24, 2026

Reduce PaddleOCR rec_batch_num from 6 (the default) to 1. Paddle's native inference engine allocates 500 MiB memory arena chunks in proportion to the recognition batch size. With rec_batch_num=6, four chunks are allocated during text recognition; setting it to 1 reduces this to a single chunk.

Benchmark

| Setting | Peak memory |
| --- | --- |
| rec_batch_num=6 | 7,184 MiB |
| rec_batch_num=1 | 2,684 MiB |
| Delta | -4,500 MiB (-62.6%) |
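The delta row follows directly from the two measurements; a quick arithmetic check:

```python
# Sanity-check the peak-memory delta quoted in the table above.
baseline_mib = 7184   # peak with rec_batch_num=6
optimized_mib = 2684  # peak with rec_batch_num=1
delta_mib = baseline_mib - optimized_mib
pct = round(100 * delta_mib / baseline_mib, 1)
assert delta_mib == 4500
assert pct == 62.6
```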

Measured with `memray run` on layout-parser-paper-with-table.pdf, processed through `partition()` with hi_res + PaddleOCR table OCR. On CPU, batch processing doesn't parallelize; it runs sequentially within `predictor.run()`. Smaller batches simply allocate less workspace memory.
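Since CPU recognition consumes batches one after another, rec_batch_num only controls how many text crops share one workspace at a time, not how many run in parallel. A minimal pure-Python sketch of that batching (the 55-region count is taken from the latency benchmark in the commit message below):

```python
import math

def batches(items, batch_size):
    """Split items into consecutive batches, mirroring how a CPU
    recognizer consumes text regions sequentially, batch by batch."""
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]

regions = list(range(55))  # 55 text regions, as in the latency benchmark
assert len(batches(regions, 6)) == math.ceil(55 / 6)  # 10 batches; workspace sized for 6
assert len(batches(regions, 1)) == 55                 # 55 batches; workspace sized for 1
```

Either way every region is processed exactly once; only the per-batch workspace size changes.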

Reproduce

Requires unstructured[pdf], paddlepaddle, unstructured-paddleocr, and memray.
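The dependencies listed above can be installed in one step (package names as listed; exact versions may vary by platform):

```shell
pip install "unstructured[pdf]" paddlepaddle unstructured-paddleocr memray
```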

cat > /tmp/bench_paddle.py << 'SCRIPT'
from unstructured.partition.auto import partition
elements = partition(
    filename="example-docs/layout-parser-paper.pdf",
    strategy="hi_res",
    pdf_infer_table_structure=True,
    ocr_agent="unstructured.partition.utils.ocr_models.paddle_ocr.OCRAgentPaddle",
)
print(f"Partitioned: {len(elements)} elements")
SCRIPT

# Baseline (main branch, rec_batch_num=6):
git checkout main
memray run --native --trace-python-allocators -o /tmp/paddle_baseline.bin /tmp/bench_paddle.py
memray stats /tmp/paddle_baseline.bin | grep "Peak memory"

# With this change (rec_batch_num=1):
git checkout mem/paddle-rec-batch-num
memray run --native --trace-python-allocators -o /tmp/paddle_opt.bin /tmp/bench_paddle.py
memray stats /tmp/paddle_opt.bin | grep "Peak memory"

KRRT7 added 6 commits March 19, 2026 09:48
Paddle's native inference engine allocates 500 MiB memory arena chunks
during text recognition, proportional to batch size. With the default
rec_batch_num=6, four 500 MiB chunks are allocated simultaneously.

Setting rec_batch_num=1 reduces this to a single chunk, cutting peak
memory on the PaddleOCR code path by ~1,265 MiB (-42.6%).

Latency benchmark (55 text regions, CPU, 5 runs):
- rec_batch_num=6: 39.1s +/- 3.5s
- rec_batch_num=1: 37.0s +/- 2.0s
No throughput regression — on CPU, batch processing is sequential.