Add Qianfan-OCR and dots.mocr to model registry #16

Merged
davanstrien merged 1 commit into main from add-qianfan-dots-mocr-models on Mar 23, 2026

@davanstrien
Owner

Summary

  • Register Qianfan-OCR (baidu/Qianfan-OCR, 4.7B): #1 on OmniDocBench v1.5, 192 languages, Layout-as-Thought
  • Register dots.mocr (rednote-hilab/dots.mocr, 3B): upgraded dots.ocr with 8 prompt modes (layout, SVG, scene spotting, etc.)
  • Both are opt-in via --models qianfan-ocr dots-mocr; DEFAULT_MODELS is unchanged
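
For readers unfamiliar with the opt-in pattern described above, here is a minimal sketch of how a registry with a separate default list might behave. It is not the code from this PR: `ModelEntry`, `MODEL_REGISTRY`, `resolve_models`, `parse_models`, and the `example-default` entry are hypothetical names introduced for illustration, and parsing `--models` with `nargs="+"` is an assumption based on the space-separated usage shown in the summary. Only the two slugs, their repo ids, their sizes, and the `DEFAULT_MODELS` name come from the PR.

```python
import argparse
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """Hypothetical registry record; the real registry's fields may differ."""
    repo_id: str  # Hugging Face repo id
    params: str   # parameter count as quoted in the PR summary


# Hypothetical registry keyed by CLI slug. The two new entries mirror the PR
# summary; "example-default" is a placeholder standing in for existing models.
MODEL_REGISTRY = {
    "example-default": ModelEntry("example-org/example-ocr", "n/a"),
    "qianfan-ocr": ModelEntry("baidu/Qianfan-OCR", "4.7B"),
    "dots-mocr": ModelEntry("rednote-hilab/dots.mocr", "3B"),
}

# Registering a model does not add it here, which is what keeps it opt-in.
DEFAULT_MODELS = ["example-default"]


def resolve_models(requested=None):
    """Return registry entries for the requested slugs, or the defaults."""
    slugs = list(requested) if requested else list(DEFAULT_MODELS)
    unknown = [s for s in slugs if s not in MODEL_REGISTRY]
    if unknown:
        raise ValueError(f"Unknown model slug(s): {', '.join(unknown)}")
    return [MODEL_REGISTRY[s] for s in slugs]


def parse_models(argv):
    """Parse a --models flag (assumed to take one or more slugs)."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--models", nargs="+", default=None)
    return resolve_models(parser.parse_args(argv).models)


if __name__ == "__main__":
    for entry in parse_models(["--models", "qianfan-ocr", "dots-mocr"]):
        print(entry.repo_id, entry.params)
```

Under these assumptions, a run without `--models` only touches the defaults, so adding entries to the registry cannot change existing benchmark runs.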

Tested

  • Smoke-tested both models on Britannica 1771 (10 samples each) via HF Jobs
    • qianfan-ocr: completed in 3.0 min
    • dots-mocr: completed in 4.6 min
  • Both pushed results successfully to davanstrien/ocr-bench-britannica

Test plan

  • ruff check passes
  • All 233 tests pass
  • Smoke test: both models run end-to-end on Britannica via HF Jobs

🤖 Generated with Claude Code

Register two new OCR models available in uv-scripts/ocr:
- Qianfan-OCR (4.7B): #1 on OmniDocBench v1.5, 192 languages
- dots.mocr (3B): upgraded dots.ocr with layout/SVG/8 prompt modes

Both are opt-in via --models flag; DEFAULT_MODELS unchanged.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@davanstrien merged commit 7314147 into main Mar 23, 2026
1 check passed
@davanstrien deleted the add-qianfan-dots-mocr-models branch March 23, 2026 14:42
@davanstrien mentioned this pull request Mar 23, 2026