GDocZ is an intelligent document processing platform that combines multi-engine OCR (OlmOCR, Qwen VL, Gemini, Chandra) with AI-powered data extraction. It transforms complex documents—invoices, contracts, forms, receipts, multi-page reports, and more—into structured, actionable data.
From raw PDFs and images → clean JSON with intelligent schema validation
No complex integrations. No fragile CSS selectors. No maintenance nightmares. 🚀
💡 Why Choose GDocZ?
🎯 Multi-Model OCR Harmony
Intelligently routes documents to the best-fit engine (OlmOCR for speed, Qwen for charts, Gemini for complex layouts) based on document size and complexity.
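The routing idea can be sketched roughly as below. This is a hypothetical illustration, not GDocZ's actual router: `DocumentProfile`, its fields, and `choose_engine` are names invented for this example. Only the engine roles and the 30K-character threshold (mentioned under Large Document Handling) come from this README.

```python
from dataclasses import dataclass

# The README mentions auto-routing to Gemini above 30K characters;
# treating it as a hard cut-off here is an assumption.
LARGE_DOC_CHARS = 30_000


@dataclass
class DocumentProfile:
    """Illustrative summary of a document (field names are assumptions)."""
    char_count: int
    has_charts: bool = False
    complex_layout: bool = False


def choose_engine(doc: DocumentProfile) -> str:
    """Pick an engine name using the roles described in this README."""
    if doc.char_count > LARGE_DOC_CHARS or doc.complex_layout:
        return "gemini"   # complex layouts and very large documents
    if doc.has_charts:
        return "qwen"     # chart-heavy documents
    return "olmocr"       # fast default


print(choose_engine(DocumentProfile(char_count=1_200)))
```

A real router would likely weigh more signals (page count, image quality, cost), so treat the three-way branch as a starting point only.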
📄 Cross-Page Intelligence
Extract data that spans multiple pages: invoices split across sheets, contracts with continuation pages, multi-page forms. GDocZ understands document flow.
🧠 Schema-Driven Extraction
Define what you want to extract once using intuitive schema definitions. Supports nested objects, arrays, and complex hierarchies.
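To make "nested objects, arrays, and complex hierarchies" concrete, here is a minimal sketch of what a schema and a validation pass could look like. The schema format shown (plain Python dicts, lists, and types) and the `validate` helper are assumptions made for illustration; GDocZ's real schema syntax may differ.

```python
from typing import Any, List

# Hypothetical schema: leaf values are Python types, dicts are nested objects,
# and a one-element list describes an array of items.
INVOICE_SCHEMA = {
    "invoice_number": str,
    "vendor": {"name": str, "tax_id": str},
    "line_items": [{"description": str, "amount": float}],
}


def validate(data: Any, schema: Any, path: str = "$") -> List[str]:
    """Return a list of mismatches between extracted data and the schema."""
    if isinstance(schema, dict):
        if not isinstance(data, dict):
            return [f"{path}: expected object"]
        errors: List[str] = []
        for key, sub_schema in schema.items():
            if key not in data:
                errors.append(f"{path}.{key}: missing")
            else:
                errors.extend(validate(data[key], sub_schema, f"{path}.{key}"))
        return errors
    if isinstance(schema, list):
        if not isinstance(data, list):
            return [f"{path}: expected array"]
        errors = []
        for index, item in enumerate(data):
            errors.extend(validate(item, schema[0], f"{path}[{index}]"))
        return errors
    if not isinstance(data, schema):
        return [f"{path}: expected {schema.__name__}"]
    return []
```

Calling `validate(extracted, INVOICE_SCHEMA)` returns an empty list when the extracted JSON matches, and one message per missing or mistyped field otherwise.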
📌 GPU Support (Optional)
NVIDIA/AMD for acceleration • Apple Silicon for MLX • CPU-only possible with smaller models
🔑 Required Credentials
🧠 OCR API Keys
DeepInfra Key (OlmOCR, Qwen)
Gemini API Key (Google)
📧 Email Config (SMTP/IMAP)
Gmail / Hostinger / Custom
OAuth2 Support
App passwords
☁️ AWS S3 (Optional)
Access Key ID
Secret Access Key
Bucket Name
⚙️ Installation & Setup
📌 Estimated Time: ~5-10 minutes for basic setup, ~20 minutes with optional services
Step 1️⃣ Clone the Repository
git clone https://github.com/your-org/gdocz-ai.git
cd gdocz-ai
Step 2️⃣ Create & Activate Virtual Environment
# Using Python venv (recommended)
python -m venv mineru_env
# Activate on Windows
mineru_env\Scripts\activate
# Activate on Linux/Mac
source mineru_env/bin/activate
Verify API keys: echo $DEEPINFRA_API_KEY
Check logs: tail -f logs/olmocr_server.log
Test connection: curl -H "Authorization: Bearer KEY" https://api.deepinfra.com/v1/status
📧 Email Not Working
Status: curl -X GET http://localhost:8000/connector/smtp/status
Gmail? Use App Password, not regular password
Outlook? Use an App Password or OAuth2 ("Less secure app access" was a Google account setting, not an Outlook one)
💾 Storage Issues
Local: ls -la data/storage/stored_documents/
S3: aws s3 ls --profile default
🔒 Security & Best Practices
✅ Secrets Management (DO's)
- Store credentials in .env (NEVER in git)
- Use strong, unique API keys
- Rotate credentials regularly
- Enable HTTPS in production
- Use environment-specific configs
- Monitor access logs
❌ Security Anti-Patterns (DON'Ts)
- Commit .env to version control
- Hardcode credentials in code
- Share API keys in logs
- Use weak passwords
- Store secrets in comments
- Log sensitive data
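The first and last points (keep credentials in the environment, keep them out of logs) can be sketched as follows. `DEEPINFRA_API_KEY` appears in the troubleshooting tips above; `GEMINI_API_KEY` and both helper functions are assumptions made for this example.

```python
import os

# GEMINI_API_KEY is an assumed variable name; adjust to match your .env file.
REQUIRED_VARS = ("DEEPINFRA_API_KEY", "GEMINI_API_KEY")


def load_secrets(env=os.environ) -> dict:
    """Read required credentials from the environment, failing fast if any is missing."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in REQUIRED_VARS}


def mask(secret: str, visible: int = 4) -> str:
    """Return a log-safe form of a secret, showing only the last few characters."""
    if len(secret) <= visible:
        return "*" * len(secret)
    return "*" * (len(secret) - visible) + secret[-visible:]
```

Failing at startup when a variable is missing tends to be easier to debug than a failed OCR call later, and logging `mask(key)` instead of the key itself keeps secrets out of log files.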
🔐 Database Security
-- 👤 Create restricted database user
CREATE USER gdocz_app WITH ENCRYPTED PASSWORD 'your_strong_password_here';
GRANT CONNECT ON DATABASE gdocz_db TO gdocz_app;
GRANT USAGE ON SCHEMA public TO gdocz_app;
GRANT SELECT, INSERT, UPDATE, DELETE ON ALL TABLES IN SCHEMA public TO gdocz_app;
-- 🔒 Enable SSL connections
-- Edit postgresql.conf: ssl = on
🛡️ API Authentication Headers
| Feature | Details |
|---|---|
| Token Expiry | 24 hours |
| Refresh | Use /auth/token/refresh before expiry |
| API Keys | For programmatic access (more secure for automation) |
| Rate Limiting | Recommended for production |
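Since tokens expire after 24 hours and should be refreshed before expiry, a client needs some rule for when to call /auth/token/refresh. A minimal sketch of that timing check follows; the 30-minute safety margin and the `needs_refresh` helper are assumptions, and the request and response format of the refresh endpoint is not shown because this README does not document it.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

TOKEN_LIFETIME = timedelta(hours=24)    # expiry stated in this README
REFRESH_MARGIN = timedelta(minutes=30)  # assumption: refresh this long before expiry


def needs_refresh(issued_at: datetime, now: Optional[datetime] = None) -> bool:
    """Return True once the token is within REFRESH_MARGIN of its expiry."""
    if now is None:
        now = datetime.now(timezone.utc)
    return now >= issued_at + TOKEN_LIFETIME - REFRESH_MARGIN
```

A client could call `needs_refresh(issued_at)` before each request and hit the refresh endpoint only when it returns `True`.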
📊 Performance Optimization
⚡ Large Document Handling
| Strategy | Benefit | When to Use |
|---|---|---|
| 🧠 Gemini 2.5 Flash | Auto-routes for >30K chars | Large contracts, reports |
| ✂️ Manual Splitting | Control chunk boundaries | Over-sized PDFs |
| 🚀 Batch Processing | Parallel OCR (3-5 pages) | High-volume workloads |
| ☁️ S3 Storage | Scale-on-demand | Enterprise deployments |
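The batch-processing strategy (a few pages in flight at once) could look roughly like this. It is a sketch under stated assumptions: `ocr_page` is a stand-in for a real OCR call, and `process_document` is a hypothetical helper, not part of GDocZ's API.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List


def ocr_page(page: str) -> str:
    """Stand-in for a real OCR call (hypothetical); here it only upper-cases text."""
    return page.upper()


def process_document(pages: List[str], max_parallel: int = 5) -> List[str]:
    """OCR pages concurrently, with at most max_parallel pages in flight."""
    with ThreadPoolExecutor(max_workers=max_parallel) as pool:
        # Executor.map returns results in input order, so page order is preserved.
        return list(pool.map(ocr_page, pages))


print(process_document(["page one", "page two", "page three"], max_parallel=3))
```

Threads suit this case because remote OCR calls are I/O-bound; capping `max_parallel` at 3 to 5 matches the range in the table and helps stay under provider rate limits.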
🗄️ Database Optimization
-- ⚡ Add indexes for common queries
CREATE INDEX idx_job_status ON jobs(status);
CREATE INDEX idx_user_id ON documents(user_id);
CREATE INDEX idx_created_date ON documents(created_at);
-- 📊 Analyze query performance
EXPLAIN ANALYZE SELECT * FROM documents WHERE status = 'completed';
# 1. Fork the repository (on GitHub)
# 2. Clone your fork
git clone https://github.com/your-username/gdocz-ai.git
cd gdocz-ai
# 3. Create feature branch
git checkout -b feature/YourFeature
# 4. Setup dev environment
python -m venv venv
source venv/bin/activate # or venv\Scripts\activate (Windows)
pip install -r requirements.txt
pip install pytest pytest-cov black flake8
# 5. Make your changes with clear commits
git commit -m "feat: add your feature description"
# 6. Run quality checks
black src/
flake8 src/
pytest tests/ -v
# 7. Push to your fork
git push origin feature/YourFeature
# 8. Open Pull Request on GitHub
📋 Code Style Guidelines
✅ Follow PEP 8 (use black for formatting)
✅ Add docstrings to all functions
✅ Include type hints
✅ Write tests for new features
✅ Update documentation if needed
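As an illustration of these guidelines, a contribution might look like the following. The function itself is a made-up example (it is not part of the GDocZ codebase); it simply shows a docstring, type hints, and a pytest-style test together.

```python
def normalize_amount(raw: str) -> float:
    """Convert a currency string such as "$1,234.50" to a float.

    Args:
        raw: Amount text as it might appear in OCR output.

    Returns:
        The numeric value.

    Raises:
        ValueError: If no numeric value can be parsed.
    """
    cleaned = raw.strip().replace(",", "").lstrip("$€£")
    return float(cleaned)


def test_normalize_amount() -> None:
    """pytest-style test covering the common case."""
    assert normalize_amount(" $1,234.50 ") == 1234.5
```

Running `pytest tests/ -v` would pick up any function named `test_*` in files under `tests/`, and `black` would leave this snippet unchanged.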
MIT License
Copyright (c) 2026 Gramosoft Private Limited
Permission is hereby granted, free of charge, to any person
obtaining a copy of this software...
🙏 Acknowledgments
GDocZ stands on the shoulders of incredible open-source projects and the amazing developer community.
📅 Last Updated: March 23, 2026 📦 Version: 5.0.0 ⚡ Built in 🔥
About
GDocZ by Gramosoft is an open-source Intelligent Document Processing platform that turns raw PDFs and images into clean, structured JSON — powered by multi-engine OCR and AI-driven schema extraction.