cpu-only
Here are 22 public repositories matching this topic...
🦙 chat-o-llama: A lightweight, modern web interface for AI conversations with support for both Ollama and llama.cpp backends. Features persistent conversation management, real-time backend switching, intelligent context compression, and a clean responsive UI.
Updated Dec 10, 2025 - Python
An LLM-based content moderator: a Firefox extension that blocks webpages unrelated to work, based on page title and URL. It uses local LLMs via Ollama and LangChain so that your browsing history never leaves your device. Google Gemini is also supported.
Updated Dec 12, 2024 - Python
A high-performance Python library for extracting structured content from PDF documents with layout-aware text extraction. pdf_to_json preserves document structure including headings (H1-H6) and body text, outputting clean JSON format.
Updated Jan 6, 2026 - Python
Updated Mar 10, 2026 - Shell
Image classification with on-device inference, built with Flutter. The AI model runs on the mobile CPU.
Updated Jan 29, 2025 - Dart
Ternsig Virtual Mainframe Runtime (TVMR) — extensible VM with 10 standard extensions (121 instructions), Signal ISA, mastery learning, hot-reload firmware, and thermogram persistence.
Updated Mar 14, 2026 - Rust
Cloud transcription stores your audio. Local alternatives need a GPU. Whiscribe runs on CPU, in your browser, with one Python script.
Updated Mar 23, 2026 - Python
Face locking system built on ArcFace (ONNX) and 5-point alignment that recognizes a selected identity, locks onto it, tracks facial actions, and records behavior over time.
Updated Feb 6, 2026 - Python
CPU-friendly experience-based reasoning framework combining meta-learning (MAML), state space models (SSM), and memory buffers for fast few-shot adaptation. Pure NumPy implementation for edge devices and low-compute environments.
Updated Oct 23, 2025 - Python
Real-time sign language detection & translation using MediaPipe, LSTM, and Gemini 2.5 Flash — with WebSocket streaming, TTS audio output, and a Next.js frontend. CPU-only, low-spec friendly.
Updated Mar 17, 2026 - Python
A new one-shot face swap approach for image and video domains, in a version tailored to run on CPU.
Updated Aug 20, 2024 - Python
Pre-built Llama-CPP Wheel for HF Spaces (Python 3.13)
Updated Mar 10, 2026
Probabilistic Signed Distance Fusion with View Planning on CPU
Updated Mar 6, 2026 - Python
One-line TTS for Python. Real speech on any CPU. No GPU, no cloud, no API keys.
Updated Mar 20, 2026 - Python
CPU-optimized RAG pipeline reducing latency 2.7× (247ms → 92ms). Implements caching, filtering, quantization for production. Complete with FastAPI, Docker, benchmarks, investor materials. The engineering showcase that sells itself.
Updated Jan 24, 2026 - Python
Train AI together — any device, any hardware. Federated fine-tuning with LISA layer selection.
Updated Mar 24, 2026 - Python
CPU-only RAG stack: PDFs→Docling→Ollama→pgvector. Windows/macOS/Linux. Docker Compose. Graph-aware code search + scanned PDF OCR.
Updated Mar 21, 2026 - Python
A lightweight reproduction and analysis inspired by recent work on presentation-aware deepfake / spoofing detection, with a focus on codec-induced presentation mismatch (AMR) under CPU-only constraints.
Updated Mar 9, 2026 - Python