LAVIS - A One-stop Library for Language-Vision Intelligence
-
Updated
Nov 18, 2024 - Jupyter Notebook
LAVIS - A One-stop Library for Language-Vision Intelligence
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
An Embedded Computer Vision & Machine Learning Library (CPU Optimized & IoT Capable)
A curated collection of iOS, ML, AR resources sprinkled with some UI additions
iOS OCR Server, using Apple's Vision Framework API.
An open-source computer vision framework to build and deploy apps in minutes
Orchestrate zero-shot computer vision models
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
Face Detection with CoreML
An example of use a Vision framework for face landmarks detection in iOS 11
Vision Framework Demo on Text Detection
Try CoreML models on multiple images and videos easily and quickly
Camera preview and barcode scanner for .NET MAUI apps
PoseOSC + FaceOSC + HandOSC + OcrOSC + CatOSC + DogOSC
Simple document scanner built with the Apple's Vision framework
A scene text recognition demo app using Vision framework and tesseract
A CLI OCR tool, using Apple's Vision Framework API. (Supports macOS 13.0+)
Object Tracking using Apple's VISION Framework
🖐 Memory game with hand gesture recognition that will keep your brain in a good shape!
A demo app to explore how various parameter settings of VNDetectRectanglesRequest effect observation output.
Add a description, image, and links to the vision-framework topic page so that developers can more easily learn about it.
To associate your repository with the vision-framework topic, visit your repo's landing page and select "manage topics."