Releases: abra-code/AIChatApp
AIChat.app version 1.1
- Added a local model selector dialog that tries to find GGUF model files you may have already downloaded with HuggingFace or LM Studio.
- Added a HuggingFace model browser and downloader.
- Improved context size calculations. Added settings for a reduced KV cache size, which lets a larger context fit in the same amount of RAM.
- Added a 10-minute inactivity setting for llama-server that unloads the model and frees the memory. The app can now be left open in the background without keeping the RAM wired permanently.
- update llama-server to b9204
AIChat now requires macOS 14.6. The release is only for Apple Silicon Macs (arm64). Intel Mac support has been dropped because these machines cannot reach reasonable inference speeds.
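To illustrate the kind of search the local model selector performs, here is a minimal Python sketch, not the app's actual implementation. It walks a few candidate directories for `*.gguf` files and keeps only those that start with the 4-byte `GGUF` magic. The directory list is an assumption: `~/.cache/huggingface/hub` is the default HuggingFace cache location, while the LM Studio paths are guesses that may differ by version. The helper names `is_gguf` and `find_gguf_models` are hypothetical.

```python
from pathlib import Path

GGUF_MAGIC = b"GGUF"  # GGUF files begin with these four bytes

# Assumed locations; adjust for your own setup.
CANDIDATE_DIRS = [
    "~/.cache/huggingface/hub",   # default HuggingFace hub cache
    "~/.lmstudio/models",         # assumed LM Studio location
    "~/.cache/lm-studio/models",  # assumed older LM Studio location
]


def is_gguf(path: Path) -> bool:
    """Return True if the file can be read and starts with the GGUF magic."""
    try:
        with path.open("rb") as f:
            return f.read(4) == GGUF_MAGIC
    except OSError:
        return False


def find_gguf_models(roots):
    """Recursively collect valid .gguf files under the given directories."""
    found = []
    for root in roots:
        root = Path(root).expanduser()
        if not root.is_dir():
            continue  # skip locations that do not exist on this machine
        for candidate in root.rglob("*.gguf"):
            if candidate.is_file() and is_gguf(candidate):
                found.append(candidate)
    return sorted(found)


if __name__ == "__main__":
    for model in find_gguf_models(CANDIDATE_DIRS):
        print(model)
```

Checking the magic bytes in addition to the extension filters out partial or mislabeled downloads before they are offered in a selector.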
AIChat.app version 1.0.2
Misc fixes:
- dark theme fix
- attempt to set a better context size depending on system RAM
- update llama.cpp to b7020
- improve loading pages
- persist last window size and position
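As a rough illustration of how a context size can be derived from available RAM, the Python sketch below estimates KV cache memory from a model's shape and inverts that to get the largest context that fits a memory budget. This is not the formula AIChat uses. The function names, the 1024-token rounding step, and the example model shape (32 layers, 8 KV heads, head dimension 128) are assumptions chosen for illustration, and the estimate ignores model weights and compute buffers.

```python
F16_BYTES = 2.0         # 16-bit float cache entries
Q8_0_BYTES = 34 / 32    # q8_0 block: 32 int8 values plus one f16 scale


def kv_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim, bytes_per_elem=F16_BYTES):
    """Estimate KV cache size: one K and one V vector per layer per token."""
    return int(2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem)


def max_context(ram_budget_bytes, n_layers, n_kv_heads, head_dim,
                bytes_per_elem=F16_BYTES, step=1024):
    """Largest context (rounded down to a multiple of step) whose cache fits the budget."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem
    tokens = int(ram_budget_bytes // per_token)
    return (tokens // step) * step


if __name__ == "__main__":
    budget = 4 * 1024**3  # hypothetical 4 GiB reserved for the KV cache
    print(max_context(budget, 32, 8, 128))              # f16 cache
    print(max_context(budget, 32, 8, 128, Q8_0_BYTES))  # 8-bit quantized cache
```

With these example numbers, a quantized cache fits nearly twice as many tokens in the same budget, which is why a smaller cache element type translates directly into more usable context.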
AIChat.app version 1.0.1
Improve handling of slow llama-server launch with a longer timeout, especially for the first launch of the app.
Track server launch progress in the WebView and give the user more information if something goes wrong.
No executable binaries have been changed since v 1.0.
AIChat.app version 1.0
Initial release
AIChat.app is an app for running large language models in GGUF format locally.
Embedded llama-server is taken from this release:
https://github.com/ggml-org/llama.cpp/releases/tag/b6895
Download the archive built for your computer's architecture.
Running some bigger models on outdated Intel Macs could be a painful experience. Apple Silicon Macs fare much better.