Skip to content

Releases: abra-code/AIChatApp

AIChat.app version 1.1

18 May 06:47

Choose a tag to compare

  • Added local model selector dialog, trying to find GGUF model files you might have downloaded with HuggingFace or LM Studio.
  • Added a HuggingFace model browser and downloader.
  • Improved calculations for context size. Added settings for reduced kv cache size, making even more context size available per RAM.
  • Added 10 min inactivity setting to llama-server to unload the model and free the memory - now the app can be left open in the background and will not wire the RAM permanently
  • update llama-server to b9204

AIChat now requires macOS 14.6. The release is only for Apple Silicon macs (arm64). Intel mac support dropped - you cannot get a reasonable inference speeds on these machines.

AIChat.app version 1.0.2

14 Nov 00:28

Choose a tag to compare

Misc fixes:

  • dark theme fix
  • attempt to set a better context size depending on system RAM
  • update llama.cpp to b7020
  • improve loading pages
  • persist last window size and position

AIChat.app version 1.0.1

03 Nov 08:59

Choose a tag to compare

Improve handing of slow llama-server launch with longer timeout, especially for the first launch of the app.
Track server launch progress in the WebView and give user more information if something goes wrong.
No executable binaries have been changed since v 1.0.

AIChat.app version 1.0

31 Oct 10:22

Choose a tag to compare

Initial release
AIChat.app is an app for running large language models in GGUF format locally.
Embedded llama-server is taken from this release:
https://github.com/ggml-org/llama.cpp/releases/tag/b6895

Download the archive with architecture appropriate for your computer.
Running some bigger models on outdated Intel Macs could be a painful experience. Apple Silicon Macs fare much better.