How to implement Claude Prompt Caching for intent classification in Neuron AI #374
-
|
I'm building a chatbot using Neuron AI framework and Claude API, and I have a use case that would greatly benefit from prompt caching. A comprehensive list of user intents (~20-30 intents) This static intent list is sent with every user message to classify their intent, which is inefficient in terms of both cost and latency. Adding the anthropic-beta: prompt-caching-2024-07-31 header My questions: Does Neuron AI framework support Claude's Prompt Caching feature? Add the required beta header to requests? Are there any examples or documentation for implementing prompt caching with Neuron AI? Any guidance or code examples would be greatly appreciated! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
|
It's not supported out of box. Provider specific features could be implemented hooking the current framework design. If it can't be done it's an opportunity to propose a design change to make the framework more flexible overtime. |
Beta Was this translation helpful? Give feedback.
It's not supported out of box. Provider specific features could be implemented hooking the current framework design. If it can't be done it's an opportunity to propose a design change to make the framework more flexible overtime.