Question: Injecting token usage while constructing the system prompt will invalidate the cache.

https://github.com/bubbuild/bub/blob/951da645054f7a565a892cbc06c687b66dae217e/src/bub/core/model_runner.py#L198

Provide the LLM with a tool similar to `from litellm import token_counter` to calculate tokens, and then remove token_usage from the system prompt