Fix/llm optimizations by Iamsdt · Pull Request #123 · 10xHub/Agentflow

Iamsdt · 2026-05-22T13:37:44Z

This pull request introduces a shared utility for making single-turn LLM calls (call_llm), centralizing logic that was previously duplicated across several modules. It also adds improved cache handling and logging for both Google and OpenAI LLM providers, and updates module exports to surface the new utility and related classes. The changes improve code reuse, simplify future maintenance, and enhance observability of cache hits.

Key changes include:

Shared LLM Call Utility:

Added a new call_llm function in agentflow/core/llm/caller.py that abstracts single-turn LLM calls for both Google and OpenAI, handling provider detection, system prompts, JSON mode, and cache support. This utility returns a tuple with the generated text and token usage statistics. ([agentflow/core/llm/caller.pyR1-R275](https://github.com/10xHub/Agentflow/pull/123/files#diff-82566d5139e0b7b2848b16f7b04915b14919caf110df4f89b05347b264583b2eR1-R275))
Updated agentflow/core/llm/__init__.py to export call_llm alongside the existing client factory utilities. ([agentflow/core/llm/__init__.pyR3-R7](https://github.com/10xHub/Agentflow/pull/123/files#diff-ad245908c6f01b3a0914880a56645148d6deaef6c6250a34e56badc6cd62c3a9R3-R7))

Cache Handling and Observability:

Improved handling of explicit cache content for Google LLM calls: if a cache is used, the system instruction is not resent, and cache hits are logged. ([[1]](https://github.com/10xHub/Agentflow/pull/123/files#diff-e3191b678bfb7cbad630798c0122a01b62134f674c62fc09a90c492e2f3d7135L307-L309), [[2]](https://github.com/10xHub/Agentflow/pull/123/files#diff-e3191b678bfb7cbad630798c0122a01b62134f674c62fc09a90c492e2f3d7135R315-R321), [[3]](https://github.com/10xHub/Agentflow/pull/123/files#diff-e3191b678bfb7cbad630798c0122a01b62134f674c62fc09a90c492e2f3d7135L373-R387), [[4]](https://github.com/10xHub/Agentflow/pull/123/files#diff-e3191b678bfb7cbad630798c0122a01b62134f674c62fc09a90c492e2f3d7135R410-R427))
Enhanced OpenAI API call methods to log cache hits and extract cache token usage from responses, both for chat completions and responses APIs. ([[1]](https://github.com/10xHub/Agentflow/pull/123/files#diff-c14055d8309966d71807222c92629294b7e68eb3d788da4657baf196e21311c8L104-R133), [[2]](https://github.com/10xHub/Agentflow/pull/123/files#diff-c14055d8309966d71807222c92629294b7e68eb3d788da4657baf196e21311c8L226-R254))

Module Exports and State Management:

Updated agentflow/core/state/__init__.py to export SummaryContextManager, making it available for import elsewhere in the codebase. ([[1]](https://github.com/10xHub/Agentflow/pull/123/files#diff-2e3404ad0021e70d98d93d1257fe333381b405bb71aba7f05dff7e4b13bd03f6R42), [[2]](https://github.com/10xHub/Agentflow/pull/123/files#diff-2e3404ad0021e70d98d93d1257fe333381b405bb71aba7f05dff7e4b13bd03f6R66))

…ation with token-budget support feat: add call_llm utility for single-turn LLM calls, centralizing provider dispatch refactor: update __init__.py files to include new context manager and utility imports test: add unit tests for SummaryContextManager functionality and behavior

…d add related configurations

… hits

…le LLM calls

+
+from __future__ import annotations
+
+from unittest.mock import AsyncMock, MagicMock, patch


+
+from agentflow.core.state.agent_state import AgentState
+from agentflow.core.state.message import Message
+from agentflow.core.state.message_block import TextBlock, ToolCallBlock, ToolResultBlock


codecov · 2026-05-22T13:40:22Z

Codecov Report

❌ Patch coverage is 61.15108% with 108 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
agentflow/core/llm/caller.py	32.18%	53 Missing and 6 partials ⚠️
agentflow/qa/evaluation/criteria/llm_utils.py	40.62%	17 Missing and 2 partials ⚠️
agentflow/core/graph/agent_internal/openai.py	47.61%	8 Missing and 3 partials ⚠️
agentflow/core/state/summary_context_manager.py	90.29%	5 Missing and 5 partials ⚠️
agentflow/core/graph/agent_internal/google.py	57.14%	3 Missing and 3 partials ⚠️
...entflow/qa/evaluation/simulators/user_simulator.py	70.00%	3 Missing ⚠️

📢 Thoughts on this report? Let us know!

Iamsdt added 4 commits May 22, 2026 16:33

feat: enhance OpenAI API integration with support for "chat" style an…

1ebe8dc

…d add related configurations

feat: add caching support for LLM calls and enhance logging for cache…

2d8718f

… hits

fix: adjust handling of system instructions in caching logic for Goog…

7b63bdf

…le LLM calls

github-code-quality Bot found potential problems May 22, 2026

View reviewed changes

Iamsdt merged commit a443a1f into main May 22, 2026
5 of 6 checks passed

Iamsdt deleted the fix/llm_optimizations branch May 22, 2026 13:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix/llm optimizations#123

Fix/llm optimizations#123
Iamsdt merged 4 commits into
mainfrom
fix/llm_optimizations

Iamsdt commented May 22, 2026

Uh oh!

codecov Bot commented May 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant


		from __future__ import annotations

		from unittest.mock import AsyncMock, MagicMock, patch

Conversation

Iamsdt commented May 22, 2026

Uh oh!

codecov Bot commented May 22, 2026

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant