g3/crates/g3-providers at 2a4cd1f4d6518a007e67895fdcd2b6dae88d6032 - g3

alex/g3

Dhanji R. Prasanna 2a4cd1f4d6 fix: strip duplicate tool call JSON from assistant messages when LLM stutters

When the LLM emits identical JSON tool calls as text content (JSON
fallback mode), the raw duplicate JSON was being stored in the assistant
message in conversation history. This confused the model on subsequent
turns, causing it to stall or repeat itself.

Root cause: raw_content_for_log used get_text_content(), which returns
the full parser buffer, including every duplicate tool call JSON.

Fix: Added get_text_before_tool_calls() to StreamingToolParser, which
returns only the text before the first JSON tool call. Changed
raw_content_for_log to use this method so the assistant message contains
only the preamble text + the single executed tool call.
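The difference between the two accessors can be illustrated with a minimal sketch. This is not the g3 implementation: ParserSketch is a hypothetical stand-in for StreamingToolParser, and the assumption that a tool call begins with the literal `{"tool"` is made only for this example.

```rust
/// Hypothetical stand-in for the parser buffer. The real StreamingToolParser
/// in g3 holds more state and may detect tool calls differently.
pub struct ParserSketch {
    buffer: String,
}

impl ParserSketch {
    pub fn new(buffer: &str) -> Self {
        Self { buffer: buffer.to_string() }
    }

    /// Returns the full buffer, including any duplicated tool call JSON.
    pub fn get_text_content(&self) -> &str {
        &self.buffer
    }

    /// Returns only the text before the first JSON tool call, with trailing
    /// whitespace removed. If no tool call is present, the whole buffer
    /// (trimmed) is returned.
    /// Assumption: a tool call starts with the literal `{"tool"`.
    pub fn get_text_before_tool_calls(&self) -> &str {
        let end = self.buffer.find("{\"tool\"").unwrap_or(self.buffer.len());
        self.buffer[..end].trim_end()
    }
}
```

With a buffer holding a short preamble followed by the same tool call twice, get_text_content() would still carry both JSON copies, while get_text_before_tool_calls() yields only the preamble, so the stored assistant message can be rebuilt from the preamble plus the one executed call.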

Added 5 integration tests covering stuttered duplicates, triple
stutter, cross-turn dedup, and the different-args boundary case.
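The boundary the tests describe (identical calls collapse, calls with different args do not) can be sketched as follows. The type and function names here are hypothetical and chosen for illustration; the actual tests and dedup logic are not shown on this page, and args are kept as a raw JSON string to avoid extra dependencies.

```rust
/// Hypothetical, simplified tool call: tool name plus raw JSON args.
#[derive(Debug, Clone, PartialEq)]
pub struct ToolCallSketch {
    pub tool: String,
    pub args: String,
}

/// Convenience constructor used only in this sketch.
pub fn call(tool: &str, args: &str) -> ToolCallSketch {
    ToolCallSketch { tool: tool.to_string(), args: args.to_string() }
}

/// Collapses runs of consecutive identical calls (a "stutter") into one.
/// Calls that differ in tool name or args are kept, so a legitimate second
/// call with different arguments is not dropped.
pub fn drop_stuttered_calls(calls: &[ToolCallSketch]) -> Vec<ToolCallSketch> {
    let mut out = calls.to_vec();
    out.dedup(); // Vec::dedup removes consecutive equal elements
    out
}
```

A triple stutter of the same call reduces to a single call, while the same tool invoked with different args survives as a separate entry.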

Added MockResponse helpers for simulating LLM stutter patterns.

2026-02-10 19:53:11 +11:00

Name | Last commit | Date
src | fix: strip duplicate tool call JSON from assistant messages when LLM stutters | 2026-02-10 19:53:11 +11:00
tests | Add integration tests for CacheStats and Gemini serialization | 2026-01-29 11:28:52 +11:00
Cargo.toml | refactor: Clean up Cargo dependencies - remove unused, update outdated | 2026-02-06 14:22:59 +11:00