alex/g3 - g3 - Millerson GIT hosting

alex/g3

Author	SHA1	Message	Date
Dhanji R. Prasanna	39e586982c	feat: Externalize research tool as embedded skill Replaces the built-in research/research_status tools with a portable skill-based approach: - Add embedded skills infrastructure (skills compiled into binary) - Add repo-local skills/ directory support (highest priority) - Create research skill with SKILL.md and g3-research shell script - Script extraction to .g3/bin/ with version tracking - Filesystem-based handoff via .g3/research/<id>/status.json - Remove PendingResearchManager and all research tool code - Update system prompt to reference skill instead of tool Benefits: - No special tool infrastructure needed (just shell + read_file) - Context-efficient (reports stay on disk until needed) - Crash-resilient (state persisted to filesystem) - Portable (skill can be overridden per-workspace) Breaking change: research tool calls now return a deprecation message pointing to the research skill.	2026-02-05 13:23:26 +11:00
Dhanji R. Prasanna	0e64f13a8a	Merge feature/agent-skills-support: Agent Skills specification support	2026-02-05 12:46:53 +11:00
Dhanji R. Prasanna	0f919237ea	Make plan approval gate only active in plan mode - Add in_plan_mode flag to Agent struct - Add set_plan_mode() and is_plan_mode() methods - Gate check now only runs when in_plan_mode is true - CLI calls set_plan_mode(true) on /plan command and EnterPlanMode - CLI calls set_plan_mode(false) on approval and CTRL-D exit - Update integration test to enable plan mode - Fix test YAML to use Vec<Check> for negative/boundary checks	2026-02-05 11:41:52 +11:00
Dhanji R. Prasanna	c347a73cbd	Add plan approval gate to block file changes without approved plan - Add check_plan_approval_gate() in tools/plan.rs that runs after each tool call - Detects file changes via git status --porcelain when plan exists but not approved - Reverts changes: git checkout for modified files, rm for new untracked files - Returns blocking message instructing LLM to create/approve plan first - Add ApprovalGateResult enum with Allowed/Blocked/NotGitRepo variants - Add set_session_id() and set_working_dir() methods on Agent for testing - Add integration test using MockProvider to simulate blocked write_file	2026-02-05 11:34:10 +11:00
Dhanji R. Prasanna	fdb1255f02	Add --resume <session-id> flag for explicit session resumption - Add --resume CLI flag that conflicts with --new-session - Add load_continuation_by_id() to load sessions by full or partial ID - Support loading from latest.json or falling back to session.json - Handle --resume in both normal and agent modes - Agent mode validates session belongs to correct agent	2026-02-05 10:23:39 +11:00
Dhanji R. Prasanna	a5f6475603	feat: implement Agent Skills specification support Implements the Agent Skills specification (https://agentskills.io) for portable skill packages that give the agent new capabilities. Changes: - Add skills module with SKILL.md parser (YAML frontmatter + markdown body) - Implement skill discovery from ~/.g3/skills/, config extra_paths, and .g3/skills/ - Generate <available_skills> XML for system prompt injection - Add SkillsConfig to g3-config with enabled flag and extra_paths - Wire skills discovery into CLI startup - Add 29 unit tests for parser, discovery, and prompt generation - Update README with Agent Skills documentation Skill locations (priority order): 1. ~/.g3/skills/ (global) 2. Config extra_paths 3. .g3/skills/ (workspace, highest priority) At startup, g3 scans skill directories and injects a summary into the system prompt. When the agent needs a skill, it reads the full SKILL.md using the read_file tool.	2026-02-04 12:58:57 +11:00
Dhanji R. Prasanna	f93d05f444	Add real-time research completion notifications When background research completes, g3 now immediately prints a status message instead of waiting for the next user interaction: - Added ResearchCompletionNotification and broadcast channel to PendingResearchManager for push-based notifications - Added spawn_research_notification_handler() in interactive mode that listens for completions in a background task - When idle (at prompt): clears line, prints status, reprints prompt - When busy (processing): prints status inline (interleaving is fine) - Added G3Status::research_complete() for consistent formatting - Added enable_research_notifications() method to Agent Output format: "g3: 1 research report ... [done]"	2026-01-30 13:35:35 +11:00
Dhanji R. Prasanna	5ab1598e03	feat: async research tool - runs in background, returns immediately The research tool now spawns the scout agent in a background tokio task and returns immediately with a research_id placeholder. This allows the agent to continue working while research runs (30-120 seconds). Key changes: - New PendingResearchManager for tracking async research tasks - research tool returns immediately with placeholder containing research_id - research_status tool to check progress of pending research - Auto-injection of completed research at natural break points: - Start of each tool iteration (before LLM call) - Before prompting user in interactive mode - /research CLI command to list all research tasks - Updated system prompt to explain async behavior The agent can: - Continue with other work while research runs - Check status with research_status tool - Yield turn to user if results are critical before continuing	2026-01-30 13:00:02 +11:00
Dhanji R. Prasanna	7bfb9efa19	Remove automatic README loading from context window README.md is no longer auto-loaded into the LLM context at startup. This saves ~4,600 tokens per session while AGENTS.md and memory.md still provide all critical information for code tasks. Changes: - Delete read_project_readme() function - Remove readme_content parameter from combine_project_content() - Rename extract_readme_heading() -> extract_project_heading() - Rename Agent constructors: _with_readme_ -> _with_project_context_ - Update context preservation to only check for Agent Configuration - Remove has_readme field from LoadedContent - Update all tests to use new markers and function names The LLM can still read README.md on-demand via read_file when needed.	2026-01-29 11:07:41 +11:00
Dhanji R. Prasanna	735e9c9312	Add Google Gemini provider support - Add GeminiProvider with streaming and native tool calling - Support gemini-2.5-pro, gemini-2.0-flash, gemini-1.5-pro/flash models - Model-specific context window detection (1M-2M tokens) - Message conversion: assistant -> model role mapping - System messages extracted to system_instruction field - Tool schema conversion with functionCall/functionResponse parts - SSE streaming with JSON array buffer parsing - 8 unit tests for conversion and parsing logic - Register provider in g3-core and validate in g3-cli	2026-01-29 10:11:42 +11:00
Dhanji R. Prasanna	58fe74334d	Auto-detect context window size from GGUF for embedded providers - Add context_window_size() method to LLMProvider trait - Implement for EmbeddedProvider to return the auto-detected context length - Update Agent to query provider directly instead of using hardcoded defaults - Removes need for model-specific context length mappings	2026-01-28 11:16:14 +11:00
Dhanji R. Prasanna	55dba121b7	Add GLM-4 to context length defaults (32k) GLM-4 models support 32k context but were falling back to the conservative 4096 default, causing context overflow on startup.	2026-01-28 10:46:36 +11:00
Dhanji R. Prasanna	a902be1562	Refactor system prompts to eliminate duplication; upgrade embedded provider - Refactor prompts.rs: extract shared sections (intro, TODO, workspace memory, web research, response guidelines) used by both native and non-native prompts - Fix typo in native prompt: "save them.." -> "save them." - Fix non-native prompt: add missing closing braces in JSON examples, add IMPORTANT steps section, align with native prompt quality - Add 9 unit tests to verify both prompts contain required sections - Upgrade llama-cpp-2 dependency and refactor embedded provider - Update config.example.toml with embedded model examples - Update workspace memory	2026-01-28 09:56:39 +11:00
Dhanji R. Prasanna	cdb8b0f5eb	refactor(g3-core): consolidate Agent construction into single canonical path Eliminate code-path aliasing in Agent construction methods by introducing a single `build_agent()` helper that all constructors delegate to. Before: 3 nearly-identical `Ok(Self { ... })` blocks (~30 lines each) with subtle differences in auto_compact, is_autonomous, quiet, and computer_controller fields - prone to drift over time. After: Single canonical `build_agent()` method that constructs Agent with all fields. All public constructors delegate to this single path: - new_for_test() -> new_for_test_with_readme() -> build_agent() - new_with_mode_and_readme() -> build_agent() Changes: - Add `build_agent()` private helper method (single source of truth) - Simplify `new_for_test()` to delegate to `new_for_test_with_readme()` - Update `new_for_test_with_readme()` to use `build_agent()` - Update `new_with_mode_and_readme()` to use `build_agent()` Net reduction: ~43 lines (-109/+66) All 190 tests pass. Agent: fowler	2026-01-27 12:01:12 +11:00
Dhanji R. Prasanna	dfa0e4bfa2	refactor(g3-core): add section markers to lib.rs for better organization Added clear section comments to organize the 3000-line lib.rs into logical groupings: - CONSTRUCTION METHODS (~line 159) - CONFIGURATION & PROVIDER RESOLUTION (~line 444) - TASK EXECUTION (~line 782) - SESSION MANAGEMENT (~line 1069) - CONTEXT WINDOW OPERATIONS (~line 1148) - STREAMING & LLM INTERACTION (~line 1563) - TOOL EXECUTION (~line 2825) This improves code navigation and provides clear boundaries for future extraction into separate modules. No behavioral changes - all 191 tests pass. Agent: fowler	2026-01-27 11:46:17 +11:00
Dhanji R. Prasanna	5b4079e861	Add prompt cache statistics tracking to /stats command - Extend Usage struct with cache_creation_tokens and cache_read_tokens fields - Parse Anthropic cache_creation_input_tokens and cache_read_input_tokens - Parse OpenAI prompt_tokens_details.cached_tokens for automatic prefix caching - Add CacheStats struct to Agent for cumulative tracking across API calls - Add "Prompt Cache Statistics" section to /stats output showing: - API call count and cache hit count - Hit rate percentage - Total input tokens and cache read/creation tokens - Cache efficiency (% of input served from cache) - Update all provider implementations and test files	2026-01-27 11:32:45 +11:00
Dhanji R. Prasanna	9325a43ff3	feat(cli): shorten file paths in tool output display Add three-level path shortening hierarchy for cleaner CLI output: 1. Project path -> <project_name>/... (when project loaded via /project) 2. Workspace path -> ./... (relative to current working directory) 3. Home path -> ~/... (fallback for paths under home directory) Changes: - Add shorten_path() and shorten_paths_in_command() functions in display.rs - Add project_path/project_name fields to ConsoleUiWriter - Add set_workspace_path(), set_project_path(), clear_project() to UiWriter trait - Add ui_writer() getter to Agent struct - Wire up project path setting in /project and /unproject commands - Set workspace path when creating agents in all CLI modes Before: ● read_file \| /Users/dhanji/icloud/butler/projects/appa_estate/status.md After: ● read_file \| appa_estate/status.md (with project loaded) ● read_file \| ./src/main.rs (workspace-relative) ● read_file \| ~/Documents/file.txt (home-relative)	2026-01-21 21:27:16 +05:30
Dhanji R. Prasanna	feb7c3e40d	Add /project and /unproject commands for project-specific context - Add Project struct in crates/g3-cli/src/project.rs with file loading logic - Load brief.md, contacts.yaml, status.md from project path - Load projects.md from workspace root for cross-project context - Project content appended to system message (survives compaction/dehydration) - /project <path> loads project and auto-submits prompt asking about state - /unproject clears project content and resets context - Add set_project_content(), clear_project_content(), has_project_content() to Agent - Add new_for_test_with_readme() for testing with custom README content - Add 6 unit tests for Project struct - Add 9 integration tests for project context behavior	2026-01-21 14:53:30 +05:30
Dhanji R. Prasanna	07c0bf1e39	Remove 'Task: ' prefix from user messages The prefix was causing duplication when users typed 'Task: ...' themselves, resulting in '📋 Task: Task: ...' in context dumps. User messages are now stored as-is without any prefix.	2026-01-20 21:53:28 +05:30
Dhanji R. Prasanna	168cfff2ed	refactor(g3-core): extract tool output formatting to streaming.rs Centralize tool output formatting logic that was duplicated/scattered in stream_completion_with_tools(). This eliminates code-path aliasing where tool type checks were done in multiple places. Changes: - Add ToolOutputFormat enum (SelfHandled, Compact, Regular) - Add format_tool_result_summary() for centralized formatting decisions - Add is_compact_tool() and is_self_handled_tool() helper functions - Move parse_diff_stats() from lib.rs to streaming.rs - Simplify tool execution display logic in lib.rs using new helpers Net effect: -86 lines in lib.rs, +112 lines in streaming.rs The streaming.rs additions are reusable, well-named functions. All 585+ workspace tests pass. Agent: fowler	2026-01-20 15:45:35 +05:30
Dhanji R. Prasanna	9abb3735d2	refactor(g3-core): use StreamingState and IterationState structs in stream_completion_with_tools Consolidate scattered state variables in the 834-line stream_completion_with_tools() function to use the existing StreamingState and IterationState structs from streaming.rs. This eliminates code-path aliasing where state was tracked in multiple places and makes the streaming loop easier to reason about. Changes: - Add assistant_message_added field to StreamingState - Add stream_stop_reason field to IterationState - Replace 8 inline state variables with StreamingState::new() - Replace 7 iteration-local variables with IterationState::new() - All 585 workspace tests pass This is a pure refactor with no behavior changes. The state structs were already defined in streaming.rs but not used in the main streaming loop. Agent: fowler	2026-01-20 15:05:23 +05:30
Dhanji R. Prasanna	182f5f98fe	Centralize g3 status message formatting Extract a new g3_status module in g3-cli that provides consistent formatting for all 'g3:' prefixed system status messages. Key changes: - Add G3Status struct with methods for progress, done, failed, error, etc. - Add Status enum with Done, Failed, Error, Resolved, Insufficient, NoChanges - Add ThinResult struct in g3-core for semantic thinning data - Update UiWriter trait with print_thin_result() method - Refactor context thinning to return ThinResult instead of formatted strings - Update all callers to use the new centralized formatting - Session resume/decline messages now use G3Status - Compaction status messages now use G3Status This maintains clean separation of concerns: g3-core emits semantic data, g3-cli handles all terminal formatting and colors.	2026-01-20 09:50:55 +05:30
Dhanji R. Prasanna	7bd72a4a51	Add tests for tool-specific timeout durations Adds 8 unit tests verifying: - Research tool has 20-minute timeout - All other tools (shell, read_file, write_file, str_replace, code_search, webdriver_*, etc.) have standard 8-minute timeout - Comprehensive test_only_research_has_extended_timeout covers 19 tools This ensures future changes don't accidentally affect other tool timeouts.	2026-01-19 21:58:16 +05:30
Dhanji R. Prasanna	4b7be3f9ee	Increase research tool timeout to 20 minutes The research tool often runs past 8 minutes due to web browsing and analysis. Increased its timeout to 20 minutes while keeping other tools at 8 minutes. Changes: - Tool timeout is now tool-specific (20 min for research, 8 min for others) - Timeout error message now shows the correct duration for each tool	2026-01-19 21:51:08 +05:30
Dhanji R. Prasanna	5caa101b84	Fix inline JSON being incorrectly detected as tool call The bug was caused by mark_tool_calls_consumed() being called after displaying each chunk, which advanced last_consumed_position to the end of the current buffer. When the next chunk arrived with JSON, the unchecked_buffer started at position 0 of the slice, causing is_on_own_line() to return true (position 0 is always "on its own line"). Removed the problematic mark_tool_calls_consumed() call from the "no tool executed" branch. The remaining call after actual tool execution is correct and necessary. Added integration test that verifies inline JSON in prose is not detected as a tool call.	2026-01-19 14:35:01 +05:30
Dhanji R. Prasanna	349230d0b7	Fix missing assistant messages in context window Bug: When the LLM responded with text-only (no tool calls), the assistant message was sometimes not saved to the context window. This caused consecutive user messages where the LLM would lose track of previous responses. Root causes found and fixed: 1. Early return path (line ~2535): When stream finishes with no tools executed in previous iterations (any_tool_executed=false), the code returned early without saving the assistant message. Fixed by adding save before return. 2. Post-loop path (line ~2657): When raw_clean was empty but current_response had content, no message was saved. Fixed by falling back to current_response. Both paths now properly save the assistant message before returning. The assistant_message_added flag prevents any duplication. Added tests: - missing_assistant_message_test.rs: verifies the fallback logic - assistant_message_dedup_test.rs: verifies no duplicate messages - consecutive_assistant_message_test.rs: verifies alternation invariant	2026-01-19 13:50:28 +05:30
Dhanji R. Prasanna	fc702168ab	Add streaming completion integration test with mock LLM provider Adds tests to verify that: - All streaming chunks are processed before control returns to caller - Both tool calls in a multi-tool-call stream are executed - The finished signal properly terminates stream processing Also adds Agent::new_for_test() to allow injecting mock providers.	2026-01-16 20:52:32 +05:30
Dhanji R. Prasanna	0e33465342	Add print_g3_progress/print_g3_status methods for consistent status messages	2026-01-16 20:28:24 +05:30
Dhanji R. Prasanna	95f89d3f8e	Simplify compaction status messages	2026-01-16 20:26:35 +05:30
Dhanji R. Prasanna	7c59d1993c	Fix auto-memory JSON leak: tool call printed raw to UI The JSON filter only suppresses tool calls at line boundaries. When "Memory checkpoint: " was printed without a trailing newline, the LLM response `{"tool": "remember", ...}` appeared on the same line and leaked through to the UI. Fix: - Add trailing newline to "Memory checkpoint:" message - Reset JSON filter state before streaming the response Added test: test_tool_call_not_at_line_start_passes_through Documents the filter behavior and references the fix location.	2026-01-16 13:10:18 +05:30
Dhanji R. Prasanna	0ae1a13cdb	feat: real-time tool call streaming indicator with blinking UI - Add ToolParsingHint enum (Detected/Active/Complete) for UI feedback - New UiWriter methods: print_tool_streaming_hint(), print_tool_streaming_active() - Refactor ConsoleUiWriter state to use atomics in ParsingHintState - Add tool_call_streaming field to CompletionChunk for provider hints - Anthropic provider sends streaming hints when tool name detected - New streaming helpers: make_tool_streaming_hint(), make_tool_streaming_active() Parser improvements: - Add is_json_invalidated() to detect false positive tool patterns - Fix tool result poisoning when file contents contain partial JSON - Unescaped newlines in strings or prose after JSON invalidates detection User sees ' ● tool_name \|' immediately when tool call starts streaming, with blinking indicator while args are received.	2026-01-15 13:49:29 +05:30
Dhanji R. Prasanna	999ac6fe66	fix: prevent parser poisoning from inline tool-call JSON patterns The streaming parser was incorrectly detecting tool call patterns that appeared inline in prose (e.g., when explaining the format), causing g3 to return control mid-task. Fix: Modified find_first_tool_call_start() and find_last_tool_call_start() to only recognize patterns that appear on their own line (at start of buffer or after newline with only whitespace before the pattern). Changes: - Added is_on_own_line() helper to check line-boundary conditions - Updated detection methods to skip inline patterns - Removed sanitize_inline_tool_patterns() and LBRACE_HOMOGLYPH (no longer needed) - Rewrote tests for new behavior - Added streaming_repro tests that use process_chunk() to verify the exact bug scenario 28 tests covering: streaming repro, line boundaries, Unicode, code contexts, edge cases	2026-01-15 13:49:29 +05:30
Dhanji R. Prasanna	f4562cd4c9	config: default agent settings and provider override	2026-01-14 20:14:33 +05:30
Dhanji R. Prasanna	5104bd53b6	refactor(g3-core): improve stream_completion_with_tools readability Extract and simplify the streaming completion function: - Extract ensure_context_capacity() helper for pre-loop context management (thinning + compaction logic now in dedicated async method) - Simplify compact_summary generation block: flatten nested if/match, remove redundant comments, reorder branches for clarity - Remove dead code: unused _last_error variable and modified_tool_call - Streamline duplicate detection block: reduce verbose logging - Clean up text content display block: remove redundant comments, tighten variable declarations - Remove redundant is_todo_tool redefinition inside block expression Net reduction: 79 lines (-187/+108) Behavior unchanged, all unit tests pass. Agent: carmack	2026-01-14 15:11:53 +05:30
Dhanji R. Prasanna	dea0e6b1ca	Compact tool output improvements - Rename take_screenshot -> screenshot, code_coverage -> coverage (shorter names) - Align \| character across all compact tools (pad to 11 chars for str_replace) - Make code_search a compact tool with summary display - Show language and search name in code_search output (e.g., rust:"find structs") - Add format_code_search_summary() to extract match/file counts from JSON response	2026-01-14 08:12:50 +05:30
Dhanji R. Prasanna	7d17b436f9	refactor(g3-core): remove 3 unused Agent constructor variants Remove dead code - constructor variants that had no callers: - new_with_readme() - new_autonomous_with_readme() - new_with_quiet() These were thin wrappers around new_with_mode_and_readme() that were never used externally. All 5 remaining constructors have verified callers. Results: - lib.rs reduced from 2817 to 2797 lines (-20 lines) - Eliminated code-path aliasing: 8 constructors → 5 constructors - All g3-core tests pass - Full workspace compiles cleanly Agent: fowler	2026-01-14 04:26:42 +05:30
Dhanji R. Prasanna	a1dfd9c0b6	Enhanced auto-memory with rich few-shot format - Updated memory reminder prompt with per-symbol char ranges - Added two few-shot examples: Session Continuation (feature) + UTF-8 Safe Slicing (pattern) - Updated system prompt Memory Format section to match - Format: file -> nested symbols with [start..end] ranges and descriptions - Enables direct read_file navigation to specific functions	2026-01-13 21:49:48 +05:30
Dhanji R. Prasanna	5e45e110e2	refactor(g3-core): extract finalize_streaming_turn() to unify return paths Extract a single canonical helper function for completing streaming turns, eliminating 3 nearly-identical return paths in stream_completion_with_tools(). Changes: - Add finalize_streaming_turn() helper that handles: - Finishing streaming markdown - Saving context window - Adding timing footer (when requested) - Dehydrating context (when ACD enabled) - Building TaskResult - Replace 3 duplicated return blocks with calls to the helper - Remove unused mut on full_response variable Results: - Function reduced from 1067 to 999 lines (-68 lines) - Eliminated code-path aliasing: 3 paths → 1 canonical path - All 32 characterization tests pass - Full g3-core test suite passes Agent: fowler	2026-01-13 16:52:48 +05:30
Dhanji R. Prasanna	47e3a88cf6	refactor(g3-core): extract stats formatting to dedicated module Extract the get_stats() function (158 lines) from lib.rs to a new stats.rs module. Changes: - Create stats.rs with AgentStatsSnapshot struct for capturing agent state - Replace inline formatting logic with delegation to snapshot.format() - Add unit tests for stats formatting (empty and populated states) - Reduce lib.rs from 2961 to 2818 lines (-143 lines) The new module improves: - Testability: Stats formatting can now be unit tested in isolation - Separation of concerns: Formatting logic is decoupled from Agent struct - Readability: lib.rs is more focused on core agent behavior All 271 workspace tests pass. Agent: fowler	2026-01-13 16:11:53 +05:30
Dhanji R. Prasanna	a09967eb27	refactor(streaming): Extract deduplication and auto-continue logic into helpers Improve readability of stream_completion_with_tools (~1000 line function): - Add deduplicate_tool_calls() helper with closure for previous-message check - Add should_auto_continue() with AutoContinueReason enum for clearer control flow - Replace inline deduplication loop with helper call (-19 lines) - Replace complex auto-continue conditional with match on reason enum (-13 lines) - Add section comments for major phases (State Init, Pre-loop, Main Loop, Auto-Continue, Post-Loop) - Add comprehensive tests for new helpers Net reduction: 82 deletions, behavior unchanged (172+ tests pass) Agent: carmack	2026-01-13 11:44:06 +05:30
Dhanji R. Prasanna	8dcb7a3dba	feat: add compact styled output for TODO tools TODO tools (todo_read, todo_write) now display with a cleaner, more compact format: - Styled header: " ● todo_read" or " ● todo_write" - Tree-style prefixes for content lines (│ and └) - Checkbox conversion: "- [ ]" → □, "- [x]" → ■ - Dimmed content for visual distinction - No timing footer (cleaner output) Changes: - Add print_todo_compact() method to UiWriter trait - Implement print_todo_compact() in ConsoleUiWriter - Update todo.rs to call print_todo_compact() instead of line-by-line output - Skip tool header, output header, and timing for TODO tools in agent streaming	2026-01-13 10:58:55 +05:30
Dhanji R. Prasanna	a0b9126555	Revert "refactor(g3-core): extract streaming logic to agent_streaming.rs" This reverts commit `a2e51cf075`.	2026-01-13 07:59:18 +05:30
Dhanji R. Prasanna	6907fa36c0	UI: Add newline before auto-memory skip message	2026-01-13 07:03:42 +05:30
Dhanji R. Prasanna	a2e51cf075	refactor(g3-core): extract streaming logic to agent_streaming.rs Reduce lib.rs complexity by extracting the streaming completion logic: - Extract stream_completion_with_tools (~1080 lines) to agent_streaming.rs - Extract stream_with_retry helper method - Extract parse_diff_stats helper function - Add handle_pre_stream_compaction helper for cleaner pre-stream logic - Add format_tool_output helper for tool output formatting - Remove 3 unused constructor variants: - new_with_readme - new_autonomous_with_readme - new_with_quiet Results: - lib.rs reduced from 2974 to 1791 lines (40% reduction) - Streaming logic cleanly separated into dedicated module - All tests pass, no behavior changes Agent: fowler	2026-01-13 06:14:56 +05:30
Dhanji R. Prasanna	1b051aad94	Fix write_file compact summary to show actual line/char counts The write_file compact display was showing 1 line because it was counting lines in the success message, not the actual written content. Now parses the tool result (e.g. '✅ wrote 150 lines \| 4.2k chars') to extract and display the correct counts. Added format_write_file_result() to parse the tool output.	2026-01-12 20:32:54 +05:30
Dhanji R. Prasanna	6f3530544d	Fix compact tool failure display to use single-line format When compact tools (read_file, write_file, str_replace, etc.) failed, they would fall through to the non-compact output path, causing: - Missing or incorrect headers - Stray footers with wrong formatting - State leakage (is_shell_compact) between tool calls Now failed compact tools display in the same single-line format as successful ones, just with a truncated error message instead of the success summary: ● read_file \| path/to/file.txt \| ❌ Failed to read file... \| 123 ◉ 0ms This keeps the UI consistent and avoids the "stray footer" bug.	2026-01-12 20:02:08 +05:30
Dhanji R. Prasanna	c2aa80647a	Remove legacy logs/ directory, consolidate all data under .g3/ This change removes the legacy logs/ directory and consolidates all session data, error logs, and discovery files under the .g3/ directory. New directory structure: - .g3/sessions/<session_id>/session.json - session logs - .g3/errors/ - error logs (was logs/errors/) - .g3/background_processes/ - background process logs - .g3/discovery/ - planner discovery files (was workspace/logs/) Changes: - paths.rs: Remove get_logs_dir()/logs_dir(), add get_errors_dir(), get_background_processes_dir(), get_discovery_dir() - session.rs: Anonymous sessions now use .g3/sessions/anonymous_<ts>/ - error_handling.rs: Errors now saved to .g3/errors/ - project.rs: Remove logs_dir() and ensure_logs_dir() methods - feedback_extraction.rs: Remove logs_dir field and fallback logic - planner: Use .g3/ for workspace data and .g3/discovery/ for reports - flock.rs: Look for session metrics in .g3/sessions/ - coach_feedback.rs: Remove fallback to logs/ path - Update all tests to use new paths - Update README.md and .gitignore	2026-01-12 18:20:08 +05:30
Dhanji R. Prasanna	43a5d27149	Add compact format for remember, take_screenshot, code_coverage, rehydrate Extend compact single-line output to additional tools: - remember: shows '📝 memory updated (size)' - take_screenshot: shows '📸 path' - code_coverage: shows '📊 report generated' - rehydrate: shows '🔄 restored fragment_id' Tools without file_path argument use simplified format: ● tool_name \| summary \| tokens ◉ time	2026-01-12 14:45:50 +05:30
Dhanji R. Prasanna	2c411c058a	Compact single-line tool output for file operations and shell Implement compact display format for read_file, write_file, str_replace, and shell: - read_file/write_file/str_replace: Single line with dimmed summary and timing Format: ● tool_name \| path [range] \| summary \| tokens ◉ time - shell: Two-line format with command header and dimmed output Format: ● shell \| command └─ output (N lines) \| tokens ◉ time Changes: - Add print_tool_compact() method to UiWriter trait - Add is_shell_compact state tracking in ConsoleUiWriter - Add format_write_file_summary() and format_str_replace_summary() helpers - Fix duplicate response output by checking if response is empty before printing - Add finish_streaming_markdown() call before return to flush markdown buffer	2026-01-12 14:37:47 +05:30
Dhanji R. Prasanna	8df044ac13	refactor(g3-core): reduce lib.rs complexity by extracting utilities - Extract truncate_to_word_boundary() to utils.rs with tests - Consolidate duplicate detection: use streaming::are_tool_calls_duplicate() instead of inline closures (eliminates code-path aliasing) - Remove unused regex import - Remove wrapper methods format_duration/format_timing_footer that just delegated to streaming module - call streaming::* directly Reduces lib.rs from 2945 to 2897 lines (-48 lines, -1.6%) All 159+ g3-core tests pass. Agent: fowler	2026-01-12 09:47:47 +05:30

1 2 3 4 5 ...

296 Commits