alex/g3 - g3 - Millerson GIT hosting

alex/g3

Author	SHA1	Message	Date
Dhanji R. Prasanna	2c411c058a	Compact single-line tool output for file operations and shell Implement compact display format for read_file, write_file, str_replace, and shell: - read_file/write_file/str_replace: Single line with dimmed summary and timing Format: ● tool_name \| path [range] \| summary \| tokens ◉ time - shell: Two-line format with command header and dimmed output Format: ● shell \| command └─ output (N lines) \| tokens ◉ time Changes: - Add print_tool_compact() method to UiWriter trait - Add is_shell_compact state tracking in ConsoleUiWriter - Add format_write_file_summary() and format_str_replace_summary() helpers - Fix duplicate response output by checking if response is empty before printing - Add finish_streaming_markdown() call before return to flush markdown buffer	2026-01-12 14:37:47 +05:30
Dhanji R. Prasanna	8d5dd9f84a	Merge sessions/hopper/1156b5c9	2026-01-12 11:53:14 +05:30
Dhanji R. Prasanna	5dfabaf19a	Add 72 integration tests for compaction, retry, tool execution, and error classification Agent: hopper Added 4 new test files with blackbox/characterization-style integration tests: - compaction_behavior_test.rs (14 tests): Token cap calculation, thinking mode disable logic, summary message building, CompactionResult behavior - retry_behavior_test.rs (17 tests): RetryConfig presets and customization, RetryResult state handling, retry_operation behavior with simulated errors - tool_execution_roundtrip_test.rs (16 tests): End-to-end tool execution through Agent interface for read_file, write_file, shell, str_replace, and TODO tools - error_classification_test.rs (25 tests): Recoverable vs non-recoverable error classification, retry delay calculation, edge cases and priority handling All tests follow integration-first philosophy: - Test through stable public interfaces - Assert observable behavior, not implementation details - Use characterization style to document current behavior - Enable refactoring by not encoding internal structure	2026-01-12 11:40:19 +05:30
Dhanji R. Prasanna	9e26d6bbf9	Fix black box artifact in context thinning status line Add ANSI clear-to-end-of-line escape sequence (\x1b[K]) after the reset code in the context thinning animation. This prevents leftover background color artifacts when the carriage return overwrites the line during the flash animation.	2026-01-12 11:39:20 +05:30
Dhanji R. Prasanna	33558bc092	Update project memory with new location documentation	2026-01-12 11:25:59 +05:30
Dhanji R. Prasanna	d508ddd508	Move project memory from .g3/ to analysis/ for version control Project memory is now stored at analysis/memory.md instead of .g3/memory.md. This change enables: - Shared memory across git worktrees (studio agent sessions) - Version-controlled memory that persists across clones - Memory changes tracked in git history and reviewable in PRs Changes: - crates/g3-core/src/tools/memory.rs: Update get_memory_path() to use analysis/ - crates/g3-cli/src/project_files.rs: Update read_project_memory() path - crates/g3-core/src/prompts.rs: Update documentation references (2 occurrences) - analysis/memory.md: Add memory file (copied from .g3/memory.md)	2026-01-12 10:20:33 +05:30
Dhanji R. Prasanna	21ecbb3fb8	Merge sessions/fowler/9b17499a	2026-01-12 10:15:19 +05:30
Dhanji R. Prasanna	6c2563cd07	Merge sessions/fowler/36f031d6	2026-01-12 10:14:09 +05:30
Dhanji R. Prasanna	30bb63715e	Fix studio status to show full markdown-formatted summary Changes: - Fix JSON path for session logs: now reads from context_window.conversation_history (with fallback to messages for backwards compatibility) - Remove 500-character truncation to show full summary - Add termimad dependency for terminal markdown rendering - Display summary with proper markdown formatting (headers, bold, code, lists) The extract_session_summary() function was looking for messages at the wrong JSON path. Session logs store conversation history at context_window.conversation_history, not at the top-level messages key.	2026-01-12 10:13:58 +05:30
Dhanji R. Prasanna	8df044ac13	refactor(g3-core): reduce lib.rs complexity by extracting utilities - Extract truncate_to_word_boundary() to utils.rs with tests - Consolidate duplicate detection: use streaming::are_tool_calls_duplicate() instead of inline closures (eliminates code-path aliasing) - Remove unused regex import - Remove wrapper methods format_duration/format_timing_footer that just delegated to streaming module - call streaming::* directly Reduces lib.rs from 2945 to 2897 lines (-48 lines, -1.6%) All 159+ g3-core tests pass. Agent: fowler	2026-01-12 09:47:47 +05:30
Dhanji R. Prasanna	3a0b656161	refactor(g3-cli): eliminate code-path aliasing in config and project content loading Consolidate duplicated logic into canonical shared functions: - Extract load_config_with_cli_overrides() to utils.rs - Was duplicated in lib.rs and accumulative.rs with subtle differences - lib.rs version had Chrome diagnostics + provider validation - accumulative.rs version was missing both - Now all callers use the complete canonical implementation - Extract combine_project_content() to project_files.rs - Was duplicated inline in lib.rs and agent_mode.rs - Simplified implementation using iterator flatten - Added unit tests for all cases This eliminates drift risk where the duplicated implementations could diverge over time (accumulative.rs was already missing Chrome diagnostics and provider validation). Agent: fowler	2026-01-12 08:57:49 +05:30
Dhanji R. Prasanna	6c17f269d7	Add studio tool for multi-agent workspace management Studio enables running multiple g3 agents concurrently without conflicts by using git worktrees for isolation. Features: - studio run --agent <name> [args...]: Create worktree, spawn g3, tail output - studio list: Show all active sessions - studio status <id>: Show session details and summary - studio accept <id>: Merge session branch to main and cleanup - studio discard <id>: Delete session without merging Each session gets: - Isolated worktree at .worktrees/sessions/<agent>/<session-id> - Dedicated branch: sessions/<agent>/<session-id> - Short UUID (8 chars) for easy reference - Automatic --workspace and --agent flags passed to g3	2026-01-12 07:26:17 +05:30
Dhanji R. Prasanna	02799a8e69	refactor(g3-core): extract streaming helpers and simplify cache control logic Readability improvements to g3-core/src/lib.rs: - Extract format_tool_arg_value() to streaming.rs for tool argument display - Extract format_read_file_summary() to streaming.rs for file read summaries - Add format_tool_output_summary() helper for consistent output formatting - Add get_provider_cache_control() helper to eliminate duplicated cache lookup - Simplify cache control logic in execute_single_task and stream_completion_with_tools - Add unit tests for all new streaming helpers Results: - lib.rs: 2979 → 2945 lines (34 lines saved) - streaming.rs: 305 → 379 lines (74 lines added as reusable, tested helpers) - All 155+ tests pass Agent: carmack	2026-01-12 07:21:40 +05:30
Dhanji R. Prasanna	f10374c925	Remove machine mode entirely from g3 - Delete machine_ui_writer.rs - Remove --machine CLI flag from cli_args.rs - Remove run_machine_mode(), run_interactive_machine(), run_autonomous_machine() functions - Remove handle_machine_command() function - Simplify OutputMode enum to just use SimpleOutput directly - Simplify SimpleOutput struct (remove machine_mode field) - Remove machine_mode parameter from setup_workspace_directory() - Remove test_machine_option_accepted test - Disable ACD by default in agent_mode (requires --acd flag) - Change 'memory checkpoint' message formatting - Remove dehydration status message	2026-01-12 06:01:31 +05:30
Dhanji R. Prasanna	b9cdb99557	refactor(g3-cli): break lib.rs into focused modules Extract 7 modules from the 2966-line lib.rs: - cli_args.rs (133 lines): CLI argument parsing with clap - autonomous.rs (785 lines): coach-player feedback loop - agent_mode.rs (284 lines): specialized agent execution - accumulative.rs (343 lines): iterative requirements mode - interactive.rs (851 lines): REPL with command handling - task_execution.rs (212 lines): unified retry logic - utils.rs (91 lines): display and workspace helpers Key improvements: - lib.rs reduced from 2966 to 415 lines (86% reduction) - Eliminated duplicate retry logic between execute_task and execute_task_machine - Each module has a single responsibility - Easier to reason about and maintain Agent: fowler	2026-01-12 05:35:08 +05:30
Dhanji R. Prasanna	14cc28d9ba	Include full task in ACD dehydration stub for forensics Added first_user_message field to Fragment struct that captures the full first user message (task) from the dehydrated conversation. This is now displayed at the top of the stub with a 📋 Task: prefix. Removed the Topics section from the stub since the full task provides better context for forensics and debugging. Agent: g3	2026-01-12 05:17:45 +05:30
Dhanji R. Prasanna	42a747e745	Move UTF-8 safety pattern from AGENTS.md to project memory The UTF-8 string slicing pattern is better suited as a remembered pattern in project memory rather than a static AGENTS.md section. This keeps AGENTS.md focused on codebase-specific invariants while the pattern remains accessible for reference. Agent: g3	2026-01-12 05:14:25 +05:30
Dhanji R. Prasanna	f415dbb84b	Fix ACD turn summary loss and add /dump command ACD (Aggressive Context Dehydration) fixes: - Fixed dehydrate_context() to extract turn summary from context window instead of using the passed-in final_response (which contained only the timing footer, not the actual LLM response) - Removed final_response parameter from dehydrate_context() since it now self-extracts the last assistant message as the summary - This ensures the actual turn summary is preserved after dehydration, not just the timing footer New /dump command: - Added /dump command to dump entire context window to tmp/ for debugging - Shows message index, role, kind, content length, and full content - Available in both console and machine modes UTF-8 safety: - Fixed truncate_to_word_boundary() to use character indices instead of byte indices, preventing panics on multi-byte UTF-8 characters - Added UTF-8 string slicing guidance to AGENTS.md Agent: g3	2026-01-12 05:13:02 +05:30
Dhanji R. Prasanna	ac17b95b24	fix(read_file): clamp end position instead of erroring when it exceeds file length When read_file is called with an end position beyond the file length, instead of returning an error that forces a retry, now clamps to the actual file length and returns the content with an informative message. This eliminates wasteful retry cycles where the LLM had to make a second request with the corrected end position.	2026-01-12 05:11:09 +05:30
Dhanji R. Prasanna	da63e79a13	Move read_file metadata to end of output Change read_file output format so the "🔍 N lines read" appears as the last line after the file content, not before it. This keeps the output cleaner with just one metadata line at the end.	2026-01-11 19:56:23 +05:30
Dhanji R. Prasanna	ed1c31dd70	Improve tool output formatting 1. str_replace: Show insertion/deletion counts with colors "✅ +N insertions \| -M deletions" (green/red) 2. write_file: Compact format with human-readable sizes "✅ wrote N lines \| Xk chars" 3. read_file: Cleaner format "🔍 N lines read" instead of "📄 File content (N lines)" 4. webdriver_quit: Show correct driver name (safaridriver vs chromedriver) 5. read_file: When start position exceeds file length, read last 100 chars with explanation instead of failing 6. shell: Remove redundant "Command failed:" prefix from error messages	2026-01-11 19:52:00 +05:30
Dhanji R. Prasanna	7c960875ef	Add hint to re-read memory from disk in system prompt Added note that agents can use read_file .g3/memory.md to refresh project memory if needed (e.g., after another agent updates it).	2026-01-11 19:40:02 +05:30
Dhanji R. Prasanna	9754c4ee66	Fix code fence closing without trailing newline When a code block ended without a trailing newline after the closing \`\`\`, two bugs occurred in flush_incomplete(): 1. The closing \`\`\` was included as part of the code block content (displayed with syntax highlighting) 2. The same \`\`\` was then emitted again as literal text because current_line was not cleared after being pushed to block_buffer The fix: - Check if current_line is the closing fence before adding to block_buffer - Always clear current_line after processing in the CodeBlock case Added two tests: - test_code_fence_after_blank_line: code fence with trailing newline - test_code_fence_no_trailing_newline: code fence without trailing newline	2026-01-11 19:34:46 +05:30
Dhanji R. Prasanna	bb25c7881a	Change agent mode header text From: 🤖 Running as agent: fowler To: >> agent mode \| fowler	2026-01-11 17:24:26 +05:30
Dhanji R. Prasanna	4962f439f3	Simplify agent mode working directory display Change from: 📁 Working directory: "/Users/dhanji/src/g3" To: -> ~/src/g3 Replaces home directory with ~ for cleaner output.	2026-01-11 17:20:26 +05:30
Dhanji R. Prasanna	f83ae7fd39	Add status line showing loaded context in agent mode Shows checkmarks for README, AGENTS.md, and Memory if loaded, or dots if not found. Displayed below the working directory line.	2026-01-11 17:13:32 +05:30
Dhanji R. Prasanna	2b87a89617	Revert "Add fancy ASCII art header for agent mode" This reverts commit `08747595a1`.	2026-01-11 17:12:32 +05:30
Dhanji R. Prasanna	08747595a1	Add fancy ASCII art header for agent mode The agent mode header now shows: - Agent name in uppercase with box art - Working directory (truncated if too long) - Status indicators for README, AGENTS.md, and Memory loading - Task preview if provided Also exports truncate_for_display and adds truncate_path_for_display helper functions in project_files module.	2026-01-11 17:11:14 +05:30
Dhanji R. Prasanna	2fbdac7aa9	Fix extra newlines before tool calls in JSON filter The JSON tool call filter was outputting newlines immediately as they were encountered. When the LLM output contained multiple newlines before a tool call, each newline was output before the tool call JSON was detected and suppressed, leaving orphaned blank lines in the output. Changes: - Add pending_newlines field to FilterState to buffer newlines at line start - First newline after content is output immediately, subsequent ones buffered - When tool call confirmed, pending_newlines cleared (suppressing extra blanks) - When not a tool call, pending_newlines output with the buffer - Add flush_json_tool_filter() to flush pending content at end of streaming - Update tests to reflect new behavior - Add tests for newline suppression behavior	2026-01-11 17:04:27 +05:30
Dhanji R. Prasanna	9509e51708	style: simplify auto-memory checkpoint message	2026-01-11 16:51:09 +05:30
Dhanji R. Prasanna	cf3727f50d	refactor(g3-cli): Extract focused modules from lib.rs for improved readability Extract three cohesive modules from the monolithic lib.rs (3188 -> 2785 lines): - metrics.rs (147 lines): Turn metrics tracking and histogram generation - TurnMetrics struct - format_elapsed_time() for human-readable durations - generate_turn_histogram() for performance visualization - Added unit tests for core functions - project_files.rs (181 lines): Project file reading utilities - read_agents_config() for AGENTS.md loading - read_project_readme() for README detection - read_project_memory() for .g3/memory.md - extract_readme_heading() for display - Added unit tests - coach_feedback.rs (129 lines): Coach feedback extraction from session logs - extract_from_logs() main entry point - Helper functions for log parsing and text extraction All modules have clear single responsibilities, improved documentation, and maintain identical behavior to the original inline functions. Agent: carmack	2026-01-11 16:41:41 +05:30
Dhanji R. Prasanna	83c9b5d434	Add integration blackbox tests for g3-core Adds 18 new integration tests covering: - Background process lifecycle (start, check running, kill, list) - Unified diff edge cases (multi-hunk, additions-only, deletions-only, CRLF normalization, range constraints, error handling) - Error classification boundaries (rate limit, server error, timeout, network error, context length exceeded, model busy, non-recoverable) These tests follow blackbox/integration-first principles: - Test through stable public interfaces - Do not encode internal implementation details - Focus on observable behavior - Enable refactoring without test breakage Agent: hopper	2026-01-11 16:32:59 +05:30
Dhanji R. Prasanna	9c71d12561	style: change agent mode tool color from royal blue to light gray	2026-01-11 16:26:20 +05:30
Dhanji R. Prasanna	874be7b459	refactor(core): collapse nested if statements per clippy Collapsed nested if statements that check related conditions into single conditions using &&. This improves readability by making the logical relationship between conditions explicit. Files changed: - feedback_extraction.rs: 3 instances of tool_use/final_output checks - tools/todo.rs: 1 instance of todo completion check Agent: fowler	2026-01-11 16:21:33 +05:30
Dhanji R. Prasanna	1c3de60bb9	refactor(core): simplify truncate_line() by merging identical branches The function had two branches that both returned line.to_string(): - when !should_truncate - when line.chars().count() <= max_width Merged into a single condition. Also updated format! to use inline variable syntax per clippy suggestion. Agent: fowler	2026-01-11 16:18:48 +05:30
Dhanji R. Prasanna	74a18794a0	fix: load AGENTS.md and memory in agent mode Agent mode was only loading README.md but not AGENTS.md or project memory (.g3/memory.md). This meant agents were missing important context that normal mode had access to. Now agent mode uses the same read_agents_config(), read_project_readme(), and read_project_memory() functions as normal mode, combining all three into the agent context.	2026-01-11 16:15:58 +05:30
Dhanji R. Prasanna	1d884251cb	refactor(cli): remove duplicate agent mode check in run() The same if-let block checking for agent mode was duplicated, causing dead code on the second check. Removed the duplicate. Agent: fowler	2026-01-11 16:14:50 +05:30
Dhanji R. Prasanna	4fb605fe7e	Update dependency analysis artifacts Refreshed static dependency analysis for the G3 codebase: - graph.json: 143 nodes (9 crates, 134 files), 189 edges - No cycles detected (DAG structure confirmed) - Top fan-in: g3-core (43), g3-providers (27), g3-config (16) - Top fan-out: g3-core/src/lib.rs (27), g3-cli/src/lib.rs (12) - 4-layer architecture: Foundation → Core → Services → Application Extraction method: Cargo.toml parsing + regex-based import analysis Limitations documented: internal crate imports, re-exports, conditional compilation Agent: euler	2026-01-11 16:11:01 +05:30
Dhanji R. Prasanna	cfd5d69cce	refactor: auto-enable auto-memory in agent mode Simplify auto-memory by always enabling it in agent mode instead of requiring the --auto-memory flag. This makes sense because: - Agent mode is non-interactive, so blocking is acceptable - Agents benefit from automatically saving discoveries to memory - Reduces flag complexity for users The --auto-memory flag still works for other modes if desired.	2026-01-11 15:56:27 +05:30
Dhanji R. Prasanna	1575cafc4b	fix: add --auto-memory support to agent mode The --auto-memory flag was not being passed to run_agent_mode() and send_auto_memory_reminder() was not being called after agent task execution. Changes: - Pass auto_memory parameter to run_agent_mode() - Add auto_memory parameter to run_agent_mode() function signature - Call agent.set_auto_memory(true) when flag is enabled - Call send_auto_memory_reminder() after execute_task() in agent mode	2026-01-11 08:03:46 +08:00
Dhanji R. Prasanna	280ae1fcbb	feat: add --auto-memory flag to prompt LLM to save discoveries Adds a new --auto-memory CLI flag that automatically sends a reminder to the LLM after each turn where tools were called, prompting it to call the remember tool if it discovered any key code locations. Changes: - Add auto_memory field and set_auto_memory() method to Agent - Add tool_calls_this_turn tracking in execute_tool_in_dir() - Add send_auto_memory_reminder() that sends reminder after tool use - Add --auto-memory CLI flag and wire it up in console/machine modes - Call send_auto_memory_reminder() in single-shot and interactive modes - Add visible status messages for auto-memory actions Fixes bug where tool calls were not being tracked when execute_tool_in_dir was called directly with working_dir=None.	2026-01-11 08:00:51 +08:00
Dhanji R. Prasanna	39918cf281	fix: process bold/italic/code formatting inside markdown headers The format_header() function was not calling format_inline_content() to process inline formatting like bold, italic, and `code` within headers. This caused raw markdown markers to appear in output. Added 4 tests to verify the fix: - test_bold_inside_header - test_italic_inside_header - test_code_inside_header - test_mixed_formatting_inside_header	2026-01-11 08:00:34 +08:00
Dhanji R. Prasanna	fc9a2f835a	Fix streaming markdown code fence detection bug The code fence (```) was not being properly detected during streaming, causing it to be rendered as inline code instead of a code block. Root cause: When buffering a code fence after seeing ```, the code was returning early for ALL characters including newlines. This meant handle_newline() was never called and block_state was never set to BlockState::CodeBlock. Fixes: - Don't return early for newlines when buffering code fence, allow them to fall through to handle_newline() - Support indented code fences (up to 3 spaces per CommonMark spec) by using trim_start() when checking for ``` at line start	2026-01-11 07:42:02 +08:00
Dhanji R. Prasanna	bf53b81af3	remember tool prompt tweak	2026-01-11 07:22:43 +08:00
Dhanji R. Prasanna	e731bc8217	Make remember tool instructions more imperative in system prompts - Change 'call remember' to 'you MUST call remember' in native prompt - Change 'IF you discovered' to 'ALWAYS...when you discovered' - Add explicit list of trigger tools (code_search, rg, grep, find, read_file) - Add reminder to Response Guidelines section - Add remember tool and Project Memory section to non-native prompt - Remove redundant console output from remember tool - Fix test compilation errors (missing summary parameter, temporary borrow)	2026-01-11 06:49:45 +08:00
Dhanji R. Prasanna	1090e30d6c	Simplify system prompt: remove coding style and parallel tool call sections - Remove IMPORTANT FOR CODING section (~1,500 chars of coding guidelines) - Remove <use_parallel_tool_calls> block (~500 chars) - Remove unused const_format dependency from g3-core - Simplify get_system_prompt_for_native() to just return base prompt - Response Guidelines now cleanly ends the static prompt Prompt reduced from ~8,500 to ~6,500 characters.	2026-01-11 06:35:18 +08:00
Dhanji R. Prasanna	33c1aba86e	Show human-readable descriptions in /resume session list - Add description field to SessionContinuation struct - Extract first user message (truncated to ~60 chars at word boundary) - Display as quoted text instead of session ID hash - Fall back to session ID if no description available Example: [2 hours ago] 'when I call /resume it only shows me 2 sessions...'	2026-01-11 06:22:20 +08:00
Dhanji R. Prasanna	3fcef587e8	Fix /resume to show all sessions and use human-readable timestamps - Change run_autonomous to return Agent instead of () so session continuation is properly saved in accumulative mode - Update format_session_time to show relative times ("2 hours ago", "yesterday") for recent sessions and dates for older ones - Handle Ctrl+C cancellation gracefully with informative message	2026-01-11 06:13:27 +08:00
Dhanji R. Prasanna	8926775acb	Add session continuation symlink fix and /resume command Fix session detection: - Add save_session_continuation() calls at all session exit points - Sessions now properly create .g3/session symlink for resume detection - Fixes issue where g3 wasn't offering to resume previous sessions Add /resume command: - New list_sessions_for_directory() to scan available sessions - New switch_to_session() method to safely switch between sessions - Shows numbered list with timestamps, context %, and TODO status - Saves current session before switching (can be resumed later) - Restores full context if <80% used, otherwise uses summary - Machine mode supports /resume and /resume <number> Documentation: - Add /clear and /resume to CONTROL_COMMANDS.md - Update /help output with new commands	2026-01-11 05:30:58 +08:00
Dhanji R. Prasanna	86709834e2	Improve research tool error reporting for scout agent failures When the scout agent fails (e.g., context window exhaustion), now: - Captures both stdout and stderr from the scout process - Detects context window exhaustion errors with specific patterns - Provides detailed, actionable error messages to the user - Shows suggestions for how to work around the issue - Includes technical details (exit code, error output) for debugging Handles two failure modes: 1. Scout agent exits with non-zero status 2. Scout agent exits successfully but doesn't produce valid report markers Both cases now surface clear error messages instead of cryptic failures.	2026-01-10 20:50:43 +11:00

1 2 3 4 5 ...

545 Commits