alex/g3 - g3 - Millerson GIT hosting

alex/g3

Author	SHA1	Message	Date
Dhanji R. Prasanna	5e45e110e2	refactor(g3-core): extract finalize_streaming_turn() to unify return paths Extract a single canonical helper function for completing streaming turns, eliminating 3 nearly-identical return paths in stream_completion_with_tools(). Changes: - Add finalize_streaming_turn() helper that handles: - Finishing streaming markdown - Saving context window - Adding timing footer (when requested) - Dehydrating context (when ACD enabled) - Building TaskResult - Replace 3 duplicated return blocks with calls to the helper - Remove unused mut on full_response variable Results: - Function reduced from 1067 to 999 lines (-68 lines) - Eliminated code-path aliasing: 3 paths → 1 canonical path - All 32 characterization tests pass - Full g3-core test suite passes Agent: fowler	2026-01-13 16:52:48 +05:30
Dhanji R. Prasanna	b89d55a9ff	Add characterization tests for stream_completion_with_tools Add 32 blackbox characterization tests to lock down the behavior of the stream_completion_with_tools function (1067 lines) before refactoring. Tests cover key behaviors through stable boundaries: - StreamingToolParser: tool call detection, incomplete detection, text accumulation - Auto-continue logic: autonomous mode decisions, priority ordering - Duplicate detection: sequential duplicates, cross-message duplicates - Context window: token tracking, compaction threshold, history preservation - Tool execution: read_file, shell, write_file, todo tools through Agent - Streaming utilities: LLM token cleaning, duration formatting, truncation - Parser sanitization: inline tool pattern handling, homoglyph replacement These tests intentionally do NOT assert: - Internal parser state or implementation details - Specific timing values - UI output formatting - Provider-specific behavior Agent: hopper	2026-01-13 16:25:33 +05:30
Dhanji R. Prasanna	47e3a88cf6	refactor(g3-core): extract stats formatting to dedicated module Extract the get_stats() function (158 lines) from lib.rs to a new stats.rs module. Changes: - Create stats.rs with AgentStatsSnapshot struct for capturing agent state - Replace inline formatting logic with delegation to snapshot.format() - Add unit tests for stats formatting (empty and populated states) - Reduce lib.rs from 2961 to 2818 lines (-143 lines) The new module improves: - Testability: Stats formatting can now be unit tested in isolation - Separation of concerns: Formatting logic is decoupled from Agent struct - Readability: lib.rs is more focused on core agent behavior All 271 workspace tests pass. Agent: fowler	2026-01-13 16:11:53 +05:30
Dhanji R. Prasanna	82c0165765	Fix unused variable warning and UTF-8 panic in string slicing - Remove unused total_lines variable in file_ops.rs - Fix UTF-8 boundary panic in utils.rs when generating diff error preview The code was slicing at byte index 200 which could land inside a multi-byte character (e.g., box-drawing chars like ─). Now uses character-based slicing with chars().take() instead.	2026-01-13 14:52:52 +05:30
Dhanji R. Prasanna	118935d2da	Remove unused variable total_lines in file_ops.rs	2026-01-13 14:25:17 +05:30
Dhanji R. Prasanna	a09967eb27	refactor(streaming): Extract deduplication and auto-continue logic into helpers Improve readability of stream_completion_with_tools (~1000 line function): - Add deduplicate_tool_calls() helper with closure for previous-message check - Add should_auto_continue() with AutoContinueReason enum for clearer control flow - Replace inline deduplication loop with helper call (-19 lines) - Replace complex auto-continue conditional with match on reason enum (-13 lines) - Add section comments for major phases (State Init, Pre-loop, Main Loop, Auto-Continue, Post-Loop) - Add comprehensive tests for new helpers Net reduction: 82 deletions, behavior unchanged (172+ tests pass) Agent: carmack	2026-01-13 11:44:06 +05:30
Dhanji R. Prasanna	dc45987e8d	Add characterization tests for UTF-8 truncation and parser sanitization Agent: hopper Adds 32 new integration tests covering recent commits: ## UTF-8 Safe Truncation Tests (14 tests) Covers commit `f30f145` (Fix UTF-8 panics): - Topic extraction with emoji, CJK, and multi-byte characters - Truncation at character boundaries (not byte boundaries) - Edge cases: exactly 50 chars, 51 chars, 2-byte/3-byte/4-byte UTF-8 - Stub generation with multi-byte topics - Combining characters and diacritics ## Parser Sanitization Tests (18 tests) Covers commit `4c36cc0` (Prevent parser poisoning): - Code block contexts (inline code, after fences, prose) - Line boundary edge cases (empty lines, whitespace, indentation) - Unicode handling (emoji, bullets, CJK before patterns) - Multiple patterns on same line - Negative cases (similar but different patterns, partial patterns) - Real-world scenarios from the original bug report All tests are blackbox/characterization style - they test observable outputs through stable public interfaces without encoding internal implementation details.	2026-01-13 11:22:46 +05:30
Dhanji R. Prasanna	8dcb7a3dba	feat: add compact styled output for TODO tools TODO tools (todo_read, todo_write) now display with a cleaner, more compact format: - Styled header: " ● todo_read" or " ● todo_write" - Tree-style prefixes for content lines (│ and └) - Checkbox conversion: "- [ ]" → □, "- [x]" → ■ - Dimmed content for visual distinction - No timing footer (cleaner output) Changes: - Add print_todo_compact() method to UiWriter trait - Implement print_todo_compact() in ConsoleUiWriter - Update todo.rs to call print_todo_compact() instead of line-by-line output - Skip tool header, output header, and timing for TODO tools in agent streaming	2026-01-13 10:58:55 +05:30
Dhanji R. Prasanna	4c36cc058c	fix: prevent parser poisoning from inline tool-call JSON patterns When the streaming parser encountered fragments of JSON that looked like partial tool calls (e.g., {"tool":) embedded in inline text (like code examples or prose), it would incorrectly enter JSON parsing mode and poison the parser state, causing control to be returned to the user mid-task. This fix: - Adds sanitize_inline_tool_patterns() to detect tool-call patterns that are NOT on their own line and replace the opening brace with a Unicode homoglyph (fullwidth left curly bracket U+FF5B) - Integrates sanitization into process_chunk() before text is buffered - Updates system prompts to instruct LLMs to use homoglyphs when showing example tool call JSON in prose - Adds comprehensive tests for the sanitization logic Real tool calls from LLMs always appear on their own line, so those are left untouched. Only inline patterns (with non-whitespace before them) are sanitized.	2026-01-13 10:58:41 +05:30
Dhanji R. Prasanna	a0b9126555	Revert "refactor(g3-core): extract streaming logic to agent_streaming.rs" This reverts commit `a2e51cf075`.	2026-01-13 07:59:18 +05:30
Dhanji R. Prasanna	6907fa36c0	UI: Add newline before auto-memory skip message	2026-01-13 07:03:42 +05:30
Dhanji R. Prasanna	a2e51cf075	refactor(g3-core): extract streaming logic to agent_streaming.rs Reduce lib.rs complexity by extracting the streaming completion logic: - Extract stream_completion_with_tools (~1080 lines) to agent_streaming.rs - Extract stream_with_retry helper method - Extract parse_diff_stats helper function - Add handle_pre_stream_compaction helper for cleaner pre-stream logic - Add format_tool_output helper for tool output formatting - Remove 3 unused constructor variants: - new_with_readme - new_autonomous_with_readme - new_with_quiet Results: - lib.rs reduced from 2974 to 1791 lines (40% reduction) - Streaming logic cleanly separated into dedicated module - All tests pass, no behavior changes Agent: fowler	2026-01-13 06:14:56 +05:30
Dhanji R. Prasanna	f30f145c85	Fix UTF-8 panics and inconsistent retry logic - Fix 7 UTF-8 byte slicing panics that crash on multi-byte characters: - acd.rs: extract_topic_from_text() [..50] slice - streaming.rs: log_stream_error() [..500] slice - tools/acd.rs: rehydrate message truncation [..2000] slice - history.rs: git commit message truncation [..69] slice - planner.rs: commit summary/description truncation [..69] slices - llm.rs: requirements summary line truncation [..117] slice - All now use chars().count() and chars().take(N).collect() for UTF-8 safe truncation - Fix inconsistent retry logic in task_execution.rs: - Previously only retried on Timeout errors - Now retries on ALL recoverable errors (rate limits, network, server errors, model busy, token limits, context length) - Added error-specific base delays (rate limit: 5s, server: 2s, etc.) - Added exponential backoff with ±20% jitter - Consistent with autonomous mode retry behavior	2026-01-13 05:49:45 +05:30
Dhanji R. Prasanna	6f50d01ab6	Add comprehensive end-of-turn behavior tests for g3-core Agent: hopper Adds 56 new integration tests covering the observable end-of-turn behaviors in the streaming module: - Timing footer formatting (5 tests): verifies user-facing timing display with various durations, token counts, and context percentages - Tool call duplicate detection (6 tests): ensures identical sequential tool calls are detected while different tools/args are not - Empty response detection (9 tests): validates detection of empty, whitespace-only, and timing-only responses that trigger auto-continue - Connection error classification (5 tests): verifies EOF, connection, chunk, and body errors are correctly identified for graceful recovery - Tool output summary formatting (17 tests): covers read_file, write_file, str_replace, remember, screenshot, coverage, and rehydrate summaries - Duration formatting (4 tests): milliseconds, seconds, minutes, zero - Text truncation (4 tests): short/long strings, multiline, flag behavior - LLM token cleaning (3 tests): removal of stop tokens like <\|im_end\|> - Edge cases (4 tests): empty inputs, unicode handling, large numbers All tests are blackbox/characterization style - they test observable outputs through stable public interfaces without encoding internal implementation details. Tests remain stable under refactoring that preserves behavior.	2026-01-12 21:17:32 +05:30
Dhanji R. Prasanna	d164c97ad2	Fix multi-line error messages in compact tool output The truncate_for_display() function now takes only the first line of input before truncating. This prevents multi-line error messages (like str_replace failures) from breaking the compact single-line format. Added tests for multi-line input handling.	2026-01-12 20:55:05 +05:30
Dhanji R. Prasanna	1b051aad94	Fix write_file compact summary to show actual line/char counts The write_file compact display was showing 1 line because it was counting lines in the success message, not the actual written content. Now parses the tool result (e.g. '✅ wrote 150 lines \| 4.2k chars') to extract and display the correct counts. Added format_write_file_result() to parse the tool output.	2026-01-12 20:32:54 +05:30
Dhanji R. Prasanna	6f3530544d	Fix compact tool failure display to use single-line format When compact tools (read_file, write_file, str_replace, etc.) failed, they would fall through to the non-compact output path, causing: - Missing or incorrect headers - Stray footers with wrong formatting - State leakage (is_shell_compact) between tool calls Now failed compact tools display in the same single-line format as successful ones, just with a truncated error message instead of the success summary: ● read_file \| path/to/file.txt \| ❌ Failed to read file... \| 123 ◉ 0ms This keeps the UI consistent and avoids the "stray footer" bug.	2026-01-12 20:02:08 +05:30
Dhanji R. Prasanna	78516722df	Remove accidentally committed legacy logs/ directories	2026-01-12 18:20:20 +05:30
Dhanji R. Prasanna	c2aa80647a	Remove legacy logs/ directory, consolidate all data under .g3/ This change removes the legacy logs/ directory and consolidates all session data, error logs, and discovery files under the .g3/ directory. New directory structure: - .g3/sessions/<session_id>/session.json - session logs - .g3/errors/ - error logs (was logs/errors/) - .g3/background_processes/ - background process logs - .g3/discovery/ - planner discovery files (was workspace/logs/) Changes: - paths.rs: Remove get_logs_dir()/logs_dir(), add get_errors_dir(), get_background_processes_dir(), get_discovery_dir() - session.rs: Anonymous sessions now use .g3/sessions/anonymous_<ts>/ - error_handling.rs: Errors now saved to .g3/errors/ - project.rs: Remove logs_dir() and ensure_logs_dir() methods - feedback_extraction.rs: Remove logs_dir field and fallback logic - planner: Use .g3/ for workspace data and .g3/discovery/ for reports - flock.rs: Look for session metrics in .g3/sessions/ - coach_feedback.rs: Remove fallback to logs/ path - Update all tests to use new paths - Update README.md and .gitignore	2026-01-12 18:20:08 +05:30
Dhanji R. Prasanna	43a5d27149	Add compact format for remember, take_screenshot, code_coverage, rehydrate Extend compact single-line output to additional tools: - remember: shows '📝 memory updated (size)' - take_screenshot: shows '📸 path' - code_coverage: shows '📊 report generated' - rehydrate: shows '🔄 restored fragment_id' Tools without file_path argument use simplified format: ● tool_name \| summary \| tokens ◉ time	2026-01-12 14:45:50 +05:30
Dhanji R. Prasanna	2c411c058a	Compact single-line tool output for file operations and shell Implement compact display format for read_file, write_file, str_replace, and shell: - read_file/write_file/str_replace: Single line with dimmed summary and timing Format: ● tool_name \| path [range] \| summary \| tokens ◉ time - shell: Two-line format with command header and dimmed output Format: ● shell \| command └─ output (N lines) \| tokens ◉ time Changes: - Add print_tool_compact() method to UiWriter trait - Add is_shell_compact state tracking in ConsoleUiWriter - Add format_write_file_summary() and format_str_replace_summary() helpers - Fix duplicate response output by checking if response is empty before printing - Add finish_streaming_markdown() call before return to flush markdown buffer	2026-01-12 14:37:47 +05:30
Dhanji R. Prasanna	5dfabaf19a	Add 72 integration tests for compaction, retry, tool execution, and error classification Agent: hopper Added 4 new test files with blackbox/characterization-style integration tests: - compaction_behavior_test.rs (14 tests): Token cap calculation, thinking mode disable logic, summary message building, CompactionResult behavior - retry_behavior_test.rs (17 tests): RetryConfig presets and customization, RetryResult state handling, retry_operation behavior with simulated errors - tool_execution_roundtrip_test.rs (16 tests): End-to-end tool execution through Agent interface for read_file, write_file, shell, str_replace, and TODO tools - error_classification_test.rs (25 tests): Recoverable vs non-recoverable error classification, retry delay calculation, edge cases and priority handling All tests follow integration-first philosophy: - Test through stable public interfaces - Assert observable behavior, not implementation details - Use characterization style to document current behavior - Enable refactoring by not encoding internal structure	2026-01-12 11:40:19 +05:30
Dhanji R. Prasanna	d508ddd508	Move project memory from .g3/ to analysis/ for version control Project memory is now stored at analysis/memory.md instead of .g3/memory.md. This change enables: - Shared memory across git worktrees (studio agent sessions) - Version-controlled memory that persists across clones - Memory changes tracked in git history and reviewable in PRs Changes: - crates/g3-core/src/tools/memory.rs: Update get_memory_path() to use analysis/ - crates/g3-cli/src/project_files.rs: Update read_project_memory() path - crates/g3-core/src/prompts.rs: Update documentation references (2 occurrences) - analysis/memory.md: Add memory file (copied from .g3/memory.md)	2026-01-12 10:20:33 +05:30
Dhanji R. Prasanna	8df044ac13	refactor(g3-core): reduce lib.rs complexity by extracting utilities - Extract truncate_to_word_boundary() to utils.rs with tests - Consolidate duplicate detection: use streaming::are_tool_calls_duplicate() instead of inline closures (eliminates code-path aliasing) - Remove unused regex import - Remove wrapper methods format_duration/format_timing_footer that just delegated to streaming module - call streaming::* directly Reduces lib.rs from 2945 to 2897 lines (-48 lines, -1.6%) All 159+ g3-core tests pass. Agent: fowler	2026-01-12 09:47:47 +05:30
Dhanji R. Prasanna	02799a8e69	refactor(g3-core): extract streaming helpers and simplify cache control logic Readability improvements to g3-core/src/lib.rs: - Extract format_tool_arg_value() to streaming.rs for tool argument display - Extract format_read_file_summary() to streaming.rs for file read summaries - Add format_tool_output_summary() helper for consistent output formatting - Add get_provider_cache_control() helper to eliminate duplicated cache lookup - Simplify cache control logic in execute_single_task and stream_completion_with_tools - Add unit tests for all new streaming helpers Results: - lib.rs: 2979 → 2945 lines (34 lines saved) - streaming.rs: 305 → 379 lines (74 lines added as reusable, tested helpers) - All 155+ tests pass Agent: carmack	2026-01-12 07:21:40 +05:30
Dhanji R. Prasanna	f10374c925	Remove machine mode entirely from g3 - Delete machine_ui_writer.rs - Remove --machine CLI flag from cli_args.rs - Remove run_machine_mode(), run_interactive_machine(), run_autonomous_machine() functions - Remove handle_machine_command() function - Simplify OutputMode enum to just use SimpleOutput directly - Simplify SimpleOutput struct (remove machine_mode field) - Remove machine_mode parameter from setup_workspace_directory() - Remove test_machine_option_accepted test - Disable ACD by default in agent_mode (requires --acd flag) - Change 'memory checkpoint' message formatting - Remove dehydration status message	2026-01-12 06:01:31 +05:30
Dhanji R. Prasanna	14cc28d9ba	Include full task in ACD dehydration stub for forensics Added first_user_message field to Fragment struct that captures the full first user message (task) from the dehydrated conversation. This is now displayed at the top of the stub with a 📋 Task: prefix. Removed the Topics section from the stub since the full task provides better context for forensics and debugging. Agent: g3	2026-01-12 05:17:45 +05:30
Dhanji R. Prasanna	f415dbb84b	Fix ACD turn summary loss and add /dump command ACD (Aggressive Context Dehydration) fixes: - Fixed dehydrate_context() to extract turn summary from context window instead of using the passed-in final_response (which contained only the timing footer, not the actual LLM response) - Removed final_response parameter from dehydrate_context() since it now self-extracts the last assistant message as the summary - This ensures the actual turn summary is preserved after dehydration, not just the timing footer New /dump command: - Added /dump command to dump entire context window to tmp/ for debugging - Shows message index, role, kind, content length, and full content - Available in both console and machine modes UTF-8 safety: - Fixed truncate_to_word_boundary() to use character indices instead of byte indices, preventing panics on multi-byte UTF-8 characters - Added UTF-8 string slicing guidance to AGENTS.md Agent: g3	2026-01-12 05:13:02 +05:30
Dhanji R. Prasanna	ac17b95b24	fix(read_file): clamp end position instead of erroring when it exceeds file length When read_file is called with an end position beyond the file length, instead of returning an error that forces a retry, now clamps to the actual file length and returns the content with an informative message. This eliminates wasteful retry cycles where the LLM had to make a second request with the corrected end position.	2026-01-12 05:11:09 +05:30
Dhanji R. Prasanna	da63e79a13	Move read_file metadata to end of output Change read_file output format so the "🔍 N lines read" appears as the last line after the file content, not before it. This keeps the output cleaner with just one metadata line at the end.	2026-01-11 19:56:23 +05:30
Dhanji R. Prasanna	ed1c31dd70	Improve tool output formatting 1. str_replace: Show insertion/deletion counts with colors "✅ +N insertions \| -M deletions" (green/red) 2. write_file: Compact format with human-readable sizes "✅ wrote N lines \| Xk chars" 3. read_file: Cleaner format "🔍 N lines read" instead of "📄 File content (N lines)" 4. webdriver_quit: Show correct driver name (safaridriver vs chromedriver) 5. read_file: When start position exceeds file length, read last 100 chars with explanation instead of failing 6. shell: Remove redundant "Command failed:" prefix from error messages	2026-01-11 19:52:00 +05:30
Dhanji R. Prasanna	7c960875ef	Add hint to re-read memory from disk in system prompt Added note that agents can use read_file .g3/memory.md to refresh project memory if needed (e.g., after another agent updates it).	2026-01-11 19:40:02 +05:30
Dhanji R. Prasanna	bb25c7881a	Change agent mode header text From: 🤖 Running as agent: fowler To: >> agent mode \| fowler	2026-01-11 17:24:26 +05:30
Dhanji R. Prasanna	4962f439f3	Simplify agent mode working directory display Change from: 📁 Working directory: "/Users/dhanji/src/g3" To: -> ~/src/g3 Replaces home directory with ~ for cleaner output.	2026-01-11 17:20:26 +05:30
Dhanji R. Prasanna	f83ae7fd39	Add status line showing loaded context in agent mode Shows checkmarks for README, AGENTS.md, and Memory if loaded, or dots if not found. Displayed below the working directory line.	2026-01-11 17:13:32 +05:30
Dhanji R. Prasanna	9509e51708	style: simplify auto-memory checkpoint message	2026-01-11 16:51:09 +05:30
Dhanji R. Prasanna	83c9b5d434	Add integration blackbox tests for g3-core Adds 18 new integration tests covering: - Background process lifecycle (start, check running, kill, list) - Unified diff edge cases (multi-hunk, additions-only, deletions-only, CRLF normalization, range constraints, error handling) - Error classification boundaries (rate limit, server error, timeout, network error, context length exceeded, model busy, non-recoverable) These tests follow blackbox/integration-first principles: - Test through stable public interfaces - Do not encode internal implementation details - Focus on observable behavior - Enable refactoring without test breakage Agent: hopper	2026-01-11 16:32:59 +05:30
Dhanji R. Prasanna	874be7b459	refactor(core): collapse nested if statements per clippy Collapsed nested if statements that check related conditions into single conditions using &&. This improves readability by making the logical relationship between conditions explicit. Files changed: - feedback_extraction.rs: 3 instances of tool_use/final_output checks - tools/todo.rs: 1 instance of todo completion check Agent: fowler	2026-01-11 16:21:33 +05:30
Dhanji R. Prasanna	1c3de60bb9	refactor(core): simplify truncate_line() by merging identical branches The function had two branches that both returned line.to_string(): - when !should_truncate - when line.chars().count() <= max_width Merged into a single condition. Also updated format! to use inline variable syntax per clippy suggestion. Agent: fowler	2026-01-11 16:18:48 +05:30
Dhanji R. Prasanna	280ae1fcbb	feat: add --auto-memory flag to prompt LLM to save discoveries Adds a new --auto-memory CLI flag that automatically sends a reminder to the LLM after each turn where tools were called, prompting it to call the remember tool if it discovered any key code locations. Changes: - Add auto_memory field and set_auto_memory() method to Agent - Add tool_calls_this_turn tracking in execute_tool_in_dir() - Add send_auto_memory_reminder() that sends reminder after tool use - Add --auto-memory CLI flag and wire it up in console/machine modes - Call send_auto_memory_reminder() in single-shot and interactive modes - Add visible status messages for auto-memory actions Fixes bug where tool calls were not being tracked when execute_tool_in_dir was called directly with working_dir=None.	2026-01-11 08:00:51 +08:00
Dhanji R. Prasanna	bf53b81af3	remember tool prompt tweak	2026-01-11 07:22:43 +08:00
Dhanji R. Prasanna	e731bc8217	Make remember tool instructions more imperative in system prompts - Change 'call remember' to 'you MUST call remember' in native prompt - Change 'IF you discovered' to 'ALWAYS...when you discovered' - Add explicit list of trigger tools (code_search, rg, grep, find, read_file) - Add reminder to Response Guidelines section - Add remember tool and Project Memory section to non-native prompt - Remove redundant console output from remember tool - Fix test compilation errors (missing summary parameter, temporary borrow)	2026-01-11 06:49:45 +08:00
Dhanji R. Prasanna	1090e30d6c	Simplify system prompt: remove coding style and parallel tool call sections - Remove IMPORTANT FOR CODING section (~1,500 chars of coding guidelines) - Remove <use_parallel_tool_calls> block (~500 chars) - Remove unused const_format dependency from g3-core - Simplify get_system_prompt_for_native() to just return base prompt - Response Guidelines now cleanly ends the static prompt Prompt reduced from ~8,500 to ~6,500 characters.	2026-01-11 06:35:18 +08:00
Dhanji R. Prasanna	33c1aba86e	Show human-readable descriptions in /resume session list - Add description field to SessionContinuation struct - Extract first user message (truncated to ~60 chars at word boundary) - Display as quoted text instead of session ID hash - Fall back to session ID if no description available Example: [2 hours ago] 'when I call /resume it only shows me 2 sessions...'	2026-01-11 06:22:20 +08:00
Dhanji R. Prasanna	3fcef587e8	Fix /resume to show all sessions and use human-readable timestamps - Change run_autonomous to return Agent instead of () so session continuation is properly saved in accumulative mode - Update format_session_time to show relative times ("2 hours ago", "yesterday") for recent sessions and dates for older ones - Handle Ctrl+C cancellation gracefully with informative message	2026-01-11 06:13:27 +08:00
Dhanji R. Prasanna	8926775acb	Add session continuation symlink fix and /resume command Fix session detection: - Add save_session_continuation() calls at all session exit points - Sessions now properly create .g3/session symlink for resume detection - Fixes issue where g3 wasn't offering to resume previous sessions Add /resume command: - New list_sessions_for_directory() to scan available sessions - New switch_to_session() method to safely switch between sessions - Shows numbered list with timestamps, context %, and TODO status - Saves current session before switching (can be resumed later) - Restores full context if <80% used, otherwise uses summary - Machine mode supports /resume and /resume <number> Documentation: - Add /clear and /resume to CONTROL_COMMANDS.md - Update /help output with new commands	2026-01-11 05:30:58 +08:00
Dhanji R. Prasanna	86709834e2	Improve research tool error reporting for scout agent failures When the scout agent fails (e.g., context window exhaustion), now: - Captures both stdout and stderr from the scout process - Detects context window exhaustion errors with specific patterns - Provides detailed, actionable error messages to the user - Shows suggestions for how to work around the issue - Includes technical details (exit code, error output) for debugging Handles two failure modes: 1. Scout agent exits with non-zero status 2. Scout agent exits successfully but doesn't produce valid report markers Both cases now surface clear error messages instead of cryptic failures.	2026-01-10 20:50:43 +11:00
Dhanji R. Prasanna	60aeb67c56	Add stealth mode for Chrome headless to evade bot detection Implements comprehensive anti-detection measures: - Override navigator.webdriver to return undefined - Inject fake chrome.runtime, chrome.loadTimes, chrome.csi objects - Add realistic plugins and mimeTypes arrays - Patch permissions API to hide automation - Set realistic navigator properties (languages, hardwareConcurrency, deviceMemory) - Remove ChromeDriver-specific window properties (cdc_*) - Patch Function.prototype.toString to hide modifications - Add Chrome flags: --disable-blink-features=AutomationControlled - Set realistic user-agent without HeadlessChrome identifier - Exclude 'enable-automation' switch Tested against bot detection sites: - bot.sannysoft.com: All major tests pass - Search engines: Works with DuckDuckGo, Yahoo, Brave, Startpage - Still detected by: Google reCAPTCHA, Cloudflare Turnstile, Bing	2026-01-10 20:34:14 +11:00
Dhanji R. Prasanna	6be0a03c4c	Fix timing footer being saved to context window The timing footer (e.g., ⏱️ 19.4s \| 💭 4.7s) was being saved to the conversation history as a separate assistant message. This happened because stream_completion_with_tools returns the timing footer in TaskResult.response for display, but the caller was also saving it to context. Fix: Strip the timing footer (identified by \n\n⏱️) before saving to context window. The timing footer remains display-only. Also includes: - Research tool blank line fix: only add visual separator for research tool output, not all tools - Research tool webdriver propagation: pass parent's webdriver browser choice (Safari vs Chrome headless) to scout subprocess	2026-01-10 15:55:59 +11:00
Dhanji R. Prasanna	68c9135913	Fix research tool UI: remove duplicate header, add footer spacing, remove spinner, widen command display - Remove duplicate tool header (lib.rs already prints it) - Add newline before timing footer for visual separation - Remove spinner animation (incompatible with update_tool_output_line) - Change shell command format to " > `cmd` ..." with 60 char width	2026-01-10 15:20:40 +11:00

1 2 3 4 5 ...

366 Commits