alex/g3 - g3 - Millerson GIT hosting

alex/g3

Author	SHA1	Message	Date
Dhanji R. Prasanna	cbced3390c	feat: JIT-injectable toolsets with load_toolset tool Implement dynamic tool loading system that allows tools to be loaded on-demand rather than included in the default set. Key changes: - Add toolsets module with registry of loadable toolsets - Add load_toolset tool that returns tool definitions for a named toolset - Add <available_toolsets> section to system prompt - Track loaded toolsets in Agent, extend tool definitions dynamically - Move webdriver (15 tools) to JIT-only loading Benefits: - Leaner default context (fewer tokens consumed) - On-demand loading when agent needs specialized tools - Extensible registry for future toolsets - Idempotent loading with helpful error messages Files: - crates/g3-core/src/toolsets.rs (new) - crates/g3-core/src/tools/toolsets.rs (new) - crates/g3-core/src/tool_definitions.rs - crates/g3-core/src/tool_dispatch.rs - crates/g3-core/src/prompts.rs - crates/g3-core/src/lib.rs - crates/g3-core/src/tools/executor.rs	2026-02-06 09:35:11 +11:00
Dhanji R. Prasanna	ff15db44c0	Restore research as first-class tool, remove research skill Restores the research tool that was previously externalized as a skill: - Add pending_research.rs: PendingResearchManager with thread-safe task tracking - Add tools/research.rs: execute_research (async), execute_research_status - Add research/research_status tool definitions with exclude_research config - Integrate PendingResearchManager into Agent and ToolContext - Inject completed research results in streaming loop Remove research skill: - Clear EMBEDDED_SKILLS array in embedded.rs - Delete skills/research/ directory - Update all tests expecting embedded research skill - Update docs and memory to reflect the change The research tool now: - Spawns scout agent in background tokio task - Returns immediately with research_id - Automatically injects results into conversation when ready - Supports status checks via research_status tool	2026-02-06 07:38:06 +11:00
Dhanji R. Prasanna	b673827076	Fix embedded skill loading: stop XML-escaping location paths The <location> field in the skills XML prompt was being XML-escaped, converting <embedded:research>/SKILL.md to <embedded:research>/SKILL.md. When the LLM tried to use read_file with this escaped path, it would fail. Changes: - Remove escape_xml() call from location field in prompt.rs - Add fallback handling for escaped paths in try_read_embedded_skill() - Add tests for both prompt generation and read_file handling Fixes embedded skill loading for agents like butler running outside the g3 repo.	2026-02-05 23:16:40 +11:00
Dhanji R. Prasanna	65b2ec368f	Add Action Envelope section back to native prompt Restored the Action Envelope instructions with a clear, complete example showing how to write envelope.yaml for rulespec verification.	2026-02-05 22:27:29 +11:00
Dhanji R. Prasanna	3823f8b5f3	Optimize native system prompt - 48% size reduction Removed redundant and vague content from prompts/system/native.md: - Simplified intro from 17 lines to 3 lines - Reduced Code Search section to one line - Removed duplicate Plan Mode example (kept one) - Removed Action Envelope section (rarely used correctly) - Removed verbose Memory Format details (tool description covers it) - Removed Response Guidelines (obvious to modern LLMs) Size: 8,620 chars -> 4,498 chars Also updated: - G3_IDENTITY_LINE constant for agent mode compatibility - Test assertions to check for new prompt markers - System prompt validation to use new marker string	2026-02-05 22:16:34 +11:00
Dhanji R. Prasanna	d978032044	Remove redundant AGENTS.md heading from startup output The loaded status line (✓ AGENTS.md ✓ Memory) already indicates that AGENTS.md was loaded, so the separate '>> AGENTS.md - Machine Instructions' heading line was redundant. - Remove print_project_heading() function from display.rs - Remove extract_project_heading call from interactive.rs - Clean up unused imports	2026-02-05 21:38:47 +11:00
Dhanji R. Prasanna	c6df75d886	Fix shell tool output line clipping to account for suffix The shell tool output line was wrapping because update_tool_output_line clipped the content without reserving space for the suffix that gets appended later (line count + timing info). Added suffix_overhead of 30 chars for shell tools to reserve space for: - " (9999 lines)" = ~13 chars - " \| 99999 ◉ 999ms" = ~17 chars This ensures the complete line fits within terminal width without wrapping.	2026-02-05 21:23:00 +11:00
Dhanji R. Prasanna	7e2d9bc22c	Enforce rulespec creation with plan_write for new plans Solves the tautology problem where the LLM would write invariants after implementation, making them match what was done rather than constrain it. Changes: - plan_write now accepts 'rulespec' parameter - New plans REQUIRE rulespec (fails with helpful error if missing) - Plan updates don't require rulespec (backward compatible) - Rulespec is parsed, validated, and written atomically with plan - Updated system prompt with clear examples for new vs update - Updated tool definition schema - Updated all affected tests New flow: task → plan+rulespec → user reviews BOTH → approve → implement	2026-02-05 21:12:02 +11:00
Dhanji R. Prasanna	085688479b	Improve terminal width responsiveness for tool output Clip summary text and other long fields to fit terminal width: - Clip display_summary in print_tool_compact (e.g., "47 lines (2.0k chars)") - Account for header_suffix length when compressing paths in print_tool_output_header - Clip TODO item lines in print_todo_compact - Clip plan item descriptions, evidence, touches, checks, and paths in print_plan_compact - Replace hardcoded 70/40 char limits with dynamic terminal-width-based clipping All clipping uses clip_line() which handles UTF-8 safely and adds ellipsis.	2026-02-05 20:44:12 +11:00
Dhanji R. Prasanna	19162b1fe6	Exit plan mode when plan is completed or blocked When a plan reaches a terminal state (all items done or blocked) in interactive mode, automatically exit plan mode and return to normal prompt. Changes: - Add Agent::is_plan_terminal() method to check if plan is complete - Add check_and_exit_plan_mode_if_terminal() helper in interactive.rs - Call the helper after each execute_user_input() to detect completion Fixes issue where plan mode prompt ' >> ' persisted after plan completion.	2026-02-05 20:31:24 +11:00
Dhanji R. Prasanna	30627bce97	feat(cli): make tool output responsive to terminal width - Add terminal_width module with get_terminal_width(), clip_line(), compress_path(), and compress_command() utilities - Update ConsoleUiWriter to use dynamic terminal width for all tool output - Tool output lines are clipped to fit without wrapping - Tool headers use semantic compression (paths preserve filename, commands clip from right) - 4-character right margin for visual clarity - Minimum 40 columns, default 80 when terminal size unavailable - All truncation is UTF-8 safe (char counting, not byte slicing) - Add 13 unit tests for terminal width utilities	2026-02-05 20:18:30 +11:00
Dhanji R. Prasanna	b2fbcf33d0	Fix plan approval gate and add "Create a plan:" prefix for first message - Fix build warnings: add #[allow(dead_code)] to unused deserialization fields - Fix plan approval gate bug: block file changes when no plan exists (not just when plan exists but is unapproved) - Add "Create a plan: " prefix to first user message in plan mode - Add prepare_plan_mode_input() helper function for testability - Reset is_first_plan_message flag when entering plan mode via /plan command - Add tests for approval gate (no plan + no changes, no plan + changes) - Add tests for prepare_plan_mode_input (happy, negative, boundary cases)	2026-02-05 19:43:38 +11:00
Dhanji R. Prasanna	06d75f613c	feat(plan): display rulespec.yaml and envelope.yaml in plan_read/plan_write output - Add format_envelope_markdown() function in invariants.rs for rich markdown formatting of ActionEnvelope facts - Add format_yaml_value_markdown() helper for recursive YAML value display - Update execute_plan_read() to append rulespec and envelope sections - Update execute_plan_write() to append envelope section alongside rulespec - Add 3 tests for format_envelope_markdown (empty, with facts, null values) When plan_read or plan_write is called, the output now includes: - Plan YAML (as before) - Rulespec section (if rulespec.yaml exists) with invariants grouped by source - Envelope section (if envelope.yaml exists) with facts in readable format Missing files show placeholder text rather than errors.	2026-02-05 19:08:55 +11:00
Dhanji R. Prasanna	bc5c1bdf61	Fix plan UI formatting to handle Vec<Check> and display elegantly - Update ChecksCompact to use Vec<CheckCompact> for negative/boundary fields - Add progress bar visualization showing done/doing/blocked/todo counts - Show evidence for done items, checks for active items - Display all negative and boundary checks (not just first) - Add proper tree structure with └/├ prefixes - Truncate long descriptions and evidence paths - Add file path display with 📄 icon	2026-02-05 14:38:18 +11:00
Dhanji R. Prasanna	e34f37fd47	Merge sessions/sdlc/3b6c6c3e into main Resolved conflicts: - analysis/memory.md: kept condensed documentation from incoming branch - crates/g3-core/src/skills/embedded.rs: removed unused HashMap import, kept better doc comment Additional fix: - crates/g3-core/src/prompts.rs: updated test to match current prompt file content	2026-02-05 14:38:08 +11:00
Dhanji R. Prasanna	307f04fa25	chore: Compress workspace memory after research externalization - Remove deleted code: pending_research.rs, tools/research.rs (externalized to skill) - Merge duplicate Agent Skills entries into unified section - Update SDLC state path: analysis/sdlc/ → .g3/sdlc/ - Remove G3Status.resuming() (deleted in `6228001`) - Tighten verbose descriptions throughout Metrics: 444 → 325 lines (-27%), 23.6k → 17.0k chars (-28%) Concepts preserved: all semantic information retained Agent: huffman	2026-02-05 14:29:48 +11:00
Dhanji R. Prasanna	74c2671e1b	docs: Update documentation for Agent Skills system Document the new Skills system introduced in recent commits: - docs/architecture.md: Add Skills System section with discovery priority, embedded skills, script extraction, and key types - docs/skills.md: New comprehensive guide covering SKILL.md format, discovery priority, embedded skills, research skill usage, and troubleshooting - README.md: Update Agent Skills section with correct priority order, add embedded skills info, research skill usage, and link to Skills Guide in Documentation Map - AGENTS.md: Add skill creation to Adding Features, skill extraction to Dangerous Code Paths, and new Skills System Entry Points section All documentation links validated - no broken links or orphan files. Agent: lamport	2026-02-05 14:26:26 +11:00
Dhanji R. Prasanna	cff32bf0ba	Make research skill self-contained without external scripts - Rewrite SKILL.md with inline instructions to spawn g3 --agent scout directly - Extend read_file to handle embedded skill paths (<embedded:name>/SKILL.md) - Remove scripts field from EmbeddedSkill struct (no longer needed) - Delete extraction.rs module (was only for script extraction) - Delete g3-research bash script - Remove obsolete Async Research Tool section from workspace memory Skills are now fully portable - they work when g3 is installed as a binary without access to source files. Agents can read embedded skill content via read_file with the special <embedded:...> path syntax.	2026-02-05 14:22:17 +11:00
Dhanji R. Prasanna	c3549ce043	refactor: Remove unused functions from skills module - Remove is_embedded_skill() from discovery.rs (unused) - Remove get_embedded_skills_map() from embedded.rs (unused) - Remove associated tests for deleted functions - Inline path check in test_repo_overrides_embedded test This eliminates dead code warnings and reduces module surface area without changing any behavior. Agent: fowler	2026-02-05 14:17:56 +11:00
Dhanji R. Prasanna	38da6a56ef	analysis: Update dependency graph for commits b6d2582..9443f933 Focused analysis on past 10 commits covering: - New skills module in g3-core (parser, discovery, prompt, embedded, extraction) - Research tool externalized to skills/research/ skill - SkillsConfig added to g3-config - SDLC pipeline state moved to .g3/sdlc/ Key findings: - 4 crates changed, 29 files affected (8 added, 2 deleted, 19 modified) - No dependency cycles detected - Clean DAG structure in new skills module - Cross-crate coupling via g3-core::skills and g3-config::SkillsConfig - Compile-time coupling to skills/research/ via include_str! Agent: euler	2026-02-05 14:02:44 +11:00
Dhanji R. Prasanna	788debb93a	remove cruft from system prompt	2026-02-05 14:01:26 +11:00
Dhanji R. Prasanna	68fd7b96c1	Remove accidental Emacs lock file	2026-02-05 14:01:03 +11:00
Dhanji R. Prasanna	6cb70f26fa	Fix empty Language-Specific Guidance header in system prompt When a Rust-only workspace was detected, the Language-Specific Guidance header was appearing with no content because Rust has an empty prompt string (agent-specific prompts handle Rust instead). The fix filters out empty prompt strings in get_language_prompts_for_workspace() so the header only appears when there's actual guidance content. Added test to verify Rust-only workspaces return None.	2026-02-05 14:00:52 +11:00
Dhanji R. Prasanna	9443f9333b	refactor: Remove hardcoded Web Research section from system prompt - Web Research instructions now come from skills/research/SKILL.md - Skills are dynamically loaded and injected via generate_skills_prompt() - Remove test_both_prompts_have_web_research test (no longer applicable) - Remove unused G3Status::research_complete() function This completes the externalization of research as a skill.	2026-02-05 13:41:53 +11:00
Dhanji R. Prasanna	0b308853a0	fix: Improve research skill with ANSI stripping and fallback extraction - Add strip_ansi() function using perl for comprehensive escape sequence removal - Add fallback extraction when scout doesn't output markers - Strip g3 UI elements (session banner, tool output chrome, auto-memory messages) - Reports are now clean plaintext without terminal formatting	2026-02-05 13:35:32 +11:00
Dhanji R. Prasanna	39e586982c	feat: Externalize research tool as embedded skill Replaces the built-in research/research_status tools with a portable skill-based approach: - Add embedded skills infrastructure (skills compiled into binary) - Add repo-local skills/ directory support (highest priority) - Create research skill with SKILL.md and g3-research shell script - Script extraction to .g3/bin/ with version tracking - Filesystem-based handoff via .g3/research/<id>/status.json - Remove PendingResearchManager and all research tool code - Update system prompt to reference skill instead of tool Benefits: - No special tool infrastructure needed (just shell + read_file) - Context-efficient (reports stay on disk until needed) - Crash-resilient (state persisted to filesystem) - Portable (skill can be overridden per-workspace) Breaking change: research tool calls now return a deprecation message pointing to the research skill.	2026-02-05 13:23:26 +11:00
Dhanji R. Prasanna	bf9e3dc878	Merge sessions/interactive/213d9910	2026-02-05 13:05:57 +11:00
Dhanji R. Prasanna	89c071baf6	fix: honor --resume flag when used with --agent --chat The --resume flag was being ignored when --agent and --chat flags were used together. The if-else chain checked for chat mode first and immediately returned None, skipping the --resume check entirely. Reordered the logic to check flags.resume first, ensuring explicit --resume is always honored regardless of other flags. Fixes: --resume not working with --agent --chat	2026-02-05 13:05:48 +11:00
Dhanji R. Prasanna	bc2860dd3a	studio sdlc: merge worktree on completion, move state to .g3/ - Add merge step before worktree cleanup when pipeline completes - On success with commits: merge to main, then cleanup - On failure: preserve worktree for debugging, print path - On merge conflict: preserve worktree, print resolution instructions - Move pipeline.json from analysis/sdlc/ to .g3/sdlc/ (gitignored)	2026-02-05 13:03:54 +11:00
Dhanji R. Prasanna	0e64f13a8a	Merge feature/agent-skills-support: Agent Skills specification support	2026-02-05 12:46:53 +11:00
Dhanji R. Prasanna	6228001bfc	Remove automatic session resume suggestion on startup - Remove the interactive prompt that asked users to resume in-progress sessions - Remove unused new_session parameter from run_interactive() - Remove unused info_inline() function from G3Status - Explicit --resume <session_id> flag still works	2026-02-05 12:40:27 +11:00
Dhanji R. Prasanna	8bbaf6f02e	Tighten system prompt and tool definitions Prompt changes (native.md): - Remove duplicate 'Temporary files' section - Consolidate 'remember' instructions into single authoritative location - Remove motivational 'Benefits' list from Plan Mode - Add 'Code Search Tool Selection' guidance (code_search vs rg) Tool changes (tool_definitions.rs, tool_dispatch.rs): - Remove screenshot tool (webdriver_screenshot remains) - Remove coverage tool - Reduce plan_write description from 22 lines to 1 line - Update tool count tests (16 -> 14 core tools) Net result: ~6 lines removed from prompt, ~56 lines removed from tool definitions, clearer tool selection guidance added.	2026-02-05 12:36:49 +11:00
Dhanji R. Prasanna	b6d25824f3	Tighten system prompt	2026-02-05 12:01:01 +11:00
Dhanji R. Prasanna	25ad198b83	Sync agent plan mode state on CLI startup CLI starts in plan mode by default (when not in agent mode), but was not calling agent.set_plan_mode(true) at initialization. This meant the gate check would not run until the user explicitly entered plan mode via /plan.	2026-02-05 11:47:38 +11:00
Dhanji R. Prasanna	b86901a86b	Merge sessions/interactive/47299e3b	2026-02-05 11:47:24 +11:00
Dhanji R. Prasanna	3d3f68e6da	Externalize native system prompt to markdown file - Move system prompt for native tool calling models to prompts/system/native.md - Use include_str! to embed at compile time - Remove concatenated SHARED_* string constants - Prompt is now readable/editable as a complete markdown document - Non-native prompt still uses Rust constants (acceptable for now)	2026-02-05 11:46:49 +11:00
Dhanji R. Prasanna	0f919237ea	Make plan approval gate only active in plan mode - Add in_plan_mode flag to Agent struct - Add set_plan_mode() and is_plan_mode() methods - Gate check now only runs when in_plan_mode is true - CLI calls set_plan_mode(true) on /plan command and EnterPlanMode - CLI calls set_plan_mode(false) on approval and CTRL-D exit - Update integration test to enable plan mode - Fix test YAML to use Vec<Check> for negative/boundary checks	2026-02-05 11:41:52 +11:00
Dhanji R. Prasanna	3d284b8b60	Merge sessions/interactive/179ac8a6	2026-02-05 11:37:07 +11:00
Dhanji R. Prasanna	1f1a517620	feat(plan): support multiple negative and boundary checks Change Plan Mode to allow multiple negative and boundary checks per item, while keeping happy path as a single check. Schema change: - checks.negative: Check -> Vec<Check> (>=1 required) - checks.boundary: Check -> Vec<Check> (>=1 required) - checks.happy: Check (unchanged, single) This better reflects real-world tasks where there are often multiple error conditions and edge cases worth tracking. Changes: - Update Checks struct to use Vec<Check> for negative/boundary - Update validation to require at least 1 of each - Update prompts and tool definitions with new array syntax - Add 4 new tests for multi-check scenarios	2026-02-05 11:36:45 +11:00
Dhanji R. Prasanna	41839b909e	Remove stray test file	2026-02-05 11:34:15 +11:00
Dhanji R. Prasanna	c347a73cbd	Add plan approval gate to block file changes without approved plan - Add check_plan_approval_gate() in tools/plan.rs that runs after each tool call - Detects file changes via git status --porcelain when plan exists but not approved - Reverts changes: git checkout for modified files, rm for new untracked files - Returns blocking message instructing LLM to create/approve plan first - Add ApprovalGateResult enum with Allowed/Blocked/NotGitRepo variants - Add set_session_id() and set_working_dir() methods on Agent for testing - Add integration test using MockProvider to simulate blocked write_file	2026-02-05 11:34:10 +11:00
Dhanji R. Prasanna	add8060526	Add studio sdlc command for SDLC maintenance pipeline Implements a pipeline that orchestrates 7 g3 agents in sequence: 1. euler - dependency graph and hotspots analysis 2. breaker - whitebox exploration and edge-case discovery 3. hopper - deep testing and regression integrity 4. fowler - refactoring to deduplicate and reduce complexity 5. carmack - in-place rewriting for readability and concision 6. lamport - human-readable documentation and validation 7. huffman - semantic compression of memory Features: - Commit cursor tracking (--from flag to set starting point) - Crash recovery (resumes from last incomplete stage) - Git worktree isolation for all pipeline work - Visual pipeline display with status icons - Summary generation saved to .g3/sessions/sdlc/ - Pipeline state persisted to analysis/sdlc/pipeline.json CLI: - studio sdlc run [-c N] [--from COMMIT] - studio sdlc status - studio sdlc reset Also adds huffman agent to embedded agents list.	2026-02-05 10:46:10 +11:00
Dhanji R. Prasanna	fdb1255f02	Add --resume <session-id> flag for explicit session resumption - Add --resume CLI flag that conflicts with --new-session - Add load_continuation_by_id() to load sessions by full or partial ID - Support loading from latest.json or falling back to session.json - Handle --resume in both normal and agent modes - Agent mode validates session belongs to correct agent	2026-02-05 10:23:39 +11:00
Dhanji R. Prasanna	3046f0dd6e	feat: Add invariants system for Plan Mode verification Adds rulespec.yaml and envelope.yaml support for machine-readable invariant checking during plan completion. - Add invariants module with Rulespec, ActionEnvelope, and evaluation logic - Add Invariants section to system prompt with workflow instructions - Show rulespec/envelope file status in plan verification output - Rulespec written during planning (captures constraints from task) - Envelope written after implementation (documents what was built)	2026-02-04 20:49:58 +11:00
Dhanji R. Prasanna	a5f6475603	feat: implement Agent Skills specification support Implements the Agent Skills specification (https://agentskills.io) for portable skill packages that give the agent new capabilities. Changes: - Add skills module with SKILL.md parser (YAML frontmatter + markdown body) - Implement skill discovery from ~/.g3/skills/, config extra_paths, and .g3/skills/ - Generate <available_skills> XML for system prompt injection - Add SkillsConfig to g3-config with enabled flag and extra_paths - Wire skills discovery into CLI startup - Add 29 unit tests for parser, discovery, and prompt generation - Update README with Agent Skills documentation Skill locations (priority order): 1. ~/.g3/skills/ (global) 2. Config extra_paths 3. .g3/skills/ (workspace, highest priority) At startup, g3 scans skill directories and injects a summary into the system prompt. When the agent needs a skill, it reads the full SKILL.md using the read_file tool.	2026-02-04 12:58:57 +11:00
Dhanji R. Prasanna	95d9847354	Update dependency analysis artifacts with detailed evidence - hotspots.md: Added specific dependent file lists for each hotspot - hotspots.md: Added cross-crate coupling points table - hotspots.md: Added crate-level coupling scores - limitations.md: Expanded coverage of unobservable patterns - limitations.md: Added confidence levels for inferences - limitations.md: Added extraction method details table Agent: euler	2026-02-02 17:20:15 +11:00
Dhanji R. Prasanna	263a838d31	Remove redundant 'No plan exists' message from plan_read output The UI already shows 'empty' via print_plan_compact, so returning an empty string avoids duplicate output.	2026-02-02 17:19:01 +11:00
Dhanji R. Prasanna	e332109273	Auto-approve plans in non-interactive (autonomous/one-shot) mode - Add auto-approval logic in execute_plan_write() when ctx.is_autonomous is true - Update system prompt to document auto-approval behavior - Plans still require explicit approval in interactive mode	2026-02-02 17:16:21 +11:00
Dhanji R. Prasanna	0aead8d86d	fix: Enable compact UI output for plan_approve tool Added plan_approve to the compact tool list in format_tool_result_summary() so it displays in the same format as other tools like read_file and write_file. The format_plan_approve_summary() function already existed but was never called because plan_approve was missing from the matches! block.	2026-02-02 17:06:10 +11:00
Dhanji R. Prasanna	f8448e5622	feat: Plan Mode interactive flow with approval shortcuts - Start g3 in plan mode with ' >>' prompt and welcome message - Add is_approval_input() to detect 'approve', 'a', 'yes', etc. and misspellings - Allow trailing punctuation (!, ., ,) on approval words - Call plan_approve tool directly without LLM when approval detected - Add synthetic assistant message after approval for LLM context - Exit plan mode after successful approval, return to 'g3>' prompt - CTRL-D in plan mode exits plan mode first, then exits g3 - /plan command enters plan mode and shows welcome message - Agent mode (--agent) does not start in plan mode - Add CommandResult enum to signal plan mode entry from commands	2026-02-02 16:59:52 +11:00

1 2 3 4 5 ...

790 Commits