Remove vision tools (except take_screenshot) and macax tools
Vision tools removed: - extract_text (OCR from image files) - extract_text_with_boxes (OCR with bounding boxes) - vision_find_text (find text in app windows) - vision_click_text (find and click on text) - vision_click_near_text (click near text labels) macax tools removed: - macax_list_apps - macax_get_frontmost_app - macax_activate_app - macax_press_key - macax_type_text The LLM can now read images directly via read_image tool. take_screenshot is retained for capturing application windows. Files deleted: - crates/g3-core/src/tools/vision.rs - crates/g3-core/src/tools/macax.rs - docs/macax-tools.md Updated tool counts: 12 core + 15 webdriver = 27 total
This commit is contained in:
@@ -1,4 +1,4 @@
|
||||
//! Miscellaneous tools: final_output, take_screenshot, extract_text, code_coverage, code_search.
|
||||
//! Miscellaneous tools: final_output, take_screenshot, code_coverage, code_search.
|
||||
|
||||
use anyhow::Result;
|
||||
use tracing::debug;
|
||||
@@ -118,35 +118,6 @@ pub async fn execute_take_screenshot<W: UiWriter>(
|
||||
}
|
||||
}
|
||||
|
||||
/// Execute the `extract_text` tool.
|
||||
pub async fn execute_extract_text<W: UiWriter>(
|
||||
tool_call: &ToolCall,
|
||||
ctx: &ToolContext<'_, W>,
|
||||
) -> Result<String> {
|
||||
debug!("Processing extract_text tool call");
|
||||
|
||||
let controller = match ctx.computer_controller {
|
||||
Some(c) => c,
|
||||
None => {
|
||||
return Ok(
|
||||
"❌ Computer control not enabled. Set computer_control.enabled = true in config."
|
||||
.to_string(),
|
||||
)
|
||||
}
|
||||
};
|
||||
|
||||
let path = tool_call
|
||||
.args
|
||||
.get("path")
|
||||
.and_then(|v| v.as_str())
|
||||
.ok_or_else(|| anyhow::anyhow!("Missing path argument"))?;
|
||||
|
||||
match controller.extract_text_from_image(path).await {
|
||||
Ok(text) => Ok(format!("✅ Extracted text:\n{}", text)),
|
||||
Err(e) => Ok(format!("❌ Failed to extract text: {}", e)),
|
||||
}
|
||||
}
|
||||
|
||||
/// Execute the `code_coverage` tool.
|
||||
pub async fn execute_code_coverage<W: UiWriter>(
|
||||
tool_call: &ToolCall,
|
||||
|
||||
Reference in New Issue
Block a user