Tools Overview
Explore 30+ MCP tools for browser automation
🛠️ 30+ MCP Tools
AIPex provides comprehensive automation capabilities through 30+ specialized MCP (Model Context Protocol) tools. Each tool is designed to handle specific browser automation tasks with AI-powered intelligence.
Tool Categories
🗂️ Tab Management (8 tools)
Complete tab control and navigation across all browser windows.
📄 Page Content & Interaction (14 tools)
Content extraction, analysis, and page interaction capabilities.
⬇️ Downloads & Files (4 tools)
Download control and file management automation.
📸 Screenshots (3+ tools)
Visual capture and screenshot management tools.
Quick Reference
🗂️ Tab Management Tools
| Tool | Description | Use Case |
|---|---|---|
get_all_tabs | Get all open tabs across all windows | Tab inventory, bulk operations |
get_current_tab | Get information about the active tab | Current context awareness |
switch_to_tab | Switch to a specific tab by ID | Navigation automation |
create_new_tab | Create a new tab with specified URL | Multi-page workflows |
get_tab_info | Get detailed information about a tab | Tab metadata extraction |
duplicate_tab | Duplicate an existing tab | Quick page duplication |
close_tab | Close a specific tab | Cleanup automation |
get_current_tab_content | Get visible text content | Content analysis |
📄 Page Content & Interaction Tools
| Tool | Description | Use Case |
|---|---|---|
get_page_metadata | Get page metadata (title, description, keywords) | SEO analysis, page info |
extract_page_text | Extract text content with word count | Content extraction |
get_page_links | Get all links from the current page | Link analysis, scraping |
search_page_text | Search for text on the current page | Find specific content |
get_interactive_elements | Get all interactive elements (optimized) | Element discovery |
click_element | Click an element using CSS selector | Interaction automation |
summarize_page | Summarize page content with key points | Content analysis |
fill_input | Fill an input field with text | Form automation |
clear_input | Clear the content of an input field | Form reset |
get_input_value | Get the current value of an input field | Form validation |
submit_form | Submit a form using CSS selector | Form submission |
get_form_elements | Get all form elements and input fields | Form analysis |
scroll_to_element | Scroll to a DOM element and center it | Navigation |
highlight_element | Permanently highlight DOM elements | Visual marking |
⬇️ Downloads & Files Tools
| Tool | Description | Use Case |
|---|---|---|
download_text_as_markdown | Download text content as markdown file | Content export |
download_image | Download an image from base64 data | Image saving |
download_chat_images | Download multiple images from chat | Batch download |
download_current_chat_images | Download all images from current AI chat | Chat export |
📸 Screenshot Tools
| Tool | Description | Use Case |
|---|---|---|
capture_screenshot | Capture screenshot of current visible tab | Visual documentation |
capture_tab_screenshot | Capture screenshot of a specific tab by ID | Multi-tab capture |
capture_screenshot_to_clipboard | Capture screenshot and save to clipboard | Quick sharing |
Tool Usage Examples
Tab Management Example
// Get all open tabs
const tabs = await get_all_tabs();
// Switch to a specific tab
await switch_to_tab(tabId);
// Create a new tab
await create_new_tab("https://example.com");Page Interaction Example
// Extract page text
const text = await extract_page_text();
// Click an element
await click_element("button.submit");
// Fill a form
await fill_input("input[name='email']", "user@example.com");
await submit_form("form#contact");Download Example
// Download page content as markdown
await download_text_as_markdown(content, "page-content.md");
// Download an image
await download_image(base64Data, "screenshot.png");Screenshot Example
// Capture current tab
await capture_screenshot();
// Capture specific tab
await capture_tab_screenshot(tabId);
// Copy to clipboard
await capture_screenshot_to_clipboard();Natural Language Commands
Instead of using tools directly, you can use natural language commands:
# Tab management
"Close all tabs except the current one"
"Switch to the tab with 'GitHub' in the title"
"Open a new tab with google.com"
# Page interaction
"Click the submit button"
"Fill the email field with user@example.com"
"Scroll to the footer"
# Data extraction
"Get all links from this page"
"Extract the main content"
"Summarize this article"
# Screenshots
"Take a screenshot of the entire page"
"Capture this element"Tool Performance
All tools are optimized for performance:
- ⚡ Fast Execution: Most tools complete in < 100ms
- 🎯 Accurate: AI-powered element detection
- 🔄 Reliable: Automatic retry and error handling
- 📊 Scalable: Handle large pages and many tabs
Advanced Features
Batch Operations
Execute multiple tools in sequence:
// Research workflow
await create_new_tab("https://example.com");
const text = await extract_page_text();
const summary = await summarize_page();
await download_text_as_markdown(summary, "research.md");Error Handling
All tools include built-in error handling:
try {
await click_element("button.submit");
} catch (error) {
console.error("Element not found:", error);
}Next Steps
Tab Management Tools
Learn about all 8 tab management tools
Page Interaction Tools
Explore 14+ page content and interaction tools
Natural Language Guide
Master natural language commands
API Reference
Complete API documentation for developers
Pro Tip: Combine multiple tools to create powerful automation workflows. The AI will automatically chain tools together based on your natural language commands.
AIPex Docs