+
Skip to content

AIPexStudio/AIPex

Repository files navigation

🤖 AIPex - AI-Powered Browser Automation Extension

Automate your browser with natural language commands - The open source browser-use solution

Chrome Web Store GitHub stars GitHub forks GitHub issues GitHub pull requests License: MIT TypeScript React

**⭐ Star this repo if you find it helpful! ⭐**

Chrome Web Store

🤖 What is AIPex?

AIPex is a revolutionary Chrome extension that transforms your browser into an intelligent automation platform. Using natural language commands and AI-powered intelligence, AIPex can automate virtually any browser task - from complex multi-step workflows to simple repetitive actions.

📊 Comparison with Similar Projects

AIPex vs Dia/Comet vs Playwright-based Solutions

Comparison Dimension AIPex Comet (Dia) Playwright-based MCP Server
Resource Usage ✅ No need to launch independent browser process, directly utilizes user's already open Chrome browser ⚠️ Need to download and install Dia (another browser) ❌ Requires launching independent browser process, installing Playwright dependencies, downloading browser binaries, etc.
Transfer Cost ✅ No, use your current browser now, no need to transfer ⚠️ Yes, you need to adapt to new browser ❌ Requires launching browser process
User Session Reuse ✅ Automatically uses existing login state ✅ Automatically uses existing login state ❌ Requires re-login
Browser Environment ✅ Fully preserves user environment ✅ Fully preserves user environment ❌ Clean environment lacks user settings
API Access ✅ Full Browser API Access ⚠️ Limited Browser API Access ⚠️ Limited to Playwright API
Startup Process ✅ Quick, only need to install extension ⚠️ Slow, need to download and install ❌ Requires launching browser process
LLM Token Cost ✅ Low, optimized tools and agent loop ✅ Low ⚠️ Medium to High
Speed ✅ Quick ✅ Quick ⚠️ 50-200ms inter-process communication
User Knowledge ✅ Low, click extension then you can use, no need any technical background ⚠️ Need to adapt to new browser ⚠️ Requires technical knowledge
Chat to Multiple Tabs ✅ Yes, you can use tabs, bookmarks, history to talk ⚠️ Yes, but you can only choose tabs ❌ Limited multi-tab support
Action Coverage ✅ Full, covers almost 100% of browser automation scenarios ⚠️ Partially covered, and you can't apply actions in Dia ⚠️ Limited to Playwright capabilities
BYOK (Bring Your Own Key) ✅ AIPex supports using your own LLM token ❌ No ✅ Yes

🎯 Why Choose AIPex for Browser Automation?

  • 🧠 Natural Language Control: Command your browser in plain English - no coding required
  • 🤖 AI-Powered Intelligence: 30+ MCP tools that understand context and adapt to your needs
  • Multi-Step Automation: Execute complex workflows with single commands
  • 🔄 Universal Compatibility: Works with any website - no special setup needed
  • 📊 Smart Data Extraction: Automatically collect and organize information from web pages
  • 🎯 Precision Actions: Click, fill, scroll, and interact with elements using AI vision
  • 📝 Form Automation: Fill out forms, submit data, and handle complex interactions
  • 🖼️ At Tabs: Read & Act with Multiple tabs
  • 🔧 Developer Friendly: Open source with extensive API for custom automation
  • 🚀 Lightning Fast: Execute automation tasks in seconds, not minutes

✨ Core Automation Features

📊 Intelligent Data Extraction

  • Smart Content Analysis: Extract structured data from any webpage
  • Price Monitoring: Track prices across multiple e-commerce sites
  • Research Automation: Gather information from multiple sources automatically

Data Extraction

🎯 Precision Element Interaction

  • Visual Element Detection: AI can see and interact with page elements
  • Form Automation: Fill out complex forms with intelligent field mapping
  • Dynamic Content Handling: Adapt to changing page layouts and content

Element Interaction

📝 Content Processing & Analysis

  • Text Highlighting & Summarization: Automatically highlight and summarize important content
  • Document Processing: Extract and organize information from web documents
  • Smart Note-Taking: Capture and organize insights from web browsing

Content Processing

🗂️ Advanced Tab & Window Management

  • AI-Powered Organization: Automatically group and organize tabs by topic
  • Smart Tab Switching: Find and switch between tabs using natural language
  • Multi-Window Coordination: Manage complex workflows across multiple browser windows

Tab Management

🚀 Getting Started

Quick Start

  1. Install from Chrome Web Store (Recommended)

  2. Open AIPex

    • Press ⌘+M (Mac) or Ctrl+M (Windows/Linux)
    • Or click the AIPex icon in your toolbar

🛠️ Development & Contributing

We love contributions! Here's how you can help make AIPex even better:

📖 For detailed development setup, build instructions, and contribution guidelines, please see DEVELOPMENT.md

📊 Tool Categories Overview

🗂️ Tab Management - 8 tools

Complete tab control and navigation:

  • get_all_tabs - Get all open tabs across all windows
  • get_current_tab - Get information about the currently active tab
  • switch_to_tab - Switch to a specific tab by ID
  • create_new_tab - Create a new tab with the specified URL
  • get_tab_info - Get detailed information about a specific tab
  • duplicate_tab - Duplicate an existing tab
  • close_tab - Close a specific tab
  • get_current_tab_content - Get the visible text content of the current tab
📄 Page Content & Interaction - 14 tools

Content extraction, analysis, and page interaction:

  • get_page_metadata - Get page metadata including title, description, keywords
  • extract_page_text - Extract text content with word count and reading time
  • get_page_links - Get all links from the current page
  • search_page_text - Search for text on the current page
  • get_interactive_elements - Get all interactive elements (links, buttons, inputs) with optimized performance
  • click_element - Click an element using CSS selector
  • summarize_page - Summarize page content with key points
  • fill_input - Fill an input field with text
  • clear_input - Clear the content of an input field
  • get_input_value - Get the current value of an input field
  • submit_form - Submit a form using CSS selector
  • get_form_elements - Get all form elements and input fields
  • scroll_to_element - Scroll to a DOM element and center it
  • highlight_element - Permanently highlight DOM elements
  • highlight_text_inline - Highlight specific words or phrases within text
⬇️ Downloads & Files - 4 tools

Download control and file management:

  • download_text_as_markdown - Download text content as markdown file
  • download_image - Download an image from base64 data
  • download_chat_images - Download multiple images from chat messages
  • download_current_chat_images - Download all images from current AI chat
📸 Screenshots - 3 tools

Visual capture and screenshot management:

  • capture_screenshot - Capture screenshot of current visible tab
  • capture_tab_screenshot - Capture screenshot of a specific tab by ID
  • capture_screenshot_to_clipboard - Capture screenshot and save to clipboard
🔧 Advanced Features - 3+ tools

Advanced browser automation and utilities:

  • Additional specialized tools for enhanced browser control
  • AI-powered content analysis and processing
  • Custom automation workflows

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Support & Community

🏆 Contributors

Thank you to all the amazing contributors who help make AIPex better:


ropzislaw

56 commits

Codexiaoyi

10 commits

buttercannfly

5 commits

Total Contributors: 3 | Total Commits: 71


Want to contribute? Check out our Contributing Guide and help make AIPex even better!

🌟 Star History

Star History Chart


Made with ❤️ by the AIPex Team

GitHub Chrome Web Store

About

AIPex: agentic assistant in your browser, automate your browser using natural language. Comet Alternative

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载