me.openbrowser/openbrowser-ai
AI browser automation. Write async Python to navigate, click, type, and extract data.
★ 212MITai-ml
Install
Config snippet generator goes here (5 client tabs)
README
# OpenBrowser
**Automating Walmart Product Scraping:**
https://github.com/user-attachments/assets/c517c739-9199-47b0-bac7-c2c642a21094
**OpenBrowserAI Automatic Flight Booking:**
https://github.com/user-attachments/assets/632128f6-3d09-497f-9e7d-e29b9cb65e0f
[](https://pypi.org/project/openbrowser-ai/)
[](https://pepy.tech/projects/openbrowser-ai)
[](https://www.python.org/downloads/)
[](https://opensource.org/licenses/MIT)
[](https://github.com/billy-enrizky/openbrowser-ai/actions)
[](https://codecov.io/gh/billy-enrizky/openbrowser-ai)
<!-- mcp-name: me.openbrowser/openbrowser-ai -->
**AI-powered browser automation using CodeAgent and CDP (Chrome DevTools Protocol)**
OpenBrowser is a framework for intelligent browser automation. It combines direct CDP communication with a CodeAgent architecture, where the LLM writes Python code executed in a persistent namespace, to navigate, interact with, and extract information from web pages autonomously.
## Table of Contents
- [Documentation](#documentation)
- [Key Features](#key-features)
- [Installation](#installation)
- [Quick Start](#quick-start)
- [Configuration](#configuration)
- [Supported LLM Providers](#supported-llm-providers)
- [Claude Code Plugin](#claude-code-plugin)
- [Codex](#codex)
- [OpenCode](#opencode)
- [OpenClaw](#openclaw)
- [MCP Server](#mcp-server)
- [Benchmark: Token Efficiency](#benchmark-token-efficiency)
- [CLI Usage](#cli-usage)
- [Project Structure](#project-structure)
- [Backend and Frontend Deployment](#backend-and-frontend-deployment)
- [Testing](#testing)
- [Contributing](#contributing)
- [License](#license)
- [Contact](#contact)
## Documentation
**Full documentation**: [https://docs.openbrowser.me](https://docs.openbrowser.me)
## Key Features
- **CodeAgent Architecture** - LLM writes Python code in a persistent Jupyter-like namespace for browser automation
- **Raw CDP Communication** - Direct Chrome DevTools Protocol for maximum control and speed
- **Vision Support** - Screenshot analysis for visual understanding of pages
- **12+ LLM Providers** - OpenAI, Anthropic, Google, Groq, AWS Bedrock, Azure OpenAI, Ollama, and more
- **MCP Server** - Model Context Protocol support for Claude Desktop integration
- **CLI Daemon** - Persistent browser daemon with `-c` flag for direct code execution from Bash
- **Video Recording** - Record browser sessions as video files
## Installation
### Quick install (macOS / Linux)
```bash
curl -fsSL https://raw.githubusercontent.com/billy-enrizky/openbrowser-ai/main/install.sh | sh
```
### Quick install (Windows PowerShell)
```powershell
irm https://raw.githubusercontent.com/billy-enrizky/openbrowser-ai/main/install.ps1 | iex
```
Detects `uv`, `pipx`, or `pip` and installs OpenBrowser automatically.
Install to `~/.local/bin` without sudo:
```bash
curl -fsSL https://raw.githubusercontent.com/billy-enrizky/openbrowser-ai/main/install.sh | sh -s -- --local
```
### Homebrew (macOS / Linux)
```bash
brew tap billy-enrizky/openbrowser
brew install openbrowser-ai
```
### pip
```bash
pip install openbrowser-ai
```
### uv (recommended)
```bash
uv pip install openbrowser-ai
```
### uvx (zero install)
Run directly without installing -- `uvx` downloads and caches the package automatically:
```bash
# MCP server mode
uvx openbrowser-ai --mcp
# CLI daemon mode
uvx openbrowser-ai -c "await navigate('https://example.com')"
```
### pipx
```bash
pipx install openbrowser-ai
```
### From source
```bash
git clone https://github.com/billy-enrizky/openbrowser-ai.git
cd openbrowser-ai
uv pip install -e ".[agent]"
```
### Optional Dependencies
```bash
pip install openbrowser-ai[agent] # LLM agent support (langgraph, langchain, litellm)
pip install openbrowser-ai[all] # All LLM providers
pip install openbrowser-ai[anthropic] # Anthropic Claude
pip install openbrowser-ai[groq] # Groq
pip install openbrowser-ai[ollama] # Ollama (local models)
pip install openbrowser-ai[aws] # AWS Bedrock
pip install openbrowser-ai[azure] # Azure OpenAI
pip install openbrowser-ai[video] # Video recording support
```
> **No separate browser install needed.** OpenBrowser auto-detects any installed Chromium-based browser (Chrome, Edge, Brave, Chromium) and uses it directly. If none is found and `uvx` is available, Chromium is installed automatically on first run. To pre-install manually (requires `uvx`): `openbrowser-ai install`
## Quick Start
### Basic Usage
```python
import asyncio
from openbrowser import Code