Show HN: MCP Browser Agent – Autonomous Browser Automation for Claude Desktop

4 points by ivanimprv 2 months ago

Hey HN! Following up on MCPs for Claude, I've created another Model Context Protocol server that gives Claude Desktop autonomous browser automation capabilities.

The Browser Agent MCP lets Claude control a headful browser to navigate websites, capture screenshots, manipulate DOM elements, execute JavaScript, and perform API requests—all through natural language instructions.

Check out a demo here: https://www.youtube.com/watch?v=0lMsKiTy7TE Repository: https://github.com/imprvhub/mcp-browser-agent

With this MCP integration, you can: - Navigate to any URL with configurable load strategies - Capture full-page or element-specific screenshots - Perform precise DOM interactions (click, fill, select, hover) - Execute arbitrary JavaScript with console logs capture - Make HTTP requests (GET, POST, PUT, PATCH, DELETE) with headers and body

What makes this a true AI agent is its persistent browser session. Claude maintains state across multiple commands, captures console logs, stores screenshots as resources, and can chain operations for complex workflows.

It's particularly useful for web testing, form automation, data extraction, or creating visual documentation. The setup requires Node.js, Claude Desktop, and Playwright dependencies. The browser runs headful by default with chrome (you can change to firefox, chromium and more).

If you've been exploring MCP development or browser automation, I'd love to hear about your use cases. What other browser capabilities would enhance your AI Agents workflows?

The project is open source under Mozilla Public License 2.0, and I welcome contributions if you're interested in MCP development or want to extend the automation features.