browser-act

Browser-act provides a command-line interface for AI agents to perform browser automation, including navigation, data extraction, form interaction, and session management.

74.5K

Installs

Use cases

5/10

Quality

Is browser-act safe to install?

Review the source first

Review the source first: our audit of browser-act's source files found 2 shell commands, 0 external URLs, file reads and writes (high risk). Every command and URL listed appears verbatim in the skill's source. The tool executes shell commands, manages browser profiles, and performs network requests. It requires user confirmation for sensitive operations like browser creation, login, and form submission.

How we audit skills: our security review methodology.

Who is this skill for?

AI agents and developers requiring browser-based automation, web scraping, or interaction with JavaScript-heavy websites.

What can you do with it?

Fetching and extracting rendered content from URLs
Handling interactive verification prompts and captchas
Managing authenticated sessions and browser profiles
Filling forms, clicking elements, and uploading files
Capturing screenshots and network traffic
Executing tasks across multiple browsers in parallel

How good is this skill?

Quality score: 5/10. The documentation is clear, provides specific installation instructions, and outlines necessary safety protocols for agent-based browser automation.

What does the skill file contain?

SKILL.md

# browser-act

Browser automation CLI for AI agents. Runs a full browser engine: navigation &
interaction, data extraction & network capture, screenshots, form automation,
multi-browser parallel operation, user-configured proxy support, and
human-agent collaboration.

### Features

- Lightweight extraction — fast JS-rendered content fetch without opening a browser session, advanced WebFetch/curl replacement
- Session management — multi-browser isolation, multi-account parallel operation
- Verification assistance — when automation encounters interactive challenges, assists completion...

Frequently asked questions

Where is my browser data stored?

All cookies, login sessions, page content, and browser profile data remain on the local machine.

How do I initialize the tool?

Run 'browser-act get-skills core --skill-version 2.0.2' to load operational directives and environment state.

Does the tool upload my data?

No, the tool processes data locally. The only outbound data is the captcha challenge image sent when the solve-captcha command is invoked.

Data sourced from browser-act/skills on GitHub. Install counts from skills.sh. The summary and security audit are derived from the skill's source files: every command and URL listed appears verbatim in the source.

Install

Works with Claude Code, Cursor, Codex CLI, and 50+ agents.

Manual install paths per agent

Claude Code: .claude/skills/ or ~/.claude/skills/

Codex CLI: .codex/skills/ or ~/.codex/skills/

GitHub Copilot: .github/skills/ or ~/.copilot/skills/

Amp: .agents/skills/ or ~/.config/agents/skills/

Copy the skill folder (with SKILL.md) into the project or global directory.

Security audit

high risk

Commands

URLs

Yes

File I/O

The tool executes shell commands, manages browser profiles, and performs network requests. It requires user confirmation for sensitive operations like browser creation, login, and form submission.

View commands and endpoints

$ uv tool install browser-act-cli --python 3.12

$ browser-act get-skills core --skill-version 2.0.2

Sourcebrowser-act/skills

Installs74.5K

Quality5/10

automation browser scraping cli agent-tools

Related skills

find-skills

2.3M

Users seeking to extend agent capabilities with specialized tools, workflows, or knowledge packages

The find-skills skill enables agents to search for, discover, and install modular packages from the open agent skills ecosystem using the Skills CLI.

highclipackage-managervercel-labs

agent-browser

506.7K

AI agents and developers requiring programmatic web interaction, exploratory testing, or automation of Electron desktop applications

The agent-browser CLI provides browser automation for AI agents using Chrome or Chromium via CDP. It supports page navigation, form interaction, data extraction, and testing. The tool utilizes accessibility-tree snapshots and element references for interaction.

highbrowser-automationclivercel-labs

video-edit

338.7K

Users of the RunComfy CLI who need to automate video editing tasks like restyling, background swapping, or motion transfer

The video-edit skill acts as a router for the RunComfy CLI, selecting between Wan 2.7 Edit-Video, Kling 2.6 Pro Motion Control, and Lucy Edit Restyle models based on user intent to perform video transformations.

highvideo-editingai-agentagentspace-so

agentspace

324.0K

Developers and AI agent users who need to share, monitor, or collaborate on agent-generated files and workspaces in real-time

Agentspace provides a mechanism to share local agent files, logs, and artifacts via a live browser-accessible URL. It enables remote viewing, commenting, and editing of specified local paths through the ascli command-line tool.

highcollaborationfile-sharingagentspace-so