scrape

The scrape skill provides an interface to the Bright Data CLI for extracting web content as markdown, HTML, JSON, or screenshots. It manages URL fetching, handles geo-targeting, and includes verification steps to ensure content quality and detect block pages.

10.6K
Installs
5
Use cases
5/10
Quality

Is scrape safe to install?

Review the source first

Review the source first: our audit of scrape's source files found 9 shell commands, 0 external URLs, file reads and writes (high risk). Every command and URL listed appears verbatim in the skill's source. The skill executes shell commands via the Bright Data CLI and writes output files to the local filesystem.

How we audit skills: our security review methodology.

Who is this skill for?

Developers and data analysts using the Bright Data platform to extract web content.

What can you do with it?

  • Fetch single web pages as markdown, HTML, or JSON.
  • Capture visual snapshots of websites.
  • Perform geo-targeted web scraping.
  • Process lists of URLs using parallel execution.
  • Extract content from paginated website listings.

How good is this skill?

Quality score: 5/10. The skill documentation is clear, provides specific command examples, and includes a robust verification and escalation process.

What does the skill file contain?

SKILL.md
# Bright Data — Scrape

Get clean content (markdown, HTML, JSON, screenshot) from one or more URLs via the Bright Data CLI. This skill owns the "fetch raw or lightly-structured content" job. For platform-specific structured data (Amazon, LinkedIn, TikTok, etc.), **stop and use `data-feeds` instead** — you'll get clean JSON without selector logic.

## Setup gate (run first)

Before any scrape, verify the CLI is installed and authenticated:

```bash
if ! command -v bdata >/dev/null 2>&1; then
    echo "bdata CLI not installed — see bright-data-best-practices/references/cli-setup.md"
elif ! bdata...

Frequently asked questions

When should I use this skill instead of data-feeds?

Use this skill for general web scraping. Use data-feeds for structured data from specific platforms like Amazon, LinkedIn, TikTok, Instagram, YouTube, or Reddit.

How does the skill handle blocked requests?

The skill verifies output for block page signatures like 'Access Denied' or 'captcha'. On failure, it suggests retrying with a different country code or escalating to the browser-based scraping tool.

What are the prerequisites for using this skill?

The skill requires the Bright Data CLI to be installed and authenticated via the bdata login command.

Data sourced from brightdata/skills on GitHub. Install counts from skills.sh. The summary and security audit are derived from the skill's source files: every command and URL listed appears verbatim in the source.

Related skills

find-skills

2.3M

Users seeking to extend agent capabilities with specialized tools, workflows, or knowledge packages

The find-skills skill enables agents to search for, discover, and install modular packages from the open agent skills ecosystem using the Skills CLI.

highclipackage-managervercel-labs

agent-browser

506.7K

AI agents and developers requiring programmatic web interaction, exploratory testing, or automation of Electron desktop applications

The agent-browser CLI provides browser automation for AI agents using Chrome or Chromium via CDP. It supports page navigation, form interaction, data extraction, and testing. The tool utilizes accessibility-tree snapshots and element references for interaction.

highbrowser-automationclivercel-labs

video-edit

338.7K

Users of the RunComfy CLI who need to automate video editing tasks like restyling, background swapping, or motion transfer

The video-edit skill acts as a router for the RunComfy CLI, selecting between Wan 2.7 Edit-Video, Kling 2.6 Pro Motion Control, and Lucy Edit Restyle models based on user intent to perform video transformations.

highvideo-editingai-agentagentspace-so

agentspace

324.0K

Developers and AI agent users who need to share, monitor, or collaborate on agent-generated files and workspaces in real-time

Agentspace provides a mechanism to share local agent files, logs, and artifacts via a live browser-accessible URL. It enables remote viewing, commenting, and editing of specified local paths through the ascli command-line tool.

highcollaborationfile-sharingagentspace-so