browser-screenshot

The browser-screenshot skill captures focused, cropped images of specific web page regions. It uses a research-first workflow to identify target content, navigates via a browser-automation tool, locates elements using DOM selectors, and processes images with ImageMagick to ensure clean, professional outputs.

Installs: 1.5K

Quality: 9/10

Is browser-screenshot safe to install?

Review the source first. Our audit of browser-screenshot's source files found 13 shell commands, 7 external URLs, and local file reads and writes, which together rate as high risk. Every command and URL listed appears verbatim in the skill's source. The skill executes shell commands to control a browser and manipulate image files. It performs network requests to navigate to arbitrary URLs and reads and writes local files for screenshot storage and processing.

Who is this skill for?

Users who need to generate high-quality, focused screenshots of web content for articles, reports, or documentation without including distracting UI elements like navigation bars or sidebars.

What can you do with it?

Capturing a specific Reddit post or tweet for social media analysis
Extracting a data table or chart from a technical blog post
Saving a clean visual of a product hero section or model card
Documenting API specifications from official documentation pages

How good is this skill?

Quality score: 9/10. The skill documentation is highly structured and provides clear, actionable workflows. It includes specific safety checks like Retina display handling and pre-flight URL validation. The reliance on external tools like ImageMagick is clearly defined.

What does the skill file contain?

SKILL.md

# Skill: Browser Screenshot

Take focused screenshots of specific regions on web pages — a Reddit post, a tweet, an article section, a chart, etc. — not just a full-page dump.

> **Prerequisite**: agent-browser must be installed and Chrome must have remote debugging enabled. See `references/agent-browser-setup.md` if unsure.

---

## Overview

This skill handles the full pipeline:

1. **Research** the best page to screenshot (web search, fetch)
2. **Navigate** to the right page in the browser
3. **Locate** the target element/region on the page
4. **Capture** a focused, cropped screenshot of ju...

Frequently asked questions

Why should I use WebSearch before opening the browser?

The browser is for capturing, not browsing. Running WebSearch and WebFetch first lets the agent confirm that a page actually contains the needed information, which saves time by avoiding unnecessary browser navigation.

How does the skill handle Retina displays?

The skill detects the scale factor by comparing viewport size to actual image dimensions. It then multiplies the bounding box coordinates by the scale factor before cropping to ensure the final image is accurate.
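
The scale-factor arithmetic described above can be sketched in a few lines. This is a minimal illustration, not the skill's own code: the `Box` type and the `scale_factor` and `crop_geometry` helpers are hypothetical names, and the sketch assumes the bounding box is reported in CSS pixels while the screenshot is measured in device pixels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Bounding box in CSS pixels (hypothetical type for this sketch)."""
    x: float
    y: float
    width: float
    height: float


def scale_factor(viewport_width: int, image_width: int) -> float:
    """Ratio of screenshot pixels to viewport pixels (2.0 on a typical Retina display)."""
    if viewport_width <= 0:
        raise ValueError("viewport_width must be positive")
    return image_width / viewport_width


def crop_geometry(box: Box, scale: float) -> str:
    """Scale the box into image pixels and format it as an ImageMagick WxH+X+Y geometry."""
    width = round(box.width * scale)
    height = round(box.height * scale)
    x = max(0, round(box.x * scale))
    y = max(0, round(box.y * scale))
    return f"{width}x{height}+{x}+{y}"


scale = scale_factor(viewport_width=1280, image_width=2560)
print(crop_geometry(Box(x=100, y=50, width=400, height=300), scale))
```

The resulting string could then be passed to ImageMagick's `-crop` option (for example `magick full.png -crop 800x600+200+100 +repage cropped.png`). The exact ImageMagick invocation the skill uses is not shown in this excerpt, so treat that command line as an assumption.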

What should I do if the target element is hard to find?

The skill provides a fallback method using JS-injected highlighting. It draws a red border around the candidate element, takes a screenshot for visual confirmation, and allows for adjustment before the final crop.
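
As a rough sketch of the highlighting fallback, the helper below builds a JavaScript expression that marks a candidate element in red. The `highlight_js` name is hypothetical, and the sketch uses a CSS `outline` rather than a `border` so the page layout does not shift; the skill's actual injected script may differ.

```python
import json


def highlight_js(selector: str, color: str = "red", width_px: int = 3) -> str:
    """Build a JS expression that outlines the first element matching `selector`.

    The expression evaluates to true if an element was found, false otherwise.
    """
    # json.dumps yields a safely quoted JavaScript string literal.
    sel = json.dumps(selector)
    style = json.dumps(f"{width_px}px solid {color}")
    return (
        "(() => {"
        f" const el = document.querySelector({sel});"
        " if (!el) return false;"
        f" el.style.outline = {style};"
        " return true;"
        " })()"
    )


print(highlight_js("article[data-testid='tweet']"))
```

The generated expression is intended to be handed to the `eval` command listed in the security audit, followed by a screenshot for visual confirmation. Passing it as a single argv element (rather than through a shell string) avoids quoting problems.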

Data sourced from zc277584121/marketing-skills on GitHub. Install counts from skills.sh. The summary and security audit are derived from the skill's source files: every command and URL listed appears verbatim in the source.

Install

Works with Claude Code, Cursor, Codex CLI, and 50+ agents.

Manual install paths per agent

Claude Code: .claude/skills/ or ~/.claude/skills/

Codex CLI: .codex/skills/ or ~/.codex/skills/

GitHub Copilot: .github/skills/ or ~/.copilot/skills/

Amp: .agents/skills/ or ~/.config/agents/skills/

Copy the skill folder (with SKILL.md) into the project or global directory.
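
The manual copy step can be scripted. The following is a minimal sketch under stated assumptions: `install_skill` is a hypothetical helper, and the destination root is whichever of the directories listed above applies to your agent.

```python
import shutil
from pathlib import Path


def install_skill(skill_dir: Path, skills_root: Path) -> Path:
    """Copy a skill folder (which must contain SKILL.md) into a skills directory."""
    if not (skill_dir / "SKILL.md").is_file():
        raise FileNotFoundError(f"{skill_dir} does not contain SKILL.md")
    dest = skills_root / skill_dir.name
    # dirs_exist_ok lets a re-run overwrite an earlier copy (Python 3.8+).
    shutil.copytree(skill_dir, dest, dirs_exist_ok=True)
    return dest


# Example: project-level install for Claude Code.
# install_skill(Path("browser-screenshot"), Path(".claude/skills"))
```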

Security audit

Risk level: high

Commands: 13

URLs: 7

File I/O: Yes

Commands and endpoints:

$ agent-browser --auto-connect tab list

$ agent-browser --auto-connect open <url>

$ agent-browser --auto-connect wait --load networkidle

$ agent-browser --auto-connect screenshot --annotate

$ agent-browser --auto-connect snapshot -i

$ agent-browser --auto-connect get box "<selector>"

$ agent-browser --auto-connect eval "document.querySelector('article')?.getBoundingClientRect()"

$ agent-browser --auto-connect get count "article[data-testid='tweet']"
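
To show how these commands might be chained, the sketch below assembles argument lists for one plausible sequence: open a page, wait for the network to settle, then request an element's bounding box. Only subcommands that appear verbatim above are used. The ordering and the `capture_plan` helper are assumptions for illustration, and nothing is executed.

```python
import shlex

# Common prefix shared by every command in the audit listing.
BASE = ["agent-browser", "--auto-connect"]


def capture_plan(url: str, selector: str) -> list[list[str]]:
    """Return argv lists for an open -> wait -> get box sequence (assumed order)."""
    return [
        BASE + ["open", url],
        BASE + ["wait", "--load", "networkidle"],
        BASE + ["get", "box", selector],
    ]


for argv in capture_plan("https://example.com", "article"):
    # shlex.join renders a shell-safe command line for logging or review.
    print(shlex.join(argv))
```

Building argv lists rather than shell strings keeps selectors containing quotes or brackets intact if the commands are later run with `subprocess.run(argv)`.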

https://www.google.com/search?q=<encoded-query>

https://www.reddit.com/search/?q=<query>

https://x.com/search?q=<query>

https://www.linkedin.com/search/results/content/?keywords=<query>

https://hn.algolia.com/?q=<query>

https://github.com/search?q=<query>

https://www.youtube.com/results?search_query=<query>
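
The `<query>` placeholders in these endpoints need URL encoding before use. Below is a small sketch of filling them in. The `SEARCH_TEMPLATES` keys and the `search_url` helper are hypothetical, and applying the same encoding to every site is an assumption, since the source marks only the Google URL as taking an encoded query.

```python
from urllib.parse import quote_plus

# URL templates copied from the audit listing; {q} replaces the <query> placeholder.
SEARCH_TEMPLATES = {
    "google": "https://www.google.com/search?q={q}",
    "reddit": "https://www.reddit.com/search/?q={q}",
    "x": "https://x.com/search?q={q}",
    "linkedin": "https://www.linkedin.com/search/results/content/?keywords={q}",
    "hackernews": "https://hn.algolia.com/?q={q}",
    "github": "https://github.com/search?q={q}",
    "youtube": "https://www.youtube.com/results?search_query={q}",
}


def search_url(site: str, query: str) -> str:
    """Return the search URL for `site` with `query` percent-encoded."""
    return SEARCH_TEMPLATES[site].format(q=quote_plus(query))


print(search_url("google", "retina screenshot & crop"))
```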

Tags: browser automation screenshot image-processing web-scraping
