browser-screenshot
The browser-screenshot skill captures focused, cropped images of specific web page regions. It uses a research-first workflow to identify target content, navigates via a browser-automation tool, locates elements using DOM selectors, and processes images with ImageMagick to ensure clean, professional outputs.
Is browser-screenshot safe to install?
Review the source first. Our audit of browser-screenshot's source files found 13 shell commands, 7 external URLs, and local file reads and writes, which we rate as high risk. Every command and URL listed appears verbatim in the skill's source. The skill executes shell commands to control a browser and manipulate image files. It makes network requests to navigate to arbitrary URLs, and it reads and writes local files to store and process screenshots.
How we audit skills: our security review methodology.
Who is this skill for?
Users who need to generate high-quality, focused screenshots of web content for articles, reports, or documentation without including distracting UI elements like navigation bars or sidebars.
What can you do with it?
- Capturing a specific Reddit post or tweet for social media analysis
- Extracting a data table or chart from a technical blog post
- Saving a clean visual of a product hero section or model card
- Documenting API specifications from official documentation pages
How good is this skill?
Quality score: 9/10. The skill documentation is highly structured and provides clear, actionable workflows. It includes specific safeguards such as Retina display handling and pre-flight URL validation. Its dependence on external tools such as ImageMagick is clearly documented.
What does the skill file contain?
# Skill: Browser Screenshot

Take focused screenshots of specific regions on web pages (a Reddit post, a tweet, an article section, a chart, etc.), not just a full-page dump.

> **Prerequisite**: agent-browser must be installed and Chrome must have remote debugging enabled. See `references/agent-browser-setup.md` if unsure.

---

## Overview

This skill handles the full pipeline:

1. **Research** the best page to screenshot (web search, fetch)
2. **Navigate** to the right page in the browser
3. **Locate** the target element/region on the page
4. **Capture** a focused, cropped screenshot of ju...
Frequently asked questions
Why should I use WebSearch before opening the browser?
The browser is for capturing, not browsing. Running WebSearch and WebFetch first lets the agent confirm that a page actually contains the needed information before it opens the browser, which avoids wasted navigation.
How does the skill handle Retina displays?
The skill detects the scale factor by comparing viewport size to actual image dimensions. It then multiplies the bounding box coordinates by the scale factor before cropping to ensure the final image is accurate.
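The arithmetic behind that answer can be sketched as follows. This is a minimal illustration, not the skill's actual code: the `Box` type, the function names, and the example numbers are all hypothetical, and it assumes the bounding box is reported in CSS pixels while the screenshot is measured in device pixels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """A bounding box; hypothetical helper type for this sketch."""
    x: float
    y: float
    width: float
    height: float


def scale_factor(viewport_width: int, image_width: int) -> float:
    """Ratio of screenshot pixels to viewport (CSS) pixels, e.g. 2.0 on Retina."""
    if viewport_width <= 0:
        raise ValueError("viewport_width must be positive")
    return image_width / viewport_width


def to_pixel_box(box: Box, scale: float) -> Box:
    """Multiply every coordinate of a CSS-pixel box by the scale factor."""
    return Box(box.x * scale, box.y * scale, box.width * scale, box.height * scale)


def crop_geometry(box: Box) -> str:
    """Format a box as an ImageMagick geometry string: WIDTHxHEIGHT+X+Y."""
    return f"{round(box.width)}x{round(box.height)}+{round(box.x)}+{round(box.y)}"


# Assumed example: a 1280px-wide viewport whose screenshot is 2560px wide.
scale = scale_factor(viewport_width=1280, image_width=2560)
css_box = Box(x=100, y=200, width=600, height=300)
geometry = crop_geometry(to_pixel_box(css_box, scale))
print(scale, geometry)
```

The resulting string has the form accepted by ImageMagick's `-crop` option, for example `magick full.png -crop 1200x600+200+400 +repage cropped.png`. The file names are placeholders, and the skill's own invocation may differ.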
What should I do if the target element is hard to find?
The skill provides a fallback method using JS-injected highlighting. It draws a red border around the candidate element, takes a screenshot for visual confirmation, and allows for adjustment before the final crop.
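One way such a highlight could be injected is sketched below. It is an assumption-laden illustration rather than the skill's own script: `highlight_script` is a hypothetical helper that only builds a JavaScript snippet as a string, and how that snippet gets evaluated in the page depends on the browser-automation tool in use. The sketch uses the CSS `outline` property instead of `border` so the element's size and position stay unchanged.

```python
import json


def highlight_script(selector: str, color: str = "red", width_px: int = 3) -> str:
    """Build a JavaScript snippet that outlines the first element matching `selector`.

    Hypothetical helper: json.dumps is used so the selector and style value
    are embedded as properly escaped JavaScript string literals.
    """
    sel = json.dumps(selector)
    style = json.dumps(f"{width_px}px solid {color}")
    return (
        "(() => {"
        f" const el = document.querySelector({sel});"
        " if (!el) return false;"      # no match: caller can try another selector
        f" el.style.outline = {style};"
        " return true;"                # match found and highlighted
        " })()"
    )


# Assumed example selector; real selectors depend on the target page.
script = highlight_script('div[data-testid="post"]')
print(script)
```

Because the snippet evaluates to `true` or `false`, the caller can tell whether the selector matched anything before taking the confirmation screenshot.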
Related skills
lark-workflow-meeting-summary
219.7K
Users who need to summarize meeting minutes, generate weekly reports, or review meeting history within the Lark ecosystem
The lark-workflow-meeting-summary skill provides a structured workflow for agents to aggregate meeting minutes from Lark VC, generate summaries, and create or update reports in Lark Docs.
programmatic-seo
83.5K
Content strategists, SEO specialists, and marketers building large-scale, data-driven website content
The programmatic-seo skill provides a framework for generating SEO-optimized pages at scale using templates and data. It guides users through keyword research, data sourcing, template design, internal linking, and indexation strategies while emphasizing content uniqueness to avoid search engine penalties.
ad-creative
64.0K
Performance marketers, growth teams, and advertisers who need to produce ad copy at scale or optimize existing ad creative
The ad-creative skill generates, iterates, and scales ad copy for paid advertising platforms including Google, Meta, LinkedIn, TikTok, and Twitter. It provides frameworks for creating ad headlines, descriptions, and primary text based on product context and performance data.
recipe-save-email-attachments
16.9K
Users who manage email attachments and Google Drive storage using the Google Workspace CLI
This recipe automates the process of locating Gmail messages containing attachments and saving those files to a specified Google Drive folder.