web-content-fetcher
The web-content-fetcher skill extracts article content from URLs and converts it into clean Markdown. It uses a local Python script with Scrapling for primary extraction and Jina Reader as a fallback for simple pages.
Is web-content-fetcher safe to install?
Review the source first. Our audit of web-content-fetcher's source files rated it high risk: it found 2 shell commands, 1 external URL, and no file writes. Every command and URL listed appears verbatim in the skill's source. The skill executes arbitrary Python scripts and makes network requests to external URLs, including the Jina Reader API and target websites.
How we audit skills: our security review methodology.
Who is this skill for?
Users who need to extract, scrape, or summarize content from web pages, including news articles, blog posts, and documentation.
What can you do with it?
- Extracting main content from news articles and blog posts
- Converting web pages into clean Markdown for reading or summarization
- Fetching content from JavaScript-rendered sites like WeChat or Zhihu
- Scraping documentation pages
How good is this skill?
Quality score: 9/10. The skill provides clear instructions, a domain routing table for optimization, and specific failure handling rules.
What does the skill file contain?
# Web Content Fetcher
Given a URL, return its main content as clean Markdown — headings, links, images, lists, code blocks all preserved.
## Extraction Strategy
Always try **one method per URL** — don't cascade blindly. Pick the right one upfront.
```
URL
 │
 ├─ 1. Scrapling script (preferred)
 │     Run fetch.py — check the domain routing table to decide fast vs --stealth.
 │     Works for most sites. Returns clean Markdown directly.
 │
 └─ 2. Jina Reader (fallback — only if Scrapling fails or dependencies not installed)
       web_fetch("https://r.jina.ai/<url>")
       Free tier: 200 re...
```
Frequently asked questions
Which extraction methods does the skill use?
It uses a local Scrapling script as the primary method and the Jina Reader API as a fallback.
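The choice between the two methods can be sketched roughly as follows. This is a minimal illustration, not the skill's own code: the helper names `choose_method` and `jina_reader_url` are hypothetical, and checking whether the `scrapling` package is importable is only an assumed stand-in for "dependencies not installed". The `https://r.jina.ai/<url>` pattern is the one shown in the skill file.

```python
import importlib.util
from urllib.parse import urlparse

JINA_PREFIX = "https://r.jina.ai/"  # prefix taken from the skill file excerpt


def choose_method() -> str:
    """Pick 'scrapling' when the package is importable, else 'jina' (assumed check)."""
    if importlib.util.find_spec("scrapling") is not None:
        return "scrapling"
    return "jina"


def jina_reader_url(url: str) -> str:
    """Build the Jina Reader address for an absolute http(s) URL."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {url!r}")
    return JINA_PREFIX + url


if __name__ == "__main__":
    print(choose_method())
    print(jina_reader_url("https://example.com/post"))
```

The sketch only builds the fallback address; actually requesting it is left to whatever fetch tool the agent has available.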
How does the skill handle JavaScript-heavy websites?
The skill uses a --stealth flag with the Scrapling script to trigger a headless browser for sites like WeChat, Zhihu, and Juejin.
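One way to picture the domain routing is a small lookup that decides whether to append `--stealth`. The sketch below rests on assumptions: the hostnames are guesses for the three sites named above (the skill's actual routing table is not reproduced here), and the `python3 fetch.py <url>` argument order is hypothetical. Only the `fetch.py` script name and the `--stealth` flag come from the skill file.

```python
from urllib.parse import urlparse

# Assumed hostnames for WeChat, Zhihu, and Juejin; the real routing table may differ.
STEALTH_DOMAINS = {"mp.weixin.qq.com", "zhihu.com", "juejin.cn"}


def needs_stealth(url: str) -> bool:
    """Return True when the URL's host is a listed domain or one of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in STEALTH_DOMAINS)


def build_command(url: str) -> list[str]:
    """Assemble an argument list for fetch.py (argument order is an assumption)."""
    cmd = ["python3", "fetch.py", url]
    if needs_stealth(url):
        cmd.append("--stealth")
    return cmd


if __name__ == "__main__":
    print(build_command("https://zhuanlan.zhihu.com/p/1"))
    print(build_command("https://example.com/article"))
```

Matching on the host suffix rather than a substring keeps look-alike domains such as `notzhihu.com` on the fast path.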
What happens if a URL fails to extract?
The skill stops after one failed attempt and informs the user that it cannot extract content from that URL.
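The single-attempt rule could look something like this in practice. It is a sketch under stated assumptions: `fetch_once` and the `extractor` callable are hypothetical names, and the wording of the failure message is illustrative rather than the skill's own.

```python
from typing import Callable


def fetch_once(url: str, extractor: Callable[[str], str]) -> str:
    """Call the extractor exactly once; on failure, return a message instead of retrying."""
    try:
        markdown = extractor(url)
    except Exception as exc:  # any extractor error ends the attempt
        return f"Could not extract content from {url} ({exc})."
    if not markdown.strip():
        # An empty result is treated as a failure too (assumption).
        return f"Could not extract content from {url}."
    return markdown


if __name__ == "__main__":
    def always_fails(_url: str) -> str:
        raise RuntimeError("blocked")

    print(fetch_once("https://example.com/a", always_fails))
```

Passing the extractor in as a callable keeps the no-retry policy separate from whichever method was chosen for the URL.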
Related skills
lark-workflow-meeting-summary
219.7K
Users who need to summarize meeting minutes, generate weekly reports, or review meeting history within the Lark ecosystem
The lark-workflow-meeting-summary skill provides a structured workflow for agents to aggregate meeting minutes from Lark VC, generate summaries, and create or update reports in Lark Docs.
programmatic-seo
83.5K
Content strategists, SEO specialists, and marketers building large-scale, data-driven website content
The programmatic-seo skill provides a framework for generating SEO-optimized pages at scale using templates and data. It guides users through keyword research, data sourcing, template design, internal linking, and indexation strategies while emphasizing content uniqueness to avoid search engine penalties.
ad-creative
64.0K
Performance marketers, growth teams, and advertisers who need to produce ad copy at scale or optimize existing ad creative
The ad-creative skill generates, iterates, and scales ad copy for paid advertising platforms including Google, Meta, LinkedIn, TikTok, and Twitter. It provides frameworks for creating ad headlines, descriptions, and primary text based on product context and performance data.
recipe-save-email-attachments
16.9K
Users who manage email attachments and Google Drive storage using the Google Workspace CLI
This recipe automates the process of locating Gmail messages containing attachments and saving those files to a specified Google Drive folder.