web-content-fetcher

The web-content-fetcher skill extracts article content from URLs and converts it into clean Markdown. It uses a local Python script with Scrapling for primary extraction and Jina Reader as a fallback for simple pages.

Installs: 2.7K

Quality: 9/10

Is web-content-fetcher safe to install?

Review the source first. Our audit of web-content-fetcher's source files found 2 shell commands, 1 external URL, and no file writes, and rates the skill high risk. Every command and URL listed appears verbatim in the skill's source. The skill executes arbitrary Python scripts and makes network requests to external URLs, including the Jina Reader API and target websites.

How we audit skills: our security review methodology.

Who is this skill for?

Users who need to extract, scrape, or summarize content from web pages, including news articles, blog posts, and documentation.

What can you do with it?

Extracting main content from news articles and blog posts
Converting web pages into clean Markdown for reading or summarization
Fetching content from JavaScript-rendered sites like WeChat or Zhihu
Scraping documentation pages

How good is this skill?

Quality score: 9/10. The skill provides clear instructions, a domain routing table for optimization, and specific failure handling rules.

What does the skill file contain?

SKILL.md

# Web Content Fetcher

Given a URL, return its main content as clean Markdown — headings, links, images, lists, code blocks all preserved.

## Extraction Strategy

Always try **one method per URL** — don't cascade blindly. Pick the right one upfront.

```
URL
 │
 ├─ 1. Scrapling script (preferred)
 │     Run fetch.py — check the domain routing table to decide fast vs --stealth.
 │     Works for most sites. Returns clean Markdown directly.
 │
 └─ 2. Jina Reader (fallback — only if Scrapling fails or dependencies not installed)
       web_fetch("https://r.jina.ai/<url>")
       Free tier: 200 re...
```

Frequently asked questions

Which extraction methods does the skill use?

It uses a local Scrapling script as the primary method and the Jina Reader API as a fallback.
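The choice between the two methods can be sketched in a few lines of Python. This is a minimal illustration, not the skill's actual code: the helper names `scrapling_installed`, `pick_method`, and `jina_reader_url` are hypothetical, and only the `https://r.jina.ai/<url>` pattern comes from the skill's source.

```python
import importlib.util

# Fallback endpoint pattern listed in the skill's source: https://r.jina.ai/<url>
JINA_PREFIX = "https://r.jina.ai/"


def scrapling_installed() -> bool:
    """Return True if the scrapling package can be imported (hypothetical helper)."""
    return importlib.util.find_spec("scrapling") is not None


def pick_method(installed: bool, scrapling_failed: bool = False) -> str:
    """Prefer the Scrapling script; use Jina Reader only if it is missing or failed."""
    if installed and not scrapling_failed:
        return "scrapling"
    return "jina"


def jina_reader_url(url: str) -> str:
    """Build the Jina Reader URL by prefixing the target URL."""
    return JINA_PREFIX + url


if __name__ == "__main__":
    print(pick_method(scrapling_installed()))
    print(jina_reader_url("https://example.com/post"))
```

Keeping the decision in one small function makes it easy to see that the fallback is reached only when the primary method is unavailable or has already failed.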

How does the skill handle JavaScript-heavy websites?

The skill uses a --stealth flag with the Scrapling script to trigger a headless browser for sites like WeChat, Zhihu, and Juejin.
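A domain routing check of this kind might look like the sketch below. The table contents are an assumption for illustration: the full routing table in SKILL.md is not shown here, so `STEALTH_DOMAINS` and `needs_stealth` are hypothetical names, and the three hostnames are simply the public domains of the sites named above.

```python
from urllib.parse import urlparse

# Hypothetical routing table; the real entries live in the skill's SKILL.md.
STEALTH_DOMAINS = {"mp.weixin.qq.com", "zhihu.com", "juejin.cn"}


def needs_stealth(url: str) -> bool:
    """Return True if the URL's host matches a stealth domain or one of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in STEALTH_DOMAINS)


if __name__ == "__main__":
    for u in ("https://zhuanlan.zhihu.com/p/1", "https://example.com/a"):
        print(u, "-> --stealth" if needs_stealth(u) else "-> fast")
```

Matching on the exact host or a dot-prefixed suffix avoids false positives such as `notzhihu.com`.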

What happens if a URL fails to extract?

The skill stops after one failed attempt and informs the user that it cannot extract content from that URL.
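The single-attempt rule can be illustrated with a small wrapper. This is only a sketch under assumptions: `fetch_once` and the `fetcher` callable are hypothetical, and the wording of the failure message is invented for the example rather than taken from the skill.

```python
from typing import Callable


def fetch_once(url: str, fetcher: Callable[[str], str]) -> str:
    """Call the fetcher exactly once; on error or empty output, report failure without retrying."""
    try:
        content = fetcher(url)
    except Exception:
        content = ""
    if not content.strip():
        # Assumed message wording; the skill's exact text is not shown in the source.
        return f"Could not extract content from {url}."
    return content


if __name__ == "__main__":
    print(fetch_once("https://example.com", lambda u: "# Title\n\nBody"))
    print(fetch_once("https://example.com", lambda u: ""))
```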

Data sourced from shirenchuang/web-content-fetcher on GitHub. Install counts from skills.sh. The summary and security audit are derived from the skill's source files: every command and URL listed appears verbatim in the source.

Install

Works with Claude Code, Cursor, Codex CLI, and 50+ agents.

Manual install paths per agent

Claude Code: .claude/skills/ or ~/.claude/skills/

Codex CLI: .codex/skills/ or ~/.codex/skills/

GitHub Copilot: .github/skills/ or ~/.copilot/skills/

Amp: .agents/skills/ or ~/.config/agents/skills/

Copy the skill folder (with SKILL.md) into the project or global directory.

Security audit

Risk level: high

Commands and endpoints:

$ python3 <SKILL_DIR>/scripts/fetch.py "<url>" [max_chars] [--stealth]

$ pip install scrapling html2text

https://r.jina.ai/<url>
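The `fetch.py` invocation listed above can be assembled as an argument list before it is run. The sketch below is an assumption-laden illustration: `build_fetch_command` is a hypothetical helper, and it only mirrors the argument order shown in the command line (`"<url>" [max_chars] [--stealth]`).

```python
from typing import List, Optional


def build_fetch_command(
    skill_dir: str,
    url: str,
    max_chars: Optional[int] = None,
    stealth: bool = False,
) -> List[str]:
    """Build the argv list for: python3 <SKILL_DIR>/scripts/fetch.py "<url>" [max_chars] [--stealth]."""
    argv = ["python3", f"{skill_dir.rstrip('/')}/scripts/fetch.py", url]
    if max_chars is not None:
        argv.append(str(max_chars))
    if stealth:
        argv.append("--stealth")
    return argv


if __name__ == "__main__":
    # Passing a list to subprocess.run (instead of a shell string) avoids quoting problems in URLs.
    print(build_fetch_command("/path/to/skill", "https://example.com/post", 5000, stealth=True))
```

Passing the resulting list to `subprocess.run` keeps the URL as a single argument, so characters such as `&` or `?` are not interpreted by a shell.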

Source: shirenchuang/web-content-fetcher

Installs: 2.7K

Quality: 9/10

Tags: scraping, markdown, web-fetcher, automation
