content-parser

The content-parser skill extracts and normalizes text content from URLs using the ListenHub API. It supports optional configuration for summarization, content length limits, and Twitter/X tweet counts. The skill saves extracted content as markdown and raw JSON files in the current working directory.

Installs: 904
Use cases: 4
Quality: 5/10

Is content-parser safe to install?

Review the source first

Our audit of content-parser's source files found 6 shell commands, 2 external URLs, and file reads and writes (high risk). Every command and URL counted here appears verbatim in the skill's source. The skill executes shell commands to manage local configuration files and makes network requests to an external API. It writes files to the current working directory.


Who is this skill for?

Users who need to extract structured text, metadata, and references from web pages or social media profiles for use in other AI agent workflows.

What can you do with it?

  • Extracting article text from a URL for summarization
  • Fetching recent tweets from a Twitter/X profile
  • Preprocessing web content for audio or video generation skills
  • Saving web content as local markdown files for documentation

How good is this skill?

Quality score: 5/10. The skill documentation is comprehensive, providing clear instructions, API references, and specific bash commands for implementation.

What does the skill file contain?

SKILL.md
## When to Use

- User provides a URL and wants to extract/read its content
- Another skill needs to parse source material from a URL before generation
- User says "parse this URL", "extract content from this link"
- User says "解析链接" ("parse the link"), "提取内容" ("extract content")

## When NOT to Use

- User already has text content and doesn't need URL parsing
- User wants to generate audio/video content (not content extraction)
- User wants to read a local file (use standard file reading tools)

## Purpose

Extract and normalize content from URLs across supported platforms. Returns structured data including content body, metadata,...

Frequently asked questions

What environment variables are required?

The skill requires the LISTENHUB_API_KEY environment variable to authenticate requests.
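
A minimal sketch of a fail-fast check for this variable, written in Python for illustration. Only the variable name `LISTENHUB_API_KEY` comes from the skill; the helper `get_api_key` and its error message are hypothetical and do not appear in the skill's source.

```python
import os


def get_api_key(env=None):
    """Return the ListenHub API key, or raise if it is missing or blank.

    Hypothetical helper: only the variable name LISTENHUB_API_KEY
    comes from the skill's documentation.
    """
    env = os.environ if env is None else env
    key = env.get("LISTENHUB_API_KEY", "").strip()
    if not key:
        raise RuntimeError("LISTENHUB_API_KEY is not set")
    return key
```

Checking the key before any request is sent turns a missing variable into a clear local error instead of an authentication failure from the API.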

Where does the skill save extracted content?

The skill saves extracted content as .md and .json files in the current working directory.
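
The save step might look roughly like the following Python sketch. It assumes the extraction result is a dictionary whose text sits under a `content` key; that key, the `save_extraction` helper, and the file-naming scheme are illustrative assumptions, not the skill's actual implementation.

```python
import json
from pathlib import Path


def save_extraction(result, stem, directory=None):
    """Write `result` as <stem>.md and <stem>.json and return both paths.

    Hypothetical helper; the "content" key is an assumed response field.
    """
    directory = Path.cwd() if directory is None else Path(directory)
    md_path = directory / f"{stem}.md"
    json_path = directory / f"{stem}.json"
    # Markdown file: only the extracted text body.
    md_path.write_text(result.get("content", ""), encoding="utf-8")
    # JSON file: the full result; ensure_ascii=False keeps non-ASCII text readable.
    json_path.write_text(
        json.dumps(result, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return md_path, json_path
```

Because both files land in the current working directory by default, running this from a project root will create or overwrite files there, which is worth knowing before invoking the skill.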

Can I configure the extraction process?

Yes, users can configure options such as summarization, maximum content length, and the number of tweets to fetch for Twitter/X URLs.
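
As an illustration of how such options could be gathered before a request, here is a small Python sketch. The option names `summarize`, `max_length`, and `tweet_count` are placeholders chosen for this example; the skill's real parameter names are not shown on this page and may differ.

```python
def build_options(summarize=None, max_length=None, tweet_count=None):
    """Return a dict holding only the options the caller actually set.

    All three parameter names are hypothetical stand-ins for the skill's
    summarization, content-length, and tweet-count settings.
    """
    if max_length is not None and max_length <= 0:
        raise ValueError("max_length must be positive")
    if tweet_count is not None and tweet_count <= 0:
        raise ValueError("tweet_count must be positive")
    options = {
        "summarize": summarize,
        "max_length": max_length,
        "tweet_count": tweet_count,
    }
    # Drop unset options so the service can apply its own defaults.
    return {name: value for name, value in options.items() if value is not None}
```

For example, `build_options(summarize=True, tweet_count=10)` yields a dictionary with just those two entries, and calling it with no arguments yields an empty one.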

How does the skill handle long-running extractions?

The skill uses a polling mechanism with a 5-second interval to check the status of the extraction task via the API.
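
The polling pattern can be sketched in Python as below. The 5-second interval is from the skill's description; everything else is an assumption made for this example, including the `poll_until_done` helper, the `status` field, its `"success"` and `"failed"` values, and the cap on attempts. The status check is passed in as a callable so that no ListenHub endpoint has to be guessed.

```python
import time


def poll_until_done(check_status, interval=5.0, max_attempts=60, sleep=time.sleep):
    """Call `check_status()` every `interval` seconds until a terminal state.

    Assumption: `check_status` returns a dict with a "status" key whose
    terminal values are "success" or "failed". Real field names may differ.
    """
    for attempt in range(max_attempts):
        result = check_status()
        status = result.get("status")
        if status == "success":
            return result
        if status == "failed":
            raise RuntimeError(f"extraction failed: {result}")
        if attempt < max_attempts - 1:
            sleep(interval)  # wait before the next status check
    raise TimeoutError(f"task not finished after {max_attempts} checks")
```

Capping the number of attempts keeps a stuck task from blocking an agent indefinitely, and injecting `sleep` makes the loop testable without real waiting.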

Data sourced from marswaveai/skills on GitHub. Install counts from skills.sh. The summary and security audit are derived from the skill's source files: every command and URL listed appears verbatim in the source.

Related skills

firecrawl-lead-gen

12.8K

Sales professionals, recruiters, and researchers who need to compile prospect lists from public web sources

The firecrawl-lead-gen skill extracts prospect information from web directories and databases using the Firecrawl browser. It generates structured lead lists in JSON, CSV, or Markdown formats based on user-defined criteria.

high · lead-generation · web-scraping · firecrawl

firecrawl-seo-audit

12.8K

SEO specialists, content marketers, and website owners who need automated, data-driven site audits

The firecrawl-seo-audit skill performs website SEO audits by mapping site structures, scraping key pages, and comparing content against target keywords and competitor search results.

high · seo · audit · firecrawl

fetching-salesforce-docs

1.5K

AI agents requiring authoritative, grounded information from Salesforce-owned documentation domains

A retrieval skill for fetching and grounding AI agent responses in official Salesforce documentation. It provides a playbook for navigating developer, help, architect, and design system portals while handling JS-heavy or shell-rendered pages.

high · salesforce · documentation · forcedotcom

chrome-automation

1.3K

Developers and AI agents requiring browser automation capabilities within a user's existing, authenticated Chrome session

The chrome-automation skill provides a command-line interface for agents to interact with a user's active Chrome browser session. It supports navigation, form filling, data extraction, and replaying DevTools recordings by connecting to a running Chrome instance with remote debugging enabled.

high · browser-automation · chrome · zc277584121