Extract data from any URL
Send a URL, get back clean markdown, metadata, JSON-LD schemas, and AI-powered extractions. Handles JavaScript-rendered pages out of the box.
# Response — extracted content
"markdown": "# Example Domain\nThis domain is for use in...",
"metadata": { "title": "Example Domain", "language": "en" },
"credits_used": 1
}
From URL to structured data
A single API call handles fetching, parsing, and extraction.
Fetch
The page is fetched via HTTP or rendered with a headless browser for JS-heavy sites.
Parse
HTML is cleaned, content extracted, and metadata/schemas parsed automatically.
Return
Clean markdown, metadata, schemas, and optional AI enrichments are returned instantly.
Multiple output formats
Markdown
Clean, readable markdown with preserved heading structure, links, and formatting.
HTML
Raw or cleaned HTML. Great for custom parsing pipelines or archival.
Metadata
Title, description, language, Open Graph tags, Twitter cards, and more.
JSON-LD / Schema
Structured data extracted from JSON-LD, microdata, and RDFa embedded in the page.
AI Summary
LLM-generated summary of the page content. Concise and accurate.
AI Extraction
Extract structured data using a custom prompt. Define your own schema.