Extract structured data from any website
Send a URL and get structured JSON back. Define the fields you need — the API handles fetching, rendering, and cleaning the page automatically. No scraper to build or maintain.
No credit card required — start with free trial credits
One output feeds the next
Website Extraction is part of a complete content pipeline. One key, one credit pool, and structured JSON responses designed to chain together.
Mix and match freely
Extract data from a document, generate visuals from the results, then compile everything into a finished report. Mix, match, and build your own pipeline.
Three steps to your first extraction
Send a URL
Pass any public URL. Static and JavaScript-rendered pages are both supported. No configuration required.
- Any public URL
- Static and JavaScript-rendered pages
Define a schema
Describe the fields you want to extract — prices, descriptions, contact details, tables. The page is fetched, rendered, and cleaned internally before your schema is applied.
- Named entities, numbers, dates, and nested lists
- Nested arrays for tables and repeating sections
Get structured data
Receive JSON with extracted fields, confidence scores, and source citations from the page. Feed it straight into any next pipeline step without transformation.
- Confidence scores for every field
- Source citations from the page content
JavaScript Rendering
JavaScript-rendered pages are handled automatically. Dynamic content and client-side navigation work without any configuration.
Automatic Proxy Rotation
Requests are automatically routed to handle bot detection and access restrictions. Most public sites work out of the box without any setup.
Deep Content Understanding
Pages aren't parsed as raw text patterns. The API understands what a page depicts — product listings, article content, embedded tables — and extracts field values from that meaning.
Schema-Driven Results
Define typed fields — prices, dates, text, nested lists — and get structured JSON back. No HTML parsing, no prompt engineering.
Built-In Trust Scores
Every extracted value includes a confidence score and a source citation from the page. Route low-confidence results to human review.
Your data stays in the EU
Your data is processed on EU servers and never stored beyond temporary logs. Zero retention, GDPR-compliant by design, with a Data Processing Agreement available for every customer. Learn more about our security practices .
No data storage
We don't store your files or processing results. Logs are automatically deleted after 90 days.
EU-hosted infrastructure
All processing runs on servers located in the European Union. Your data never leaves the EU.
GDPR-compliant by design
Full compliance with EU data protection regulations. Data Processing Agreement available for all customers.
Pricing
Start with free trial credits. No credit card required.
Developer
For individuals & small projects
-
1,000 credits / monthThat's either: 1,000 image transformations 500 document generations 500 image generations 500 sheet generations 200 document extractions (5-page docs) 200 markdown conversions (5-page docs)
-
All APIs included
-
Free trial credits per API
-
Email support
-
Budget caps per key
-
Optional auto top-up
Startup
Save 40%For growing teams
-
5,000 credits / monthThat's either: 5,000 image transformations 2,500 document generations 2,500 image generations 2,500 sheet generations 1,000 document extractions (5-page docs) 1,000 markdown conversions (5-page docs)
-
All APIs included
-
Free trial credits per API
-
Priority support
-
Budget caps per key
-
Optional auto top-up
Business
Save 47%For high-volume workloads
-
15,000 credits / monthThat's either: 15,000 image transformations 7,500 document generations 7,500 image generations 7,500 sheet generations 3,000 document extractions (5-page docs) 3,000 markdown conversions (5-page docs)
-
All APIs included
-
Free trial credits per API
-
Priority support
-
Budget caps per key
-
Optional auto top-up