
Scrapix

Scrapix is an AI-powered web scraping and data extraction platform. It lets you scrape pages, collect links, crawl entire sites, and extract structured data — all through a simple API or SDK, without managing proxies, headless browsers, or anti-bot infrastructure.


Core Operations

Operation   What it does
Scrape      Fetch a single URL and return its content as HTML, text, Markdown, PDF, or DOCX
Collect     Discover all URLs linked from a page, with path filtering and AI-powered selection
Crawl       Collect URLs from a page and scrape each one (collect + scrape combined)
Extract     Use AI to extract structured data or generate a summary from any page
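
To make the "collect + scrape combined" description of Crawl concrete, here is a conceptual sketch in Python. It is not how Scrapix implements Crawl: `collect` and `scrape` are hypothetical stand-in callables that you would supply, and `path_prefix` is an assumed, simplified form of path filtering.

```python
from typing import Callable, Dict, Iterable
from urllib.parse import urlparse


def crawl(
    start_url: str,
    collect: Callable[[str], Iterable[str]],  # hypothetical: returns URLs linked from a page
    scrape: Callable[[str], str],             # hypothetical: returns the content of one URL
    path_prefix: str = "/",
) -> Dict[str, str]:
    """Collect URLs from start_url, keep those under path_prefix, scrape each once."""
    results: Dict[str, str] = {}
    for url in collect(start_url):
        path = urlparse(url).path or "/"  # treat a bare domain as the root path
        if path.startswith(path_prefix) and url not in results:
            results[url] = scrape(url)
    return results
```

The result maps each matching URL to its scraped content, in discovery order, with duplicates skipped.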

How to Use Scrapix

Scrapix offers three ways to interact with the platform:

Playground

An interactive workbench in your browser for running Scrape, Crawl, Collect, and Extract against any URL. Tune parameters, inspect live responses, and copy ready-to-use request payloads. It is the fastest way to validate a use case before writing any code.

Open the Playground guide →

REST API

A language-agnostic HTTPS API secured with an API key. Submit jobs, poll status, and fetch results as JSON — pair it with any stack (cURL, Postman, Go, Ruby, n8n, Zapier) and scale from a one-off script to production pipelines.
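
The submit, poll, fetch flow can be sketched as a small, generic polling helper in Python. This is only an illustration under assumptions: the status endpoint, its response fields, and the "done" condition are not specified here, so the helper takes them as caller-supplied functions (`fetch_status` and `is_done` are hypothetical names) rather than hard-coding anything about the API.

```python
import time
from typing import Any, Callable


def poll_until_done(
    fetch_status: Callable[[], Any],   # hypothetical: performs one status request
    is_done: Callable[[Any], bool],    # hypothetical: decides whether the job has finished
    interval: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Any:
    """Call fetch_status until is_done accepts the result or attempts run out."""
    for attempt in range(max_attempts):
        status = fetch_status()
        if is_done(status):
            return status
        if attempt < max_attempts - 1:
            sleep(interval)  # wait only between attempts, not after the last one
    raise TimeoutError(f"job not finished after {max_attempts} attempts")
```

Passing `sleep` as a parameter keeps the helper easy to test and lets you swap in backoff logic later.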

Read the API docs →

SDKs

Official client libraries for Python and TypeScript/JavaScript. Typed methods, built-in authentication, automatic retries, and idiomatic error handling — drop them into your project and start scraping in a few lines of code.


Built-in Capabilities

Every Scrapix operation supports these features out of the box:

  • JavaScript rendering — load pages in a headless browser for JS-heavy sites
  • Premium proxies — route requests through residential proxies to bypass blocks
  • Auto-proxy mode — automatically escalate through proxy tiers and rendering
  • CAPTCHA solving — solve CAPTCHA challenges automatically
  • Caching — serve recently fetched pages instantly at reduced cost
  • Structured extraction — define a JSON schema and let AI extract data into it
  • Summarization — get AI-generated summaries of any page
  • Multiple output formats — HTML, text, Markdown, PDF, DOCX, base64, JSON, XML, YAML, TOML
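
As an illustration of what "define a JSON schema" can look like for structured extraction, here is a small Python sketch. The product fields are made up for the example, and how the schema is passed in a request is not shown because the parameter name is not given here. The `matches_schema` helper is a hypothetical local check covering required keys and primitive types only, not a full JSON Schema validator.

```python
import json

# Example schema for a product page; the field names are illustrative assumptions.
PRODUCT_SCHEMA = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "price": {"type": "number"},
        "in_stock": {"type": "boolean"},
    },
    "required": ["name", "price"],
}

_PY_TYPES = {"string": str, "number": (int, float), "boolean": bool}


def matches_schema(record: dict, schema: dict) -> bool:
    """Check required keys and primitive types of an extracted record."""
    if any(key not in record for key in schema.get("required", [])):
        return False
    for key, value in record.items():
        spec = schema["properties"].get(key)
        if spec is None:
            continue  # keys not listed in the schema are ignored in this sketch
        if isinstance(value, bool) and spec["type"] != "boolean":
            return False  # bool is an int subclass in Python, so reject it explicitly
        if not isinstance(value, _PY_TYPES[spec["type"]]):
            return False
    return True


schema_json = json.dumps(PRODUCT_SCHEMA)  # serialized form, ready to embed in a request body
```

A check like this is useful as a quick sanity test on extracted records before they enter a pipeline.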

Quick Start

Get your first result in under 5 minutes:

curl -X POST https://api-scrapix.promptcloud.com/v1/scrape \
-H "Content-Type: application/json" \
-H "x-api-key: YOUR_API_KEY" \
-d '{"url": "https://example.com"}'
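
The same request can be sketched in Python using only the standard library. The endpoint, headers, and body mirror the cURL command above; the helper names `build_scrape_request` and `scrape` are illustrative, not part of an official SDK. Since the response shape is not described here, the sketch returns the raw response text instead of parsing assumed fields.

```python
import json
import urllib.request

API_URL = "https://api-scrapix.promptcloud.com/v1/scrape"  # endpoint from the cURL example


def build_scrape_request(target_url: str, api_key: str) -> urllib.request.Request:
    """Build the same POST request as the cURL example, without sending it."""
    body = json.dumps({"url": target_url}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json", "x-api-key": api_key},
    )


def scrape(target_url: str, api_key: str, timeout: float = 30.0) -> str:
    """Send the request and return the raw response body as text."""
    request = build_scrape_request(target_url, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `scrape("https://example.com", "YOUR_API_KEY")` would perform the request; `urlopen` raises `urllib.error.HTTPError` on non-2xx responses, so wrap the call in a `try` block if you need to handle failures.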

Full getting started guide →


What's Next?

  • Getting Started — Create an account, get an API key, make your first request
  • Core Concepts — Understand how operations, credits, and authentication work
  • Operations — Deep dive into each operation's parameters and responses