Websets

Structured web data,
built for your workflow

Websets turns any web search into a clean, structured dataset — automatically enriched, deduplicated, and refreshed on your schedule. No scraping, no pipelines, no maintenance.

websets-example.js
// Create a Webset via API
const webset = await client.websets.create({
  query: "AI startups founded after 2022 with series A funding",
  schema: {
    company_name: "string",
    founded_year: "number",
    funding_stage: "string",
    total_raised: "string",
    headquarters: "string",
    description: "string",
  },
  numResults: 500,
  refresh: "daily",
})

// Stream results as they arrive
for await (const record of webset.stream()) {
  console.log(record.company_name, record.total_raised)
}

Everything you need

Powerful features built for production data pipelines

Structured web datasets

Turn any web search into a clean, structured dataset. Define the schema once and Websets extracts and normalises data from across the web.

Always up-to-date

Websets automatically refresh on a schedule you choose — hourly, daily, or weekly. Your dataset stays current without any manual work.

Enterprise-grade quality

Built on RunoxAI's 10B+ page index with real-time crawling fallback. Deduplication, normalization, and confidence scores included.

API & export

Access your Websets via REST API, export to CSV/JSON, or connect directly to your data pipeline. Integrates with Zapier, Make, and more.

Team collaboration

Share Websets across your team with granular access controls. Comment, tag, and version your datasets for seamless collaboration.

Custom schemas

Define exactly what fields you need. Websets will extract, validate, and type-check every field using AI-powered understanding.

Built for every team

How teams use Websets to move faster

Sales & GTM

Build your ICP lead list automatically

Define your ideal customer profile and Websets will continuously find and enrich matching companies. Get firmographics, tech stack, recent news, and contact info — all structured and ready to import.

70M+
companies indexed
<2s
per record
Recruiting

Source candidates from across the web

Search for candidates by role, skills, location, and more. Websets aggregates profiles from LinkedIn, GitHub, personal sites, and beyond into a single enriched dataset.

500M+
profiles
Real-time
updates
Market Intelligence

Track your competitive landscape

Monitor competitor pricing, product updates, job postings, and press mentions. Websets delivers structured intelligence on any market segment, updated on your schedule.

10B+
pages tracked
Daily
refresh

Start building with Websets

Free tier available. No credit card required. Start extracting structured web data in minutes.