The API to search, scrape, and interact with the web at scale. 🔥
-
Updated
Jun 2, 2026 - TypeScript
The API to search, scrape, and interact with the web at scale. 🔥
Python scraper based on AI
A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.
Python client library for Diffbot APIs
Export Safari reading list to JSON or CSV
Web Data Frames
The official Node.js SDK for Spidra.
The web data layer for AI agents — fetch, search, crawl, extract, screenshot, and monitor the web with 50+ domain extractors and MCP.
Agent skill that gives it hands in the browser. 25+ tools to navigate, extract data, execute scripts, intercept APIs — all in user's own Chrome with their login sessions. No passwords needed. 给Agent一双手,像用户一样使用浏览器,25+自动化工具,数据完全本地处理。
Amazon product data analysis with Python & Jupyter. Includes cleaning, stats, and visualizations of categories, prices, ratings, and availability.
Analyze and parse HTML responses, programmatically scrape web data, and utilize Pandas DataFrames to store, transform, and merge tables.
High-performance web scraping engine that converts any web page into clean markdown --- with 3-layer fallback (Cheerio --> Playwright --> Abrasio) and AI-powered structured extraction
🚀 Analyze your website's AI readiness and optimize for performance with real-time scoring, recommendations, and detailed metrics.
Let AI agents fetch live social media and web data with the official Social Fetch MCP server.
Extract text from images using a robust OCR model designed for accuracy and efficiency in varied visual contexts.
Sync Notion workspace data to local SQLite and Markdown for offline search, version control, and analysis without relying on the Notion interface.
Get 15% OFF on 5 powerful products during Crawlbase Cyberweek 2025. Includes Crawling API, Smart AI Proxy, Crawler, Cloud Storage & LinkedIn Scraper.
Coresignal is a data-as-a-service company providing access to public web data on companies, employees, and jobs through a suite of REST APIs. The platform aggregates and refines more than 4.5 billion data records covering 75M+ companies (with 500+ data fields), 865M+ employee profiles (300+ fields), and 461M+ job postings (85+ fields).
Add a description, image, and links to the web-data topic page so that developers can more easily learn about it.
To associate your repository with the web-data topic, visit your repo's landing page and select "manage topics."