    Repositories list

    • This project is the 🏠 home of Apify actor template projects to help users quickly get started.
      Python
      152672 Updated Oct 22, 2024
    • Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
      Python
      Apache License 2.0
      2864.2k6915 Updated Oct 22, 2024
    • workflows
      Apify's reusable GitHub workflows
      Python
      3624 Updated Oct 22, 2024
    • Utilities and constants shared across Apify projects.
      TypeScript
      Apache License 2.0
      111241 Updated Oct 22, 2024
    • openapi
      An OpenAPI specification for the Apify API.
      JavaScript
      MIT License
      02183 Updated Oct 22, 2024
    • Apify SDK monorepo
      TypeScript
      Apache License 2.0
      35123106 Updated Oct 21, 2024
    • This project is the home of Apify's documentation.
      API Blueprint
      Apache License 2.0
      73286222 Updated Oct 21, 2024
    • Apify API client for Python
      Python
      Apache License 2.0
      1147101 Updated Oct 21, 2024
    • The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
      Python
      Apache License 2.0
      11117120 Updated Oct 21, 2024
    • Apify API client for JavaScript / Node.js.
      JavaScript
      Apache License 2.0
      2767164 Updated Oct 21, 2024
    • Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
      TypeScript
      Apache License 2.0
      1009391810 Updated Oct 21, 2024
    • apify-cli
      The Apify command-line interface helps you create, develop, build, and run Apify actors, and manage the Apify cloud platform.
      TypeScript
      18122348 Updated Oct 21, 2024
    • crawlee
      Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
      TypeScript
      Apache License 2.0
      65915k11515 Updated Oct 18, 2024
    • RAG Web Browser is a tool to provide your RAG pipelines with up-to-date information from the web.
      TypeScript
      0100 Updated Oct 17, 2024
    • Apify ESLint preset to be shared between projects
      JavaScript
      Apache License 2.0
      0211 Updated Oct 16, 2024
    • Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
      JavaScript
      Apache License 2.0
      143842911 Updated Oct 15, 2024
    • A Homebrew tap for Apify tools
      Ruby
      1804 Updated Oct 14, 2024
    • This tool integrates with AWS to monitor service usage costs and posts a summary of these costs to a Slack channel. The summary includes costs for various AWS services along with a chart that provides a visual breakdown of the costs over time.
      TypeScript
      MIT License
      0001 Updated Oct 14, 2024
    • Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.
      TypeScript
      25000 Updated Oct 9, 2024
    • This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
      0264 Updated Oct 8, 2024
    • Base Docker images for Apify actors.
      Dockerfile
      Apache License 2.0
      236992 Updated Oct 8, 2024
    • Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)
      Python
      Apache License 2.0
      4300 Updated Oct 6, 2024
    • airbyte
      Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
      Python
      Other
      4.1k000 Updated Oct 3, 2024
    • Apify extractor for Keboola Connection
      JavaScript
      Apache License 2.0
      0050 Updated Oct 2, 2024
    • The official integration for Apify and Haystack 2.0
      Python
      Apache License 2.0
      0100 Updated Sep 23, 2024
    • This action simplifies creating a release PR
      JavaScript
      Apache License 2.0
      0010 Updated Sep 12, 2024
    • idcac
      I Don't Care About Cookies extension compiled for use with Playwright/Puppeteer
      JavaScript
      GNU General Public License v3.0
      0901 Updated Sep 9, 2024
    • An example repository with multiple Apify Actors sharing code between each other.
      JavaScript
      5111 Updated Sep 6, 2024
    • Custom Algolia search modal for Apify Documentation.
      TypeScript
      MIT License
      1002 Updated Sep 5, 2024
    • Apify integration for Zapier
      JavaScript
      Apache License 2.0
      1850 Updated Aug 27, 2024