📄 NEW: Free Data Engineering Cheatsheet 2026 — SQL, Airflow, Spark, Kafka, dbt & more →
Airbyte logo

Airbyte

Free tier available

Open-source data integration platform for ELT pipelines

ETL/ELT open-source elt connectors

📖 Overview

Airbyte is an open-source data integration platform that helps you consolidate data from various sources into your data warehouse. With 350+ pre-built connectors and the ability to build custom ones, it's become a popular alternative to commercial ETL tools. Founded in 2020, Airbyte has grown rapidly to become the most-starred data integration project on GitHub (13,000+ stars). It raised $181M in funding and powers data pipelines at thousands of companies from startups to enterprises.

Key Features

  • 300+ Connectors: Pre-built sources and destinations covering most common data sources
  • CDC Support: Change data capture for real-time data syncing
  • Custom Connectors: Build your own with the Connector Development Kit (CDK)
  • Normalization: Optional automatic schema normalization
  • Self-hosted or Cloud: Run on your infrastructure or use Airbyte Cloud
  • dbt Integration: Native integration with dbt for transformations

💰 Pricing

Model
open source
Starting Price
$0
Cloud/Pro
$1.25/credit
Free tier available 🏢 Enterprise plans available

👍 Pros

  • + Fully open-source core with active community
  • + Massive connector library (largest in OSS space)
  • + Self-hosted option for data-sensitive organizations
  • + Generous free tier on cloud
  • + Active development and frequent releases

👎 Cons

  • Self-hosting requires DevOps resources
  • Some connectors less mature than others
  • Can be resource-intensive at scale
  • Cloud pricing can add up with high volume

🎯 Best For

Teams who want control over their data pipelines, need specific connectors, or have data residency requirements. Great for startups and mid-size companies building modern data stacks. **Common use cases:** - Syncing SaaS data (Salesforce, HubSpot, Stripe) to Snowflake/BigQuery - Database replication with CDC (PostgreSQL, MySQL to warehouse) - Marketing analytics pipelines (Google Ads, Facebook Ads, LinkedIn) - Building a centralized data lake from distributed sources - Feeding data to dbt for downstream transformations

🔗 Works With

📁 More ETL/ELT Tools

View all ETL/ELT tools →