📄 NEW: Free Data Engineering Cheatsheet 2026 — SQL, Airflow, Spark, Kafka, dbt & more →
Dremio logo

Dremio

Free tier available

Lakehouse platform with SQL query engine

Data Lakes lakehouse sql query-engine

📖 Overview

Dremio is a lakehouse platform that provides a fast SQL query engine directly on your data lake. Built on Apache Arrow, it offers excellent performance without requiring data movement. Strong focus on Apache Iceberg and open formats.

Key Features

  • Arrow-based Engine: Fast columnar execution
  • Data Reflections: Automatic query acceleration
  • Iceberg Native: First-class Iceberg support
  • Semantic Layer: Curated datasets and views
  • Federation: Query across sources
  • Nessie Integration: Git-like data versioning

💰 Pricing

Model
open source
Starting Price
$0
Cloud/Pro
Pay-as-you-go
Free tier available 🏢 Enterprise plans available

👍 Pros

  • + Query data lake directly (no ETL)
  • + Excellent Arrow Flight performance
  • + Strong open-source commitment
  • + Good Iceberg integration
  • + Self-service data access

👎 Cons

  • Less mature than Databricks/Snowflake
  • Cloud offering still growing
  • Can be complex to tune
  • Smaller ecosystem

🎯 Best For

Organizations wanting to query data lakes directly without moving data to a warehouse. Ideal for Iceberg-centric architectures.

🔗 Works With

📁 More Data Lakes Tools

View all Data Lakes tools →