Categories

Data Engineering & Integration

Data Engineering & Integration open-source alternatives curated by the directory.

Data Engineering & Integration

Use the same directory search, ordering, and pagination controls inside this collection.

15 of 15 tools

Lightdash

MIT

Self-serve analytics platform that transforms dbt models into interactive dashboards, enabling data-driven decisions across organizations.

Power BITableauDatabox
5.9K 729 Last commit 2 days ago
CodexClassroom

Airbyte

Unknown

Seamlessly sync data from any source to any destination with a flexible, extensible platform that grows with your data needs.

SupermetricsFivetranMatillion
21K 5.2K Last commit 14 hours ago
CodexClassroom

CloudQuery

MPL-2.0

CloudQuery is an open-source ELT platform that enables easy data integration from hundreds of cloud and security tools to any destination.

SnowflakeBigQuerySupermetrics
6.4K 547 Last commit 1 day ago
CodexClassroom

CocoIndex

Apache-2.0

Open-source ETL framework built in Rust for AI workloads. Features incremental processing, data lineage, and observability tools for semantic search and RAG applications.

PipedreamAmazon API GatewaySegment
10K 803 Last commit 21 hours ago
CodexClassroom

Cube

Unknown

Cube is a universal semantic layer that connects data sources to analytics tools, providing consistent definitions and fast queries.

SnowflakeHasuraLooker
20K 2.0K Last commit 23 hours ago
CodexClassroom

Elementary Data

Apache-2.0

Elementary provides dbt-native data observability to detect issues, understand root causes, and resolve problems quickly in data pipelines.

2.4K 221 Last commit 5 days ago
CodexClassroom

Gigapipe

AGPL-3.0

Unified platform for logs, metrics, traces and profiles with native compatibility for popular tools like OpenTelemetry, Prometheus, and Loki. No data silos, no usage limits.

DataDogSplunkElasticSearch
1.7K 90 Last commit 2 days ago
CodexClassroom

Impler

MIT

Open-source solution for effortless data importing, mapping, and validation in web applications

284 58 Last commit 2 months ago
CodexClassroom

Jitsu

MIT

Collect, transform, and sync data across your entire infrastructure with a flexible, code-based approach to data integration.

SupermetricsSegmentFivetran
4.8K 354 Last commit 3 days ago
CodexClassroom

Kestra

Apache-2.0

YAML-based orchestration platform with 1400+ plugins for running data pipelines, AI workflows, and infrastructure automation across teams at scale.

n8nMakeZapier
27K 2.6K Last commit 22 hours ago
CodexClassroom

Logstash

Unknown

Logstash is a free and open server-side data processing pipeline that ingests data from multiple sources, transforms it, and sends it to your desired destination.

DataDogSplunkTableau
15K 3.5K Last commit 2 days ago
CodexClassroom

Mage

Apache-2.0

Open-source data pipeline platform for effortless data integration, transformation, and orchestration using Python, SQL, and R.

PipedreamSupermetricsAmazon API Gateway
8.8K 970 Last commit 5 days ago
CodexClassroom

Open Wearables

MIT

Self-hosted health intelligence platform with open algorithms, AI reasoning engine, and zero per-user fees. Connect Apple Health, Whoop, Garmin, Oura, and more.

Terra APISpike APIJunction
1.9K 339 Last commit 19 hours ago
CodexClassroom

Orbital

Unknown

Automated data integration platform that connects APIs, databases, and event streams using semantic schemas. Deploy secure integrations in minutes with Git-based workflows.

KongApollo GraphQL
359 12 Last commit 3 months ago
CodexClassroom

Timeplus

Apache-2.0

Timeplus is a lightweight, powerful, and cost-efficient stream processing platform for real-time analytics, deployed as a single binary.

2.2K 111 Last commit 3 days ago
CodexClassroom