Best Open-source ETL & Data Integration tools

Explore curated open-source tools in the ETL & Data Integration category. Compare technologies, see alternatives, and find the right solution for your workflow.

9 projects · Page 1 of 1

Airbyte logo

Airbyte

Data integration platform for ELT pipelines from any source

Stars
20,507
License
Last commit
9 hours ago
PythonActive
CocoIndex logo

CocoIndex

Ultra-performant data transformation framework for AI pipelines

Stars
5,899
License
Apache-2.0
Last commit
10 hours ago
RustActive
OLake logo

OLake

Blazing-fast database replication to Apache Iceberg tables

Stars
1,272
License
Apache-2.0
Last commit
10 hours ago
GoActive
Apache Spark logo

Apache Spark

Fast, unified engine for large-scale data analytics

Stars
42,672
License
Apache-2.0
Last commit
10 hours ago
ScalaActive
Artie Transfer logo

Artie Transfer

Real-time CDC replication from OLTP to OLAP databases

Stars
793
License
Last commit
16 hours ago
GoActive
Meltano logo

Meltano

Declarative code-first data integration engine for modern pipelines

Stars
2,323
License
MIT
Last commit
1 day ago
PythonActive
CloudQuery logo

CloudQuery

High-performance ELT framework powered by Apache Arrow

Stars
6,308
License
MPL-2.0
Last commit
2 days ago
GoActive
Apache SeaTunnel logo

Apache SeaTunnel

Multimodal distributed data integration for massive-scale synchronization

Stars
9,063
License
Apache-2.0
Last commit
3 days ago
JavaActive
Mara Pipelines logo

Mara Pipelines

Lightweight Python ETL framework with PostgreSQL and web UI

Stars
2,086
License
MIT
Last commit
2 years ago
PythonDormant