Best Open-Source ETL & Data Integration Tools

Explore curated open-source tools in the ETL & Data Integration category. Compare technologies, see alternatives, and find the right solution for your workflow.

9 projects

CloudQuery

High-performance ELT framework powered by Apache Arrow

Stars: 6,257
License: MPL-2.0
Last commit: 3 days ago
Language: Go
Status: Active


Apache SeaTunnel

Multimodal distributed data integration for massive-scale synchronization

Stars: 8,928
License: Apache-2.0
Last commit: 3 days ago
Language: Java
Status: Active


Airbyte

Data integration platform for ELT pipelines from any source

Stars: 20,155
License: Unknown
Last commit: 3 days ago
Language: Python
Status: Active


Artie Transfer

Real-time CDC replication from OLTP to OLAP databases

Stars: 683
License: Unknown
Last commit: 3 days ago
Language: Go
Status: Active


OLake

Blazing-fast database replication to Apache Iceberg tables

Stars: 1,198
License: Apache-2.0
Last commit: 3 days ago
Language: Go
Status: Active


CocoIndex

Ultra-performant data transformation framework for AI pipelines

Stars: 3,486
License: Apache-2.0
Last commit: 3 days ago
Language: Rust
Status: Active


Apache Spark

Fast, unified engine for large-scale data analytics

Stars: 42,415
License: Apache-2.0
Last commit: 3 days ago
Language: Scala
Status: Active


Meltano

Declarative code-first data integration engine for modern pipelines

Stars: 2,279
License: MIT
Last commit: 4 days ago
Language: Python
Status: Active


Mara Pipelines

Lightweight Python ETL framework with PostgreSQL and web UI

Stars: 2,085
License: MIT
Last commit: 1 year ago
Language: Python
Status: Dormant