- Stars
- 42,415
- License
- Apache-2.0
- Last commit
- 3 days ago
Best ETL & Data Integration Tools
Explore leading tools in the ETL & Data Integration category, including open-source options and SaaS products. Compare features, use cases, and find the best fit for your workflow.
9 open-source projects · 6 SaaS products
Top open-source ETL & Data Integration
These projects are active, self-hostable choices for knowledge management teams evaluating alternatives to SaaS tools.
- Stars
- 20,155
- License
- Unknown
- Last commit
- 3 days ago

Apache SeaTunnel
Multimodal distributed data integration for massive-scale synchronization
- Stars
- 8,928
- License
- Apache-2.0
- Last commit
- 3 days ago
- Stars
- 6,257
- License
- MPL-2.0
- Last commit
- 3 days ago
- Stars
- 3,486
- License
- Apache-2.0
- Last commit
- 3 days ago
- Stars
- 2,279
- License
- MIT
- Last commit
- 4 days ago
CloudQuery is a composable data movement framework that extracts from cloud infrastructure and SaaS APIs to any destination, running entirely on your infrastructure.
Popular SaaS Platforms to Replace
Understand the commercial incumbents teams migrate from and how many open-source alternatives exist for each product.
Airbyte
Open-source data integration engine for ELT pipelines across data sources
Azure Data Factory
Cloud-based data integration service to create, schedule, and orchestrate ETL/ELT data pipelines at scale
Fivetran
Managed ELT data pipelines into warehouses
Hevo Data
No-code ETL and data integration platform for analytics-ready data
Matillion
Cloud-native ETL for data integration and transformation
Talend Data Fabric
Complete data management platform combining integration, quality, and governance
Azure Data Factory is a fully managed, serverless data integration service that allows users to create data-driven workflows (pipelines) for orchestrating and automating data movement and transformation. It supports connecting to on-premises and cloud data sources, enabling ETL/ELT operations for analytics and BI, with a code-free UI and the ability to schedule and monitor data pipelines to integrate data across various sources and destinations.
Frequently replaced when teams want private deployments and lower TCO.
Explore related categories
Browse neighbouring categories in Data Engineering to widen your evaluation.
- Data Catalogs & GovernanceMetadata catalogs with governance, discovery and lineage across data assets.
- Stream Processing EnginesFrameworks for real-time processing of streaming data and events.
- Web Scraping & CrawlingFrameworks and services for large-scale web data extraction with headless browsers and crawlers.
- Workflow Orchestration ToolsWorkflow managers for scheduling and orchestrating data pipelines.




