
Airbyte
Open-source data integration engine for ELT pipelines across data sources
Discover top open-source software, updated regularly with real-world adoption signals.

High-performance ELT framework powered by Apache Arrow
CloudQuery is a composable data movement framework that extracts from cloud infrastructure and SaaS APIs to any destination, running entirely on your infrastructure.

CloudQuery is a high-performance data movement framework designed for developers who need complete control over their data pipelines. Built on Apache Arrow, it extracts data from cloud infrastructure, SaaS platforms, and APIs, delivering it to any destination—all while running entirely on your infrastructure.
Engineering and security teams leverage CloudQuery for cloud security posture management (CSPM), asset inventory, FinOps, and attack surface management. Data engineers use it as a flexible ELT platform to eliminate data silos across security, infrastructure, marketing, and finance teams.
The framework offers a code-first, extensible plugin system with no vendor lock-in. Its composable architecture integrates with your existing languages, destinations, and orchestrators. Specialized plugins provide first-class support for complex data sources including AWS, GCP, Azure, and hundreds of other integrations. Because your data never touches external servers, CloudQuery fits regulated, secure, and performance-critical environments where privacy is paramount.
Built in Go and distributed under MPL-2.0, CloudQuery combines the flexibility of open-source tooling with enterprise-grade performance for large-scale data movement.
When teams consider CloudQuery, these hosted platforms usually appear on the same shortlist.
Looking for a hosted option? These are the services engineering teams benchmark against before choosing open source.
Cloud Security Posture Management
Monitor and enforce security policies across AWS, GCP, and Azure infrastructure with continuous compliance scanning and unified visibility.
Multi-Cloud Asset Inventory
Collect and centralize cloud configuration data from all major providers into a single queryable database for governance and auditing.
Cloud FinOps Optimization
Unify billing data across cloud providers to identify cost-saving opportunities and track spending trends in real time.
AI Model Data Pipelines
Feed LLM pipelines and AI applications with high-volume data from diverse sources using Apache Arrow's efficient columnar format.
No. CloudQuery runs entirely on your infrastructure. Your data never touches CloudQuery's servers, ensuring complete privacy and compliance with data residency requirements.
CloudQuery supports hundreds of integrations including AWS, GCP, Azure, Kubernetes, GitHub, and many SaaS platforms. Destinations include PostgreSQL, BigQuery, Snowflake, S3, and more. Check the integrations hub for the full list.
CloudQuery is code-first and optimized for cloud infrastructure and security data, running on your infrastructure. It excels at CSPM, asset inventory, and FinOps use cases with specialized plugins, while Airbyte and Fivetran focus more on SaaS-to-warehouse replication.
Yes. CloudQuery provides an open plugin SDK supporting multiple languages. You can develop, extend, and ship custom plugins without vendor approval or lock-in.
CloudQuery framework, CLI, SDK, and some integrations are licensed under MPL-2.0, allowing commercial use with specific copyleft requirements for modifications.
Project at a glance
ActiveLast synced 4 days ago