
Dropbox
Cloud file storage and sync for teams and individuals
Discover top open-source software, updated regularly with real-world adoption signals.

Cloud-native distributed file and object storage system
CNCF-graduated distributed storage platform offering POSIX, HDFS, and S3 protocols with scalable metadata, multi-tenancy, and hybrid cloud acceleration for modern data infrastructure.

CubeFS is a cloud-native distributed file and object storage system designed for organizations building scalable data infrastructure. As a CNCF graduated project, it serves enterprises needing datacenter filesystems, data lake storage, or hybrid cloud solutions. It's particularly valuable for teams running databases, search systems, and AI/ML workloads that benefit from storage/compute separation.
The platform provides multiple access protocols including POSIX, HDFS, S3, and REST API, enabling seamless integration with existing toolchains. Its highly scalable metadata service ensures strong consistency while optimizing performance for both large and small files across sequential and random write patterns. CubeFS supports multi-tenancy with robust isolation, flexible storage policies ranging from high-performance replication to cost-effective erasure coding, and multi-level caching for hybrid cloud I/O acceleration.
CubeFS runs on-premises as datacenter infrastructure, in private or hybrid cloud environments, and can layer atop public cloud storage like S3 to provide filesystem semantics and cache acceleration. Built in Go and licensed under Apache-2.0, it's designed for Kubernetes-native deployments and large-scale container platforms.
When teams consider CubeFS, these hosted platforms usually appear on the same shortlist.
Looking for a hosted option? These are the services engineering teams benchmark against before choosing open source.
Data Lake Storage Infrastructure
Scalable, multi-protocol storage foundation supporting analytics workloads with HDFS and S3 compatibility
AI/ML Training Pipeline
Decoupled storage and compute enabling elastic scaling of training jobs with POSIX filesystem semantics
Hybrid Cloud Acceleration
Multi-level caching layer over public cloud S3 reducing latency and egress costs for on-premises applications
Multi-Tenant SaaS Platform
Isolated storage namespaces with flexible policies balancing performance and cost per tenant
CubeFS supports POSIX, HDFS, S3, and its own REST API, enabling integration with diverse application ecosystems and toolchains.
Yes, CubeFS is a CNCF graduated project with production deployments. Use stable releases rather than the master branch for production environments.
CubeFS provides a highly scalable metadata service with strong consistency, designed to handle large-scale deployments efficiently.
Yes, CubeFS can run atop public cloud storage like S3, providing filesystem semantics and cache acceleration for hybrid cloud architectures.
CubeFS supports flexible policies including high-performance replication for speed and low-cost erasure coding for capacity optimization.
Project at a glance
ActiveLast synced 4 days ago