
Alation
Data catalog platform for data discovery, governance, and lineage
Discover top open-source software, updated regularly with real-world adoption signals.

Unified data discovery, lineage, and observability platform
A modern platform that centralizes data cataloging, lineage, quality, and security, enabling teams to discover, monitor, and govern data assets across diverse sources.

The Open Data Discovery (ODD) platform provides a federated data catalog with end‑to‑end lineage, quality dashboards, and security tagging. Designed for data engineers, analysts, and ML practitioners, it consolidates metadata from hundreds of sources, offering a single pane of glass to understand how data flows through pipelines, dashboards, and models.
ODD integrates with tools such as Airflow, DBT, Great Expectations, and many databases via native adapters. It logs ML experiment parameters automatically and supports reference data management for master data. Deployments are container‑first: run a single Docker image, use the provided docker‑compose demo, or install via Helm charts on Kubernetes. PostgreSQL stores all metadata, configured through environment variables.
By shortening discovery cycles, providing transparent usage insights, and enabling proactive data quality monitoring, ODD helps organizations foster a data‑centric culture while maintaining compliance and governance.
When teams consider ODD Platform, these hosted platforms usually appear on the same shortlist.
Looking for a hosted option? These are the services engineering teams benchmark against before choosing open source.
Accelerate dashboard creation
Analysts quickly locate source tables and understand lineage, reducing time to build reliable BI reports.
Automate data quality monitoring
Data engineers integrate Great Expectations tests, view failures in the DQ dashboard, and receive alerts to prevent downstream issues.
Track ML experiment provenance
ML teams log parameters and results automatically, enabling reproducibility and comparison across model runs.
Govern reference data
Data stewards manage lookup tables centrally, ensuring consistent codes and compliance across pipelines.
It stores metadata in PostgreSQL; you configure the connection via environment variables.
Yes, you can start a Docker container or use the provided docker‑compose demo.
ODD provides proxy adapters and native connectors for tools like Airflow, DBT, Great Expectations, and many databases.
The UI includes end‑to‑end lineage graphs for datasets, transformers, and consumers.
Yes, ODD is a reference implementation of the Open Data Discovery spec.
Project at a glance
StableLast synced 4 days ago