
Alation
Data catalog platform for data discovery, governance, and lineage
Discover top open-source software, updated regularly with real-world adoption signals.

Unified metadata platform for modern data discovery and governance
DataHub provides a centralized catalog and real‑time metadata graph, enabling teams to discover, understand, and govern data across the modern data stack with extensible connectors and Kubernetes‑ready deployment.

DataHub is a centralized metadata platform that powers data discovery, lineage, and governance across the modern data stack. It ingests metadata from a wide range of sources via extensible connectors and stores it in a real‑time graph, enabling instant search and impact analysis.
Designed for data engineers, analysts, and governance teams, DataHub can be run locally with a single‑command Docker quickstart or scaled in production using the provided Helm charts on Kubernetes. The platform includes a web UI, GraphQL API, and a suite of actions that react to metadata changes in real time.
Backed by LinkedIn and a growing open‑source community, DataHub is used by enterprises such as LinkedIn, Expedia, and Udemy. Documentation, Slack support, monthly town halls, and a hosted demo environment help teams get up to speed quickly.
When teams consider DataHub, these hosted platforms usually appear on the same shortlist.
Looking for a hosted option? These are the services engineering teams benchmark against before choosing open source.
Cross‑source Data Discovery
Analysts can search across databases, dashboards, and pipelines from a single UI, reducing time to find relevant assets.
Data Governance and Lineage
Compliance teams visualize end‑to‑end data flow, enabling impact analysis and policy enforcement.
Data Mesh Catalog
Domain teams publish and consume metadata in a shared graph, fostering federated ownership and discoverability.
CI/CD Impact Analysis for dbt
GitHub Action comments on pull requests with downstream impact, helping developers avoid breaking changes.
The core platform is written in Java, with supporting services in Python, TypeScript, and Scala.
Yes, a hosted demo environment is available at demo.datahub.com.
You can run a local Docker quickstart or deploy to production using Helm charts on Kubernetes.
Yes, a Slack workspace is provided for discussions, announcements, and help.
Yes, the platform includes a real‑time metadata graph and actions framework that react to changes as they occur.
Project at a glance
ActiveLast synced 4 days ago