Best Data Discovery & Classification Tools

Explore leading tools in the Data Discovery & Classification category, including open-source options and SaaS products. Compare features, use cases, and find the best fit for your workflow.

5 open-source projects · 4 SaaS products

Top open-source Data Discovery & Classification

These projects are active, self-hostable choices for knowledge management teams evaluating alternatives to SaaS tools.

Presidio logo

Presidio

Context‑aware, extensible SDK for detecting and redacting PII

Stars
6,247
License
MIT
Last commit
3 days ago
PythonActive
DataProfiler logo

DataProfiler

Instantly profile data and uncover hidden sensitive information

Stars
1,529
License
Apache-2.0
Last commit
2 months ago
PythonActive
PIICatcher logo

PIICatcher

Detect and tag PII across databases and data warehouses

Stars
327
License
Apache-2.0
Last commit
1 year ago
PythonDormant
Most starred project
6,247★

Context‑aware, extensible SDK for detecting and redacting PII

Recently updated
3 days ago

Presidio provides a pluggable framework to identify, mask, and anonymize personally identifiable information in text, images, and structured data, supporting custom recognizers, multiple languages, and deployment via Python, Docker, or Kubernetes.

Dominant language
Python • 5 projects

Expect a strong Python presence among maintained projects.

Popular SaaS Platforms to Replace

Understand the commercial incumbents teams migrate from and how many open-source alternatives exist for each product.

Amazon Macie logo

Amazon Macie

Managed sensitive data discovery and protection for Amazon S3.

Data Discovery & Classification
Alternatives tracked
5 alternatives
BigID logo

BigID

Data intelligence platform focused on data privacy, security, and governance through sensitive data discovery and classification

Data Discovery & Classification
Alternatives tracked
5 alternatives
OneTrust logo

OneTrust

Unified trust platform for privacy, consent, data governance, and compliance automation.

Compliance Automation & GRCData Discovery & Classification
Alternatives tracked
5 alternatives
Securiti logo

Securiti

DSPM and Data+AI security platform for discovery, classification, and governance.

Data Discovery & ClassificationCompliance Automation & GRC
Alternatives tracked
5 alternatives
Most compared product
5 open-source alternatives

Amazon Macie uses ML and pattern matching to automatically discover, classify, and monitor sensitive data in S3, providing visibility into risks and enabling automated protection.

Leading hosted platforms

Frequently replaced when teams want private deployments and lower TCO.

Explore related categories

Browse neighbouring categories in Security to widen your evaluation.