Best Data De-Identification Tools for Apache Spark

Find and compare the best Data De-Identification tools for Apache Spark in 2025

Use the comparison tool below to compare the top Data De-Identification tools for Apache Spark on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    Protegrity Reviews
    Our platform allows businesses to use data, including its application in advanced analysis, machine learning and AI, to do great things without worrying that customers, employees or intellectual property are at risk. The Protegrity Data Protection Platform does more than just protect data. It also classifies and discovers data, while protecting it. It is impossible to protect data you don't already know about. Our platform first categorizes data, allowing users the ability to classify the type of data that is most commonly in the public domain. Once those classifications are established, the platform uses machine learning algorithms to find that type of data. The platform uses classification and discovery to find the data that must be protected. The platform protects data behind many operational systems that are essential to business operations. It also provides privacy options such as tokenizing, encryption, and privacy methods.
  • 2
    PHEMI Health DataLab Reviews
    Unlike most data management systems, PHEMI Health DataLab is built with Privacy-by-Design principles, not as an add-on. This means privacy and data governance are built-in from the ground up, providing you with distinct advantages: Lets analysts work with data without breaching privacy guidelines Includes a comprehensive, extensible library of de-identification algorithms to hide, mask, truncate, group, and anonymize data. Creates dataset-specific or system-wide pseudonyms enabling linking and sharing of data without risking data leakage. Collects audit logs concerning not only what changes were made to the PHEMI system, but also data access patterns. Automatically generates human and machine-readable de- identification reports to meet your enterprise governance risk and compliance guidelines. Rather than a policy per data access point, PHEMI gives you the advantage of one central policy for all access patterns, whether Spark, ODBC, REST, export, and more
  • 3
    Tonic Reviews
    Tonic provides an automated solution for generating mock data that retains essential features of sensitive datasets, enabling developers, data scientists, and sales teams to operate efficiently while ensuring confidentiality. By simulating your production data, Tonic produces de-identified, realistic, and secure datasets suitable for testing environments. The data is crafted to reflect your actual production data, allowing you to convey the same narrative in your testing scenarios. With Tonic, you receive safe and practical data designed to emulate your real-world data at scale. This tool generates data that not only resembles your production data but also behaves like it, facilitating safe sharing among teams, organizations, and across borders. It includes features for identifying, obfuscating, and transforming personally identifiable information (PII) and protected health information (PHI). Tonic also ensures the proactive safeguarding of sensitive data through automatic scanning, real-time alerts, de-identification processes, and mathematical assurances of data privacy. Moreover, it offers advanced subsetting capabilities across various database types. In addition to this, Tonic streamlines collaboration, compliance, and data workflows, delivering a fully automated experience to enhance productivity. With such robust features, Tonic stands out as a comprehensive solution for data security and usability, making it indispensable for organizations dealing with sensitive information.
  • 4
    Mage Platform Reviews
    Protect, Monitor, and Discover enterprise sensitive data across multiple platforms and environments. Automate your subject rights response and demonstrate regulatory compliance - all in one solution
  • Previous
  • You're on page 1
  • Next