Best Data Discovery Software for AWS Glue

Find and compare the best Data Discovery software for AWS Glue in 2026

Use the comparison tool below to compare the top Data Discovery software for AWS Glue on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    DataHub Reviews
    Locating the appropriate data shouldn't resemble the daunting task of finding a needle in a haystack. DataHub's advanced discovery engine empowers users to pinpoint exactly what they seek through intuitive natural language searches, intelligent recommendations, and extensive contextual insights. Effortlessly explore datasets, dashboards, pipelines, and more, with results organized by relevance, popularity, and your team's engagement patterns. Each data asset is accompanied by detailed context—such as descriptions, schemas, sample datasets, usage metrics, and quality indicators—enabling users to assess the suitability of the data before getting started. Interactive features like discussions, annotations, and documentation make shared knowledge accessible and easy to search. DataHub adapts to user interactions, highlighting frequently accessed assets and recommending related data that has proven beneficial for others. Whether you are a data scientist in search of training data, an analyst crafting a report, or a business user tackling an urgent inquiry, DataHub streamlines your journey to the right data.
  • 2
    Protegrity Reviews
    Our platform lets businesses use data, including in advanced analytics, machine learning, and AI, without worrying that customers, employees, or intellectual property are put at risk. The Protegrity Data Protection Platform does more than protect data: it also discovers and classifies it, because you cannot protect data you don't know about. The platform first categorizes data, letting users classify the types of data that most commonly end up in the public domain. Once those classifications are established, machine learning algorithms find data of those types. Classification and discovery together identify the data that must be protected. The platform protects data behind the many operational systems that are essential to business operations, and it provides privacy options such as tokenization, encryption, and other privacy methods.
  • 3
    Select Star Reviews

    Select Star

    $270 per month
    In just 15 minutes, you can set up your automated data catalog and receive column-level lineage, entity-relationship diagrams, and auto-populated documentation within 24 hours. You can easily tag, search, and document data so that everyone can find the dataset they need. Select Star automatically detects and displays your column-level data lineage, so you can trust the data by knowing where it came from. It also shows how your company uses data, which lets you identify relevant data fields without having to ask anyone else. Select Star protects your data in line with the AICPA SOC 2 Security, Confidentiality, and Availability standards.
  • 4
    Privacera Reviews
    Multi-cloud data security through a single pane of glass, from the industry's first SaaS access governance solution. Cloud environments are fragmented, and data is scattered across different systems. Limited visibility makes sensitive data difficult to access and control. Complex data onboarding hinders data scientists' productivity, governance across services is often manual and fragmented, and moving data to the cloud securely can be time-consuming. Privacera maximizes visibility and helps assess the risk of sensitive data distributed across multiple cloud service providers. One system lets you manage data policies for multiple cloud services in a single place. It supports right-to-be-forgotten (RTBF), GDPR, and other compliance requests across multiple cloud service providers. It also lets you move data to the cloud securely, enable Apache Ranger compliance policies, and transform sensitive data across multiple cloud databases and analytical platforms more easily and quickly using one integrated system.
  • 5
    Amazon DataZone Reviews
    Amazon DataZone serves as a comprehensive data management solution that empowers users to catalog, explore, share, and regulate data from various sources, including AWS, on-premises systems, and third-party platforms. It provides administrators and data stewards with the ability to manage and oversee data access with precision, guaranteeing that users possess the correct level of permissions and contextual understanding. This service streamlines data access for a diverse range of professionals, such as engineers, data scientists, product managers, analysts, and business users, thereby promoting insights driven by data through enhanced collaboration. Among its notable features are a business data catalog that enables searching and requesting access to published datasets, tools for project collaboration to oversee and manage data assets, a user-friendly web portal offering tailored views for data analysis, and regulated data sharing workflows that ensure proper access. Furthermore, Amazon DataZone leverages machine learning to automate the processes of data discovery and cataloging, making it an invaluable resource for organizations striving to maximize their data utility. As a result, it significantly enhances the efficiency of data governance and utilization across various business functions.