Best Metadata Management Software for Databricks

Find and compare the best Metadata Management software for Databricks in 2026

Use the comparison tool below to compare the top Metadata Management software for Databricks on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    DataHub Reviews
    Metadata serves as the essential framework for today's data ecosystems, and how well it is managed can make the difference between order and disorder. DataHub offers a robust solution for metadata management that can accommodate anywhere from thousands to millions of data entities, all while ensuring a swift and user-friendly experience. You can easily ingest metadata from over 100 different sources via adaptable push and pull methods, consolidate it into a cohesive graph model, and access it through high-speed APIs. The metadata architecture of DataHub is designed to be flexible—allowing you to incorporate custom attributes, entity types, and relationships without requiring code modifications. Monitor the evolution of your metadata with comprehensive versioning and audit trails to see how schemas, ownership, and policies shift over time. Additionally, you can automatically propagate metadata across interconnected entities; for instance, tagging a dataset will ensure those tags are seamlessly transmitted to related dashboards.
  • 2
    Alation Reviews
    The Alation Agentic Data Intelligence Platform is designed to transform how enterprises manage, govern, and use data for AI and analytics. It combines search, cataloging, governance, lineage, and analytics into one unified solution, turning metadata into actionable insights. AI-powered agents automate critical tasks like documentation, data quality monitoring, and product creation, freeing teams from repetitive manual work. Its Active Metadata Graph and workflow automation capabilities ensure that data remains accurate, consistent, and trustworthy across systems. With 120+ pre-built connectors, including integrations with AWS, Snowflake, Salesforce, and Databricks, Alation integrates seamlessly into enterprise ecosystems. The platform enables organizations to govern AI responsibly, ensuring compliance, transparency, and ethical use of data. Enterprises benefit from improved self-service analytics, faster data-driven decisions, and a stronger data culture. With industry leaders like Salesforce and 40% of the Fortune 100 relying on it, Alation is proven to help businesses unlock the value of their data.
  • 3
    ER/Studio Enterprise Edition Reviews
    ER/Studio is an enterprise data modeling and architecture solution that helps organizations structure, align, and govern data across complex, distributed environments, including data warehouses, lakehouses, data mesh frameworks, and data vault architectures. It bridges business intent and technical execution through integrated conceptual, logical, and physical modeling, enabling teams to move from strategy to implementation with clarity and control. The result is a consistent architectural foundation that supports analytics, AI initiatives, modernization, regulatory requirements, and operational systems. Collaboration is built into the platform through a centralized, multi-user repository and the web-based Team Server portal. The repository manages version control, role-based permissions, and parallel development so teams can work concurrently while preserving model integrity and full audit history. Team Server extends visibility beyond architects, allowing business and technical stakeholders to review models, explore definitions, and contribute feedback through a browser interface. ER/Studio reinforces governance by embedding standardized definitions, business glossaries, and data dictionaries directly within technical models. Impact analysis provides insight into downstream dependencies before changes are implemented, helping reduce risk and improve coordination. Integrations with Microsoft Purview and Collibra extend metadata into broader governance ecosystems, strengthening lineage tracking, documentation accuracy, and compliance oversight. Available in Standard, Professional, and Enterprise editions, ER/Studio scales from focused modeling teams to enterprise-wide data architecture programs with advanced collaboration and governance requirements.
  • 4
    Hackolade Reviews

    Hackolade

    €175 per month
    Hackolade Studio is a comprehensive data modeling platform built for today’s complex and hybrid data ecosystems. Originally developed to address the lack of visual design tools for NoSQL databases, Hackolade has evolved into a multi-model solution that supports the broadest range of data technologies in the industry. The platform enables agile, iterative schema design and governance for both structured and semi-structured data, making it ideal for organizations working across traditional RDBMS, modern data warehouses, NoSQL stores, and streaming systems. Hackolade supports technologies such as Oracle, PostgreSQL, BigQuery, Databricks, Redshift, Snowflake, MongoDB, Cassandra, DynamoDB, Neo4j, Kafka (with Confluent Schema Registry), OpenAPI, GraphQL, and more. Beyond databases, Hackolade Studio offers robust capabilities for API modeling, supporting OpenAPI (Swagger) and GraphQL, as well as native modeling for data exchange formats like JSON Schema, Avro, Protobuf, Parquet, and YAML. It also integrates with metadata and data governance platforms like Unity Catalog and Collibra, making it a powerful enabler for organizations focused on data quality, lineage, and compliance. Key features include reverse and forward engineering, schema versioning, data type mapping, and team collaboration tools. Whether you're building data products, managing data contracts, or migrating between systems, Hackolade Studio provides a unified interface for modeling, documenting, and evolving your schemas. Hackolade is trusted by enterprises across finance, retail, healthcare, and telecom to align data architecture with real-world delivery. It’s an essential tool for teams implementing data mesh, data fabric, microservices, or API-first strategies.
  • 5
    Decube Reviews
    Decube is a comprehensive data management platform designed to help organizations manage their data observability, data catalog, and data governance needs. Our platform is designed to provide accurate, reliable, and timely data, enabling organizations to make better-informed decisions. Our data observability tools provide end-to-end visibility into data, making it easier for organizations to track data origin and flow across different systems and departments. With our real-time monitoring capabilities, organizations can detect data incidents quickly and reduce their impact on business operations. The data catalog component of our platform provides a centralized repository for all data assets, making it easier for organizations to manage and govern data usage and access. With our data classification tools, organizations can identify and manage sensitive data more effectively, ensuring compliance with data privacy regulations and policies. The data governance component of our platform provides robust access controls, enabling organizations to manage data access and usage effectively. Our tools also allow organizations to generate audit reports, track user activity, and demonstrate compliance with regulatory requirements.
  • 6
    Collate Reviews
    Collate is a metadata platform powered by AI that equips data teams with automated tools for discovery, observability, quality, and governance, utilizing agent-based workflows for efficiency. It is constructed on the foundation of OpenMetadata and features a cohesive metadata graph, providing over 90 seamless connectors for gathering metadata from various sources like databases, data warehouses, BI tools, and data pipelines. This platform not only offers detailed column-level lineage and data profiling but also implements no-code quality tests to ensure data integrity. The AI agents play a crucial role in streamlining processes such as data discovery, permission-sensitive querying, alert notifications, and incident management workflows on a large scale. Furthermore, the platform includes real-time dashboards, interactive analyses, and a shared business glossary that cater to both technical and non-technical users, facilitating the management of high-quality data assets. Additionally, its continuous monitoring and governance automation help uphold compliance with regulations such as GDPR and CCPA, which significantly minimizes the time taken to resolve data-related issues and reduces the overall cost of ownership. This comprehensive approach to data management not only enhances operational efficiency but also fosters a culture of data stewardship across the organization.
  • 7
    Ataccama ONE Reviews
    Ataccama is a revolutionary way to manage data and create enterprise value. Ataccama unifies Data Governance, Data Quality, and Master Data Management into one AI-powered fabric that can be used in hybrid and cloud environments. This gives your business and data teams unprecedented speed while ensuring trust, security, and governance of your data.
  • 8
    Informatica Enterprise Data Catalog Reviews
    Efficiently scan and catalog metadata, uncover and characterize data, while offering comprehensive lineage tracking over millions of datasets. Organize and classify data assets across diverse environments to enhance their value and facilitate reuse. Perform automated scanning across multi-cloud environments, business intelligence tools, ETL processes, and external metadata catalogs, along with various data types. Utilize AI-driven capabilities for domain discovery, data similarity assessment, business term linkages, and tailored recommendations. Monitor data movement with precision, ranging from overarching system perspectives to detailed column-level lineage, accompanied by thorough impact assessments. Access the Data Asset Analytics dashboard to gain insights into asset utilization, enrichment processes, and collaborative efforts. Examine data quality protocols, scorecards, metric clusters, and profiling statistics within their relevant contexts. Engage with shared data intelligence through certifications, ratings and feedback, a Q&A feature, and timely change alerts. What truly distinguishes Informatica is its extensive and robust suite of enterprise-grade data management solutions, ensuring comprehensive support for diverse data needs. With such capabilities, organizations can navigate their data landscapes more effectively and make informed decisions.
  • 9
    Select Star Reviews

    Select Star

    $270 per month
    In just 15 minutes, you can set up your automated data catalog and receive column-level lineage, entity relationship diagrams, and auto-populated documentation within 24 hours. You can easily tag, find, and document data so everyone can find the right data for their needs. Select Star automatically detects and displays your column-level data lineage, so you can trust the data by knowing where it came from. It also shows how your company uses data, which lets you identify relevant data fields without having to ask anyone else. Select Star protects your data in line with the AICPA SOC 2 Security, Confidentiality, and Availability standards.
  • 10
    OpenMetadata Reviews
    OpenMetadata serves as a comprehensive, open platform for unifying metadata, facilitating data discovery, observability, and governance through a single interface. By utilizing a Unified Metadata Graph alongside over 80 ready-to-use connectors, it aggregates metadata from various sources such as databases, pipelines, BI tools, and ML systems, thereby offering an extensive context for teams to effectively search, filter, and visualize assets throughout their organization. The platform is built on an API- and schema-first architecture, which provides flexible metadata entities and relationships, allowing organizations to tailor their metadata structure with precision. Comprising only four essential system components, OpenMetadata is crafted for straightforward installation and operation, ensuring scalable performance that empowers both technical and non-technical users to work together seamlessly on discovery, lineage tracking, quality assurance, observability, collaboration, and governance tasks without the need for intricate infrastructure. This versatility makes it an invaluable tool for organizations aiming to harness their data assets more effectively.
  • 11
    Acryl Data Reviews
    Bid farewell to abandoned data catalogs. Acryl Cloud accelerates time-to-value by implementing Shift Left methodologies for data producers and providing an easy-to-navigate interface for data consumers. It enables the continuous monitoring of data quality incidents in real-time, automating anomaly detection to avert disruptions and facilitating swift resolutions when issues arise. With support for both push-based and pull-based metadata ingestion, Acryl Cloud simplifies maintenance, ensuring that information remains reliable, current, and authoritative. Data should be actionable and operational. Move past mere visibility and leverage automated Metadata Tests to consistently reveal data insights and identify new opportunities for enhancement. Additionally, enhance clarity and speed up resolutions with defined asset ownership, automatic detection, streamlined notifications, and temporal lineage for tracing the origins of issues while fostering a culture of proactive data management.
  • 12
    DataGalaxy Reviews
    DataGalaxy is redefining how organizations govern and activate their data through a single, collaborative platform built for both business and technical teams. Its data and analytics governance solution provides the visibility, control, and alignment needed to transform data into a true business asset. The platform unites automated data cataloging, AI-driven lineage, and value-based prioritization to ensure every initiative is intentional and measurable. With features like the strategy cockpit and value tracking center, organizations can connect business objectives to actionable data outcomes and monitor ROI in real time. Over 70 native connectors integrate seamlessly with tools like Snowflake, Azure Synapse, Databricks, Power BI, and HubSpot, breaking down data silos across hybrid environments. DataGalaxy also embeds AI-powered assistants and compliance automation for frameworks like GDPR, HIPAA, and SOC 2, making governance intuitive and secure. Trusted by global enterprises including Airbus and Bank of China, the platform is both scalable and enterprise-ready. By blending data discovery, collaboration, and security, DataGalaxy helps organizations move from reactive governance to proactive value creation.
  • 13
    Talend Data Catalog Reviews
    Talend Data Catalog provides your organization with a single point of control for all your data. It offers robust tools for search and discovery, along with connectors that let you extract metadata from almost any data source. It makes it easy to manage your data pipelines, protect your data, and accelerate your ETL processes. Data Catalog automatically crawls, profiles, and links all your metadata, and automatically documents up to 80% of the associated data. Smart relationships and machine learning keep the data up to date, so users always have the most recent version. A single point of control also makes data governance a team sport, letting you collaborate to improve data accessibility and accuracy. With intelligent data lineage tracking and compliance tracking, you can support data privacy and regulatory compliance.