Best Data Lineage Tools of 2024

Find and compare the best Data Lineage tools in 2024

Use the comparison tool below to compare the top Data Lineage tools on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    Atlan Reviews
    The modern data workspace. All your data assets, from data tables to reports, will be instantly discoverable. The combination of powerful search algorithms and easy browsing makes it easy to find the right asset. Atlan automatically generates data quality profiles that make it easy to detect bad data. We have you covered, from automatic variable type detection and frequency distribution to missing values or outlier detection. Atlan takes the hassle out of managing and governing your data ecosystem. Atlan's bots analyze SQL query history to automatically construct data lineage. They also auto-detect PII information. This allows you to create dynamic access policies and best-in-class governance. Our Excel-like query builder allows anyone to query multiple data lakes, warehouses, and DBs. Native integrations with tools such as Tableau and Jupyter make data collaboration possible.
  • 2
    SolarWinds Database Mapper Reviews
    Do you want to generate documentation automatically from multiple data sources more easily? You wish you had a better understanding about the origin of your data and who has handled it. SolarWinds Database Mapper (formerly SentryOne Document), provides powerful documentation and data lineage analysis capabilities via a cloud or software solution. SolarWinds Database Mapper makes it easy to maintain current documentation and ensure compliance to data privacy regulations and business rules. It also tracks data lineage accurately. SolarWinds Database Mapper provides powerful tools to ensure that your databases are accurately and continuously documented. Data lineage analysis capabilities provide visual representations of the origin of your data to help you ensure compliance. Visual displays that clearly show data dependencies throughout your environment help you track data lineage. You can easily manage documentation tasks and view logs using an easy-to use cloud or software solution.
  • 3
    Axon Data Governance Reviews
    To support data-driven decision making, your teams need reliable data. Ensure they have it with automated, intelligent, and integrated data governance at scale. Axon Data Governance is the data marketplace and collaboration hub for successful, scalable data management programs. Facilitate knowledge transfer between communities and stakeholders to enable teams to learn from each other. With a carefully curated data marketplace, teams can quickly access, access, and understand data that is relevant to their analytics needs. Use governed data to support key initiatives, such as improving customer experience, and to deliver consistent, trusted results throughout your organization. To ensure compliance with regulations such as GDPR and CCPA, you should build governance and data privacy into your projects and processes from the beginning. To provide consistent business context across multiple tools, create a common data dictionary.
  • 4
    Privacera Reviews

    Privacera

    Privacera

    Multi-cloud data security with a single pane of glass Industry's first SaaS access governance solution. Cloud is fragmented and data is scattered across different systems. Sensitive data is difficult to access and control due to limited visibility. Complex data onboarding hinders data scientist productivity. Data governance across services can be manual and fragmented. It can be time-consuming to securely move data to the cloud. Maximize visibility and assess the risk of sensitive data distributed across multiple cloud service providers. One system that enables you to manage multiple cloud services' data policies in a single place. Support RTBF, GDPR and other compliance requests across multiple cloud service providers. Securely move data to the cloud and enable Apache Ranger compliance policies. It is easier and quicker to transform sensitive data across multiple cloud databases and analytical platforms using one integrated system.
  • 5
    Secuvy AI Reviews

    Secuvy AI

    Secuvy AI

    Secuvy, a next-generation cloud platform, automates data security, privacy compliance, and governance via AI-driven workflows. Unstructured data is treated with the best data intelligence. Secuvy, a next-generation cloud platform that automates data security, privacy compliance, and governance via AI-driven workflows is called Secuvy. Unstructured data is treated with the best data intelligence. Automated data discovery, customizable subjects access requests, user validations and data maps & workflows to comply with privacy regulations such as the ccpa or gdpr. Data intelligence is used to locate sensitive and private information in multiple data stores, both in motion and at rest. Our mission is to assist organizations in protecting their brand, automating processes, and improving customer trust in a world that is rapidly changing. We want to reduce human effort, costs and errors in handling sensitive data.
  • 6
    Coalesce Reviews
    It takes a lot time and manual coding to build and manage a fully documented data project. But not anymore. We can show you how we can help transform data faster. Column-aware architecture allows for reusable data patterns and change management at large scale. For safer and more predictable data operations, visibility is key to change management and impact analysis. Coalesce offers curated packages that use best-practice templates to generate native-SQL for Snowflakeâ„¢ automatically. Have a unique need? Templates are easily customizable. Coalesce makes it easy to navigate your data pipeline. Every screen and button are designed to give you easy access to all the information you need. Your data team has greater control over every project. You can instantly see audit and project history, as well as code comparisons side-by-side. Automatically, lineage at the table- and column-levels is provided and kept up-to-date.
  • 7
    SAP Information Steward Reviews
    SAP Information Steward software allows for data profiling, monitoring, and information policy management. It is the information governance layer of SAP Business Technology Platform and can help you to anticipate risk and achieve better business outcomes. To gain continuous insight into your enterprise's data model integrity, combine data profiling, metadata management, and data lineage. You will gain a better understanding about the data quality in your data management landscape while accessing and analysing metrics using intuitive dashboards and scorecards. Supporting analysts, data stewards, IT experts, and other professionals with consistent validation rules, guidelines, can improve enterprise information management initiatives. Data profiling and metadata management are two solutions that can help you discover, assess, define and monitor the quality of your enterprise's data assets. Run what-if analyses to forecast the savings that improved data quality could bring.
  • 8
    Datakin Reviews

    Datakin

    Datakin

    $2 per month
    You can instantly see the order in your complex data world and know exactly where to find answers. Datakin automatically tracks data lineage and displays your entire data ecosystem as a rich visual graph. It clearly shows the upstream and downstream relationships of each dataset. The Duration tab summarizes the job's performance and its upstream dependencies in a Gantt-style graph. This makes it easy to identify bottlenecks. The Compare tab allows you to see how your jobs and data have changed over time. Sometimes jobs that run well can produce poor output. The Quality tab shows you the most important data quality metrics and how they change over time. This makes anomalies easily visible. Datakin allows you to quickly identify the root cause of problems and prevent them from happening again.
  • 9
    Blindata Reviews

    Blindata

    Blindata

    $2000/year/user
    Blindata is a comprehensive Data Governance program that includes all functions. Data Catalog, Data Lineage & Business Glossary provide a complete and integrated view of your Data. Data Classification gives data a semantic meaning, while Data Quality Modules, Issue Management and Data Stewardship modules increase the reliability and trust of data. Privacy compliance can also be facilitated by specific features. Registry of processing activities, central management of privacy notes, consent registry with Blockchain integration. Blindata Agent is able to connect to multiple data sources and collect metadata, such as data structures (Tables Views Fields ...), data Quality metrics, reverse lineage etc.). Blindata's modular architecture is entirely API-based, allowing for systematic integration with business systems of the highest importance (DBMS, Active Directory e-commerce and Data Platforms). Blindata can be purchased as a SaaS or installed "on Premise", or it can be purchased from AWS Marketplace.
  • 10
    Montara Reviews

    Montara

    Montara

    $100/user/month
    Montara enables BI Teams and Data Analysts to model and transform data using SQL alone, easily and seamlessly, and enjoy benefits such a modular code, CI/CD and versioning, automated testing and documentation. With Montara, analysts are able to quickly understand the impact of changes in models on analysis, reports, and dashboards. Report-level lineage is supported, as well as support for 3rd-party visualization tools like Tableau and Looker. BI teams can also perform ad hoc analysis, create dashboards and reports directly on Montara.
  • 11
    Foundational Reviews

    Foundational

    Foundational

    Identify code issues and optimize code in real-time. Prevent data incidents before deployment. Manage code changes that impact data from the operational database all the way to the dashboard. Data lineage is automated, allowing for analysis of every dependency, from the operational database to the reporting layer. Foundational automates the enforcement of data contracts by analyzing each repository, from upstream to downstream, directly from the source code. Use Foundational to identify and prevent code and data issues. Create controls and guardrails. Foundational can be configured in minutes without requiring any code changes.
  • 12
    Collibra Reviews
    The Collibra Data Intelligence Cloud offers a best-in class catalog, flexible governance and continuous quality. It also has built-in privacy. A best-in-class data catalogue that supports your users includes embedded governance, privacy, and quality. You can raise the bar by ensuring that teams can quickly access, understand, and access data from all sources, including business applications and data science tools, in one central location. Your data deserves privacy. Automate, centralize and guide workflows to encourage collaboration and operationalize privacy. Collibra Data Lineage gives you the complete story about your data. Automatically map relationships between applications, systems, and reports to provide a context rich view of the enterprise. Focus on the data that you are most concerned about and make sure it is accurate, complete, and trustworthy.
  • 13
    Trifacta Reviews
    The fastest way to prepare data and build data pipelines in cloud. Trifacta offers visual and intelligent guidance to speed up data preparation to help you get to your insights faster. Poor data quality can cause problems in any analytics project. Trifacta helps you to understand your data and can help you quickly and accurately clean up it. All the power without any code. Trifacta offers visual and intelligent guidance to help you get to the right insights faster. Manual, repetitive data preparation processes don't scale. Trifacta makes it easy to build, deploy, and manage self-service data networks in minutes instead of months.
  • 14
    Databand Reviews
    Monitor your data health, and monitor your pipeline performance. Get unified visibility for all pipelines that use cloud-native tools such as Apache Spark, Snowflake and BigQuery. A platform for Data Engineers that provides observability. Data engineering is becoming more complex as business stakeholders demand it. Databand can help you catch-up. More pipelines, more complexity. Data engineers are working with more complex infrastructure and pushing for faster release speeds. It is more difficult to understand why a process failed, why it is running late, and how changes impact the quality of data outputs. Data consumers are frustrated by inconsistent results, model performance, delays in data delivery, and other issues. A lack of transparency and trust in data delivery can lead to confusion about the exact source of the data. Pipeline logs, data quality metrics, and errors are all captured and stored in separate, isolated systems.
  • 15
    IBM DataStage Reviews
    Cloud-native data integration with IBM Cloud Pak data enables you to accelerate AI innovation AI-powered data integration from anywhere. Your AI and analytics can only be as good as the data they are powered by. IBM®, DataStage®, for IBM Cloud Pak®, for Data provides high-quality data through a container-based architecture. It combines industry-leading data integration, DataOps, governance, and analytics on one data and AI platform. Automation speeds up administrative tasks, helping to reduce TCO. AI-based design accelerators, out-of-the box integration with DataOps or data science services accelerate AI innovation. Multicloud integration and parallelism allow you to deliver trusted data across hybrid and multicloud environments. The IBM Cloud Pak for Data platform allows you to manage the data and analytics lifecycle. Data science, event messaging, and data warehousing are some of the services offered. Automated load balancing and parallel engine.
  • 16
    ASG Data Intelligence Reviews
    There is a greater demand for data-driven insight and innovation than ever before. To maintain a competitive edge in today’s global enterprises, it is essential to be able to use trusted data to make informed business decisions. Despite the fact that most companies have a lot of data, business leaders often don't know how to access it. ASG Data Intelligence is the solution to data distrust. It is a metadata-driven platform which makes technical data "smarter". It provides end-to-end views and movements of data (data lineage), as well as business meanings and usage guardrails. Data value can be unleashed when it is made available, understood, and trusted by all users in your organization, including data scientists, analysts and marketers. Improved understanding of data's origins, business context and processes will help you build trust in it.
  • 17
    Kylo Reviews

    Kylo

    Teradata

    Kylo is an enterprise-ready open-source data lake management platform platform for self-service data ingestion and data preparation. It integrates metadata management, governance, security, and best practices based on Think Big's 150+ big-data implementation projects. Self-service data ingest that includes data validation, data cleansing, and automatic profiling. Visual sql and an interactive transformation through a simple user interface allow you to manage data. Search and explore data and metadata. View lineage and profile statistics. Monitor the health of feeds, services, and data lakes. Track SLAs and troubleshoot performance. To enable user self-service, create batch or streaming pipeline templates in Apache NiFi. While organizations can spend a lot of engineering effort to move data into Hadoop, they often struggle with data governance and data quality. Kylo simplifies data ingest and shifts it to data owners via a simple, guided UI.
  • 18
    Tokern Reviews
    Open source data governance suite to manage data lakes and databases. Tokern is an easy-to-use toolkit for collecting, organizing and analysing metadata from data lakes. Runs as a command-line application for quick tasks. Run as a service to continuously collect metadata. Use reporting dashboards to analyze lineage, access control, and PII data. Or programmatically in Jupyter notebooks. Tokern is an open-source data governance suite for data lakes and databases. You can improve the ROI of your data, comply to regulations like HIPAA, CCPA, and GDPR, and protect your data from insider threats with confidence. Centralized metadata management for users, jobs, and datasets. Other data governance features are powered by this feature. Track column-level data lineage for Snowflake and AWS Redshift. You can build lineage using query history or ETL scripts. Interactive graphs and programming with APIs and SDKs allow you to explore lineage.
  • 19
    Apache Atlas Reviews

    Apache Atlas

    Apache Software Foundation

    Atlas is a flexible and extensible set core foundational governance services that enable enterprises to efficiently and effectively meet their compliance requirements within Hadoop. It also allows integration with the entire enterprise data ecosystem. Apache Atlas offers open metadata management and governance capabilities that allow organizations to create a catalog of their data assets, classify, govern and provide collaboration capabilities around these assets for data scientists, analysts, and the data governance group. Pre-defined types to manage various Hadoop and non Hadoop metadata. Ability to create new types to manage metadata. Types can inherit from other types, and can have simple attributes, complex attributes, and object references. Type instances, also known as entities, are able to capture metadata object details and their relationships. REST APIs allow for easier integration with types and instances.
  • 20
    Truedat Reviews

    Truedat

    Bluetab Solutions

    Bluetab Solutions developed Truedat, an open-source data governance business solution tool. It was created to help our clients become data-driven businesses. We assist in defining business processes, roles and responsibilities. We can also help you put these processes into action. Integration and customization of truedat’s open-source components to support data governance processes. We guarantee the maintenance and support of the solution modules we have installed. Based on our extensive experience, we have created a solution that addresses the need for Data Governance. This allows you to manage complex and changing data architectures. Truedat is becoming more important due to the increasing migration of enterprise IT platforms into cloud, multi-cloud, hybrid architectures. This increases the complexity, sources, and types of data. Our Data Governance consulting and development experience spans more than 8 years.
  • 21
    Talend Data Catalog Reviews
    Talend Data Catalog provides your organization with a single point of control for all your data. Data Catalog provides robust tools for search, discovery, and connectors that allow you to extract metadata from almost any data source. It makes it easy to manage your data pipelines, protect your data, and accelerate your ETL process. Data Catalog automatically crawls, profiles and links all your metadata. Data Catalog automatically documents up to 80% of the data associated with it. Smart relationships and machine learning keep the data current and up-to-date, ensuring that the user has the most recent data. Data governance can be made a team sport by providing a single point of control that allows you to collaborate to improve data accessibility and accuracy. With intelligent data lineage tracking and compliance tracking, you can support data privacy and regulatory compliance.
  • 22
    Data360 Govern Reviews
    Although your organization understands the value of data, and how it must be accessible to business users for maximum impact, enterprise data governance can make it difficult to trust, understand, or find that data. Data360 Govern, an enterprise data governance, metadata, and catalog management solution, gives you confidence in your data's quality, value, and trustworthiness. It automates governance tasks and stewardship tasks, helping you to answer critical questions about your data's origin, use, meaning, ownership and quality. Data360 Govern allows you to make faster decisions about data usage and management, foster collaboration across your organization, and give users the ability to get the answers they require - whenever they need them. Transparency into your company's data landscape allows you to track the most critical data that aligns with your business goals.
  • 23
    Global IDs Reviews

    Global IDs

    Global IDs

    Global IDs offers a variety of Enterprise Data Solutions, including data governance, cloud migration, compliance, privacy, analytics, and rationalization. Global IDs EDA Platform features include automated discovery and profiling as well as data classification, data lineage and data quality. These functions make data transparent, trustworthy, and easily understandable for all members of the ecosystem. Global IDs EDA platform architecture was designed to integrate from the ground up, with all platform functionality available via APIs. Global IDs EDA platform automates data administration for enterprises of all sizes and data ecosystems.
  • 24
    DataHawk Reviews

    DataHawk

    We-Bridge

    Visualize data lineage automatically extracting data flow data source to target. Data lineage management software that automatically collects and analyzes mission-critical data. It also visualizes data flow and derivation rules from data source to target. Data Lineage refers to the flow of data between the source and the target. Tracking Data Lineage is about understanding the flow and derivation rules of data processed, transformed, and used. Multi-tier column-level data lineage graph and list, from source to destination. Drill down data lineage at the business system, column and table levels. Provide parsers to support analysis of Big Data technologies and various environments. Our patented technology allows for path sensitive dynamic string analysis and data flow analysis within programs.
  • 25
    Sifflet Reviews
    Automate the automatic coverage of thousands of tables using ML-based anomaly detection. 50+ custom metrics are also available. Monitoring of metadata and data. Comprehensive mapping of all dependencies between assets from ingestion to reporting. Collaboration between data consumers and data engineers is enhanced and productivity is increased. Sifflet integrates seamlessly with your data sources and preferred tools. It can run on AWS and Google Cloud Platform as well as Microsoft Azure. Keep an eye on your data's health and notify the team if quality criteria are not being met. In a matter of seconds, you can set up the basic coverage of all your tables. You can set the frequency, criticality, and even custom notifications. Use ML-based rules for any anomaly in your data. There is no need to create a new configuration. Each rule is unique because it learns from historical data as well as user feedback. A library of 50+ templates can be used to complement the automated rules.