Best Data Lineage Tools for GitHub

Find and compare the best Data Lineage tools for GitHub in 2026

Use the comparison tool below to compare the top Data Lineage tools for GitHub on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    AnalyticsCreator Reviews
    See Tool
    Learn More
    Elevate your data governance strategy by incorporating robust lineage tracking features that provide a thorough understanding of your data's origins and its transformations. This enhanced visibility not only supports compliance by maintaining verifiable lineage records but also accelerates root cause analysis for any data quality concerns. Rapidly pinpoint and address data quality challenges through actionable insights. With AnalyticsCreator, boost transparency, ensure compliance, and enhance data reliability by offering an in-depth lineage overview of your entire data landscape. Equip your teams to conduct impact assessments and make well-informed decisions quickly, all while enjoying a visual representation of data relationships and movement.
  • 2
    Microsoft Purview Reviews
    Microsoft Purview serves as a comprehensive data governance platform that facilitates the management and oversight of your data across on-premises, multicloud, and software-as-a-service (SaaS) environments. With its capabilities in automated data discovery, sensitive data classification, and complete data lineage tracking, you can effortlessly develop a thorough and current representation of your data ecosystem. This empowers data users to access reliable and valuable data easily. The service provides automated identification of data lineage and classification across various sources, ensuring a cohesive view of your data assets and their interconnections for enhanced governance. Through semantic search, users can discover data using both business and technical terminology, providing insights into the location and flow of sensitive information within a hybrid data environment. By leveraging the Purview Data Map, you can lay the groundwork for effective data utilization and governance, while also automating and managing metadata from diverse sources. Additionally, it supports the classification of data using both predefined and custom classifiers, along with Microsoft Information Protection sensitivity labels, ensuring that your data governance framework is robust and adaptable. This combination of features positions Microsoft Purview as an essential tool for organizations seeking to optimize their data management strategies.
  • 3
    SQLFlow Reviews

    SQLFlow

    Gudu Software

    $49.99 per month
    SQLFlow offers a comprehensive visual overview of data flow through various systems. It automates the analysis of SQL data lineage across a multitude of platforms, including databases, ETL processes, business intelligence tools, and environments like cloud and Hadoop, by effectively parsing SQL scripts and stored procedures. The tool graphically illustrates all data movements, supporting over 20 leading databases and continuously expanding its capabilities. It allows for automation in lineage construction regardless of the SQL's location, whether in databases, file systems, or repositories such as GitHub and Bitbucket. The user-friendly interface ensures that data flows are presented in a clear and easily understandable manner. By providing complete visibility into your business intelligence environment, it aids in pinpointing the root causes of reporting errors, fostering invaluable confidence in business processes. Additionally, it streamlines regulatory compliance efforts, while the visualization of data lineage enhances transparency and auditability. Users can conduct impact analysis at a detailed level, enabling a thorough examination of lineage down to tables, columns, and queries. With SQLFlow, you can seamlessly integrate powerful data lineage analysis capabilities into your product, thereby elevating your data management strategy. This tool not only simplifies complex tasks but also empowers teams to make informed decisions based on reliable insights.
  • 4
    Y42 Reviews

    Y42

    Datos-Intelligence GmbH

    Y42 is the first fully managed Modern DataOps Cloud for production-ready data pipelines on top of Google BigQuery and Snowflake.
  • 5
    Mozart Data Reviews
    Mozart Data is the all-in-one modern data platform for consolidating, organizing, and analyzing your data. Set up a modern data stack in an hour, without any engineering. Start getting more out of your data and making data-driven decisions today.
  • 6
    SAS Viya Reviews
    SAS Viya is an advanced cloud-native data and AI platform designed to help organizations manage data, develop AI models, and operationalize analytics from one unified environment. The platform combines data access, machine learning, analytics, governance, and decision deployment into a scalable system built for enterprise use. SAS Viya enables businesses to connect to data across multiple sources while maintaining transparency, lineage, governance, and auditability throughout the AI lifecycle. Organizations can use the platform to accelerate model development, streamline workflows, and deploy trusted AI solutions faster and more efficiently. The platform includes built-in governance features that support fairness, explainability, compliance, and responsible AI practices across teams and business processes. SAS Viya also supports secure AI agent integration through the SAS Viya MCP Server, allowing AI-driven tools and copilots to interact with enterprise workflows responsibly. Businesses can deploy the platform in cloud, hybrid, or on-premises environments based on operational and security requirements. SAS Viya is used across industries for applications such as fraud detection, healthcare analytics, forecasting, customer intelligence, and AI model operationalization. The platform is designed to improve productivity by simplifying complex AI workflows and enabling collaboration between data scientists, analysts, and business users. Backed by decades of analytics expertise, SAS Viya helps organizations transform raw data into transparent and actionable business decisions at scale.
  • 7
    Foundational Reviews
    Detect and address code and optimization challenges in real-time, mitigate data incidents before deployment, and oversee data-affecting code modifications comprehensively—from the operational database to the user interface dashboard. With automated, column-level data lineage tracing the journey from the operational database to the reporting layer, every dependency is meticulously examined. Foundational automates the enforcement of data contracts by scrutinizing each repository in both upstream and downstream directions, directly from the source code. Leverage Foundational to proactively uncover code and data-related issues, prevent potential problems, and establish necessary controls and guardrails. Moreover, implementing Foundational can be achieved in mere minutes without necessitating any alterations to the existing codebase, making it an efficient solution for organizations. This streamlined setup promotes quicker response times to data governance challenges.
  • 8
    Matia Reviews
    Matia serves as a comprehensive DataOps platform aimed at streamlining contemporary data management by merging essential functions into a cohesive system. By integrating ETL, reverse ETL, data observability, and a data catalog, it removes the reliance on various isolated tools, thereby simplifying the challenges associated with managing disjointed data environments. This platform empowers teams to efficiently and reliably transfer data from diverse sources into data warehouses, utilizing sophisticated ingestion features that include real-time updates and effective error management. Furthermore, it facilitates the return of dependable data to operational tools for practical business applications. Matia prioritizes inherent observability throughout the data pipeline, offering capabilities such as monitoring, anomaly detection, and automated quality assessments to maintain data integrity and reliability, ultimately preventing potential issues from affecting downstream processes. As a result, organizations can achieve a more streamlined workflow and enhanced data utilization across their operations.
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB