dbt
dbt Labs is redefining how data teams work with SQL. Instead of waiting on complex ETL processes, dbt lets data analysts and data engineers build production-ready transformations directly in the warehouse, using code, version control, and CI/CD. This community-driven approach puts power back in the hands of practitioners while maintaining governance and scalability for enterprise use.
With a rapidly growing open-source community and an enterprise-grade cloud platform, dbt is at the heart of the modern data stack. It’s the go-to solution for teams who want faster analytics, higher quality data, and the confidence that comes from transparent, testable transformations.
Learn more
DataHub
DataHub is a versatile open-source metadata platform crafted to enhance data discovery, observability, and governance within various data environments. It empowers organizations to easily find reliable data, providing customized experiences for users while avoiding disruptions through precise lineage tracking at both the cross-platform and column levels. By offering a holistic view of business, operational, and technical contexts, DataHub instills trust in your data repository. The platform features automated data quality assessments along with AI-driven anomaly detection, alerting teams to emerging issues and consolidating incident management. With comprehensive lineage information, documentation, and ownership details, DataHub streamlines the resolution of problems. Furthermore, it automates governance processes by classifying evolving assets, significantly reducing manual effort with GenAI documentation, AI-based classification, and intelligent propagation mechanisms. Additionally, DataHub's flexible architecture accommodates more than 70 native integrations, making it a robust choice for organizations seeking to optimize their data ecosystems. This makes it an invaluable tool for any organization looking to enhance their data management capabilities.
Learn more
MANTA
Manta is a unified data lineage platform that serves as the central hub of all enterprise data flows.
Manta can construct lineage from report definitions, custom SQL code, and ETL workflows. Lineage is analyzed based on actual code, and both direct and indirect flows can be visualized on the map. Data paths between files, report fields, database tables, and individual columns are displayed to users in an intuitive user interface, enabling teams to understand data flows in context.
Learn more
Google Cloud Analytics Hub
Google Cloud's Analytics Hub serves as a data exchange platform that empowers organizations to share data assets securely and efficiently beyond their internal boundaries, tackling issues related to data integrity and associated costs. Leveraging the robust scalability and adaptability of BigQuery, it enables users to create a comprehensive library encompassing both internal and external datasets, including distinctive data like Google Trends. The platform simplifies the publication, discovery, and subscription processes for data exchanges, eliminating the need for data transfers and enhancing the ease of access to data and analytical resources. Additionally, Analytics Hub ensures privacy-safe and secure data sharing through stringent governance practices, incorporating advanced security features and encryption protocols from BigQuery, Cloud IAM, and VPC Security Controls. By utilizing Analytics Hub, organizations can maximize the return on their data investment through effective data exchange strategies, while also fostering collaboration across different departments. Ultimately, this innovative platform enhances data-driven decision-making by providing seamless access to a wider array of data assets.
Learn more