dbt
dbt Labs is redefining how data teams work with SQL. Instead of waiting on complex ETL processes, dbt lets data analysts and data engineers build production-ready transformations directly in the warehouse, using code, version control, and CI/CD. This community-driven approach puts power back in the hands of practitioners while maintaining governance and scalability for enterprise use.
With a rapidly growing open-source community and an enterprise-grade cloud platform, dbt is at the heart of the modern data stack. It’s the go-to solution for teams who want faster analytics, higher quality data, and the confidence that comes from transparent, testable transformations.
Learn more
DataHub
DataHub is a versatile open-source metadata platform crafted to enhance data discovery, observability, and governance within various data environments. It empowers organizations to easily find reliable data, providing customized experiences for users while avoiding disruptions through precise lineage tracking at both the cross-platform and column levels. By offering a holistic view of business, operational, and technical contexts, DataHub instills trust in your data repository. The platform features automated data quality assessments along with AI-driven anomaly detection, alerting teams to emerging issues and consolidating incident management. With comprehensive lineage information, documentation, and ownership details, DataHub streamlines the resolution of problems. Furthermore, it automates governance processes by classifying evolving assets, significantly reducing manual effort with GenAI documentation, AI-based classification, and intelligent propagation mechanisms. Additionally, DataHub's flexible architecture accommodates more than 70 native integrations, making it a robust choice for organizations seeking to optimize their data ecosystems. This makes it an invaluable tool for any organization looking to enhance their data management capabilities.
Learn more
Digna
digna is a next-generation European data quality and observability platform that empowers organizations to improve data trust, reduce downtime, and uncover actionable insights.
Its five independent modules — Data Anomalies, Data Analytics, Data Timeliness, Data Validation, and Data Schema Tracker — address both data quality and operational/business monitoring. From detecting unexpected drops in record counts to spotting surges in product sales, digna gives you visibility across your entire data ecosystem.
Key advantages:
• In-database processing for full privacy & compliance
• AI-powered anomaly detection with zero manual rules
• Business trend analysis through statistical insights
• Regulatory compliance with flexible validation rules
• Pipeline protection via schema change tracking
Trusted in finance, healthcare, telecom, and government, digna integrates seamlessly with Snowflake, Databricks, Teradata, and more — whether on-premises, in the cloud, or hybrid.
With digna, your data is not just monitored — it’s understood.
Use Cases
Banking & Finance – Detect unusual spikes in transaction volumes to ensure both regulatory compliance and fraud prevention.
Healthcare – Monitor data timeliness to guarantee patient records and lab results arrive on time for critical decision-making.
Retail & eCommerce – Track sales trends and product anomalies to quickly identify fast-moving or underperforming items.
Telecommunications – Prevent schema drift in massive customer databases to avoid broken pipelines and billing errors.
Learn more
Validio
Examine the usage of your data assets, focusing on aspects like popularity, utilization, and schema coverage. Gain vital insights into your data assets, including their quality and usage metrics. You can easily locate and filter the necessary data by leveraging metadata tags and descriptions. Additionally, these insights will help you drive data governance and establish clear ownership within your organization. By implementing a streamlined lineage from data lakes to warehouses, you can enhance collaboration and accountability. An automatically generated field-level lineage map provides a comprehensive view of your entire data ecosystem. Moreover, anomaly detection systems adapt by learning from your data trends and seasonal variations, ensuring automatic backfilling with historical data. Thresholds driven by machine learning are specifically tailored for each data segment, relying on actual data rather than just metadata to ensure accuracy and relevance. This holistic approach empowers organizations to better manage their data landscape effectively.
Learn more