dbt Labs is redefining how data teams work with SQL. Instead of waiting on complex ETL processes, dbt lets data analysts and data engineers build production-ready transformations directly in the warehouse, using code, version control, and CI/CD. This community-driven approach puts power back in the hands of practitioners while maintaining governance and scalability for enterprise use.
With a rapidly growing open-source community and an enterprise-grade cloud platform, dbt is at the heart of the modern data stack. It’s the go-to solution for teams who want faster analytics, higher quality data, and the confidence that comes from transparent, testable transformations.
Learn more

Okyline is an Executable Data Design (EDD) platform focused on executable validation contracts and operational data quality control.
Rather than managing separate specifications, validation code, tests, and monitoring dashboards, Okyline centralizes validation and quality supervision around a single readable executable contract acting as the operational reference for enterprise data flows.
The same contract powers deterministic validation, advanced business invariant checks, multi-format execution, data quality gates, and historical quality analytics across APIs, events, files, LLM structured outputs, and distributed operational systems.
Contracts are designed directly from annotated sample data, making validation rules immediately understandable for developers, architects, QA teams, and business analysts.
The Community Edition includes the public specification, a free Java runtime engine, a Claude AI assistant for contract generation, and an online studio supporting executable JSON validation contracts and JSON Schema transpilation.
The Enterprise Edition adds native validation for JSONL, XML, CSV, FIXED, and EDI flows together with operational quality dashboards and data quality gates, without requiring databases or centralized infrastructure.erprise Edition supports direct validation of JSON, JSONL, XML, CSV, FIXED, and EDI flows with operational quality dashboards and analytics, without databases.
Learn more
YData
Embracing data-centric AI has become remarkably straightforward thanks to advancements in automated data quality profiling and synthetic data creation. Our solutions enable data scientists to harness the complete power of their data. YData Fabric allows users to effortlessly navigate and oversee their data resources, providing synthetic data for rapid access and pipelines that support iterative and scalable processes. With enhanced data quality, organizations can deliver more dependable models on a larger scale. Streamline your exploratory data analysis by automating data profiling for quick insights. Connecting to your datasets is a breeze via a user-friendly and customizable interface. Generate synthetic data that accurately reflects the statistical characteristics and behaviors of actual datasets. Safeguard your sensitive information, enhance your datasets, and boost model efficiency by substituting real data with synthetic alternatives or enriching existing datasets. Moreover, refine and optimize workflows through effective pipelines by consuming, cleaning, transforming, and enhancing data quality to elevate the performance of machine learning models. This comprehensive approach not only improves operational efficiency but also fosters innovative solutions in data management.
Learn more
FinetuneDB
Capture production data. Evaluate outputs together and fine-tune the performance of your LLM. A detailed log overview will help you understand what is happening in production. Work with domain experts, product managers and engineers to create reliable model outputs. Track AI metrics, such as speed, token usage, and quality scores. Copilot automates model evaluations and improvements for your use cases. Create, manage, or optimize prompts for precise and relevant interactions between AI models and users. Compare fine-tuned models and foundation models to improve prompt performance. Build a fine-tuning dataset with your team. Create custom fine-tuning data to optimize model performance.
Learn more