DataBuck
Big Data quality must always be verified to ensure that data is safe, accurate, and complete as it moves through multiple IT platforms or is stored in Data Lakes. The Big Data challenge: data often loses its trustworthiness because of (i) undiscovered errors in incoming data, (ii) multiple data sources that fall out of sync over time, (iii) structural changes to data that downstream processes do not expect, and (iv) the use of multiple IT platforms (Hadoop, Data Warehouse, Cloud). Unexpected errors can occur when data moves between systems, such as from a Data Warehouse to a Hadoop environment, a NoSQL database, or the Cloud. Data can also change unexpectedly due to poor processes, ad-hoc data policies, poor data storage and control, and lack of control over certain data sources (e.g., external providers). DataBuck is an autonomous, self-learning Big Data quality validation and data matching tool.
Learn more
Okyline
Okyline is an Executable Data Design (EDD) platform focused on executable validation contracts and operational data quality control.
Rather than managing separate specifications, validation code, tests, and monitoring dashboards, Okyline centralizes validation and quality supervision around a single readable executable contract acting as the operational reference for enterprise data flows.
The same contract powers deterministic validation, advanced business invariant checks, multi-format execution, data quality gates, and historical quality analytics across APIs, events, files, LLM structured outputs, and distributed operational systems.
Contracts are designed directly from annotated sample data, making validation rules immediately understandable for developers, architects, QA teams, and business analysts.
The Community Edition includes the public specification, a free Java runtime engine, a Claude AI assistant for contract generation, and an online studio supporting executable JSON validation contracts and JSON Schema transpilation.
The Enterprise Edition adds native validation for JSONL, XML, CSV, FIXED, and EDI flows together with operational quality dashboards and data quality gates, without requiring databases or centralized infrastructure.
Learn more
Verodat
Verodat is a SaaS platform that gathers, prepares, and enriches your business data, then connects it to AI analytics tools for results you can trust. Verodat automates data cleansing and consolidates data into a clean, trustworthy data layer to feed downstream reporting. It manages data requests for suppliers and generates an audit trail to prove quality assurance for each data row. Validation and governance can be customized to your organization through a flexible rules engine that lets you create validation and testing suited to your requirements. Data preparation time is reduced by 60%, allowing analysts to focus more on insights. The central KPI dashboard provides key metrics about your data pipeline and monitors data workflows, helping you identify bottlenecks, resolve issues, and improve performance. Out-of-the-box connections to Snowflake and Azure make it easy to integrate your existing tools.
Learn more
Data8
Data8 provides an extensive range of cloud-based solutions focused on data quality, ensuring your information remains clean, precise, and current. Our offerings include tailored services for data validation, cleansing, migration, and monitoring to address specific organizational requirements. Among our validation services are real-time verification tools that cover address autocomplete, postcode lookup, bank account validation, email verification, name and phone validation, as well as business insights, all designed to capture accurate customer data during initial entry. To enhance both B2B and B2C databases, Data8 offers various services such as appending and enhancement, email and phone validation, suppression of records for individuals who have moved or passed away, deduplication, merging of records, PAF cleansing, and preference services. Additionally, Data8 features an automated deduplication solution that seamlessly integrates with Microsoft Dynamics 365, allowing for the efficient deduplication, merging, and standardization of multiple records. This comprehensive approach not only improves data integrity but also streamlines operations, ultimately supporting better decision-making within your organization.
Learn more