Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Description

Deequ is an innovative library that extends Apache Spark to create "unit tests for data," aiming to assess the quality of extensive datasets. We welcome any feedback and contributions from users. The library requires Java 8 for operation. It is important to note that Deequ version 2.x is compatible exclusively with Spark 3.1, and the two are interdependent. For those using earlier versions of Spark, the Deequ 1.x version should be utilized, which is maintained in the legacy-spark-3.0 branch. Additionally, we offer legacy releases that work with Apache Spark versions ranging from 2.2.x to 3.0.x. The Spark releases 2.2.x and 2.3.x are built on Scala 2.11, while the 2.4.x, 3.0.x, and 3.1.x releases require Scala 2.12. The primary goal of Deequ is to perform "unit-testing" on data to identify potential issues early on, ensuring that errors are caught before the data reaches consuming systems or machine learning models. In the sections that follow, we will provide a simple example to demonstrate the fundamental functionalities of our library, highlighting its ease of use and effectiveness in maintaining data integrity.

Description

QikkDB is a high-performance, GPU-accelerated columnar database designed to excel in complex polygon computations and large-scale data analytics. If you're managing billions of data points and require immediate insights, qikkDB is the solution you need. It is compatible with both Windows and Linux operating systems, ensuring flexibility for developers. The project employs Google Tests for its testing framework, featuring hundreds of unit tests alongside numerous integration tests to maintain robust quality. For those developing on Windows, it is advisable to use Microsoft Visual Studio 2019, with essential dependencies that include at least CUDA version 10.2, CMake 3.15 or a more recent version, vcpkg, and Boost libraries. Meanwhile, Linux developers will also require a minimum of CUDA version 10.2, CMake 3.15 or newer, and Boost for optimal operation. This software is distributed under the Apache License, Version 2.0, allowing for a wide range of usage. To simplify the installation process, users can opt for either an installation script or a Dockerfile to get qikkDB up and running seamlessly. Additionally, this versatility makes it an appealing choice for various development environments.

API Access

Has API

API Access

Has API

Screenshots View All

Screenshots View All

Integrations

AWS Marketplace
Apache Spark

Integrations

AWS Marketplace
Apache Spark

Pricing Details

No price information available.
Free Trial
Free Version

Pricing Details

No price information available.
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Deequ

Website

github.com/awslabs/deequ

Vendor Details

Company Name

qikkDB

Website

github.com/qikkDB

Product Features

Product Features

Alternatives

Alternatives

Spark Streaming Reviews

Spark Streaming

Apache Software Foundation
Miele-LXIV Reviews

Miele-LXIV

DICOM Software
MLlib Reviews

MLlib

Apache Software Foundation
InfiniDB Reviews

InfiniDB

Database of Databases
Apache Spark Reviews

Apache Spark

Apache Software Foundation
Apache Mahout Reviews

Apache Mahout

Apache Software Foundation