Average Ratings 0 Ratings

Total
ease
features
design
support

Description

Apache Zeppelin is a web-based notebook for interactive data analytics and collaborative documents, with support for SQL, Scala, and other languages. Its IPython interpreter provides a user experience comparable to Jupyter Notebook. The latest version adds dynamic forms at the note level, a note revision comparison tool, and the ability to run paragraphs sequentially instead of simultaneously, as earlier versions did. An interpreter lifecycle manager also terminates interpreter processes automatically once they have been idle, which frees their resources while they are not in use.
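The idle-termination behavior described above can be pictured as a timeout rule over last-use timestamps. The following is a minimal sketch of that idea in plain Python, not Zeppelin's implementation: the class `IdleTimeoutManager`, its methods, and the interpreter names are hypothetical and exist only for illustration.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class IdleTimeoutManager:
    """Hypothetical sketch: track last-use times and reap idle interpreters."""

    timeout_seconds: float
    clock: Callable[[], float] = time.monotonic  # injectable so tests can fake time
    _last_used: Dict[str, float] = field(default_factory=dict)

    def touch(self, interpreter_id: str) -> None:
        """Record that an interpreter was just used (for example, a paragraph ran)."""
        self._last_used[interpreter_id] = self.clock()

    def reap_idle(self) -> List[str]:
        """Forget every interpreter idle longer than the timeout and return their ids."""
        now = self.clock()
        expired = [
            name for name, last in self._last_used.items()
            if now - last > self.timeout_seconds
        ]
        for name in expired:
            # A real manager would shut down the interpreter process here.
            del self._last_used[name]
        return expired

    def active(self) -> List[str]:
        """Return the ids of interpreters that are still tracked, sorted by name."""
        return sorted(self._last_used)


# Usage with a fake clock so the example is deterministic.
fake_now = [0.0]
manager = IdleTimeoutManager(timeout_seconds=60, clock=lambda: fake_now[0])
manager.touch("spark")       # used at t=0
fake_now[0] = 50.0
manager.touch("python")      # used at t=50
fake_now[0] = 100.0
print(manager.reap_idle())   # ['spark']
print(manager.active())      # ['python']
```

A periodic background check that calls `reap_idle` would complete the picture. The sketch leaves scheduling out to stay short.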

Description

Deequ is a library built on top of Apache Spark for defining "unit tests for data" that measure data quality in large datasets. Feedback and contributions are welcome. Deequ requires Java 8. Deequ 2.x runs only with Spark 3.1, and Spark 3.1 in turn requires Deequ 2.x. For older Spark versions, use a Deequ 1.x release, which is maintained in the legacy-spark-3.0 branch. Legacy releases are also available for Apache Spark 2.2.x through 3.0.x. The releases for Spark 2.2.x and 2.3.x depend on Scala 2.11, while those for Spark 2.4.x, 3.0.x, and 3.1.x depend on Scala 2.12. The goal of Deequ is to "unit-test" data so that errors are found early, before the data is fed to consuming systems or machine learning models.
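To make the "unit tests for data" idea concrete, here is a minimal sketch in plain Python. It is not Deequ's API, which is written in Scala and runs on Spark DataFrames. The helper names (`is_complete`, `is_unique`, `is_non_negative`, `run_checks`) and the sample rows are assumptions made up for illustration.

```python
from typing import Any, Callable, Dict, List, Tuple

Row = Dict[str, Any]
Constraint = Tuple[str, Callable[[List[Row]], bool]]


def is_complete(column: str) -> Constraint:
    """Constraint: no row has a missing (None) value in the column."""
    return (f"is_complete({column})",
            lambda rows: all(r.get(column) is not None for r in rows))


def is_unique(column: str) -> Constraint:
    """Constraint: no value in the column occurs more than once."""
    def check(rows: List[Row]) -> bool:
        values = [r.get(column) for r in rows]
        return len(values) == len(set(values))
    return (f"is_unique({column})", check)


def is_non_negative(column: str) -> Constraint:
    """Constraint: every non-missing value in the column is >= 0."""
    return (f"is_non_negative({column})",
            lambda rows: all(r[column] >= 0
                             for r in rows if r.get(column) is not None))


def run_checks(rows: List[Row], constraints: List[Constraint]) -> Dict[str, bool]:
    """Evaluate each named constraint against the whole dataset."""
    return {name: check(rows) for name, check in constraints}


# Hypothetical sample data: one row is missing its name.
rows = [
    {"id": 1, "name": "Thingy A", "views": 0},
    {"id": 2, "name": None, "views": 12},
    {"id": 3, "name": "Thingy C", "views": 5},
]

results = run_checks(rows, [
    is_complete("id"),
    is_unique("id"),
    is_complete("name"),
    is_non_negative("views"),
])
failed = [name for name, ok in results.items() if not ok]
print(failed)  # ['is_complete(name)']
```

Running such checks before data is handed to downstream consumers is what lets a failed constraint block a bad batch early, in the same way a failing unit test blocks a bad build.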

API Access

Has API

Integrations

Apache Spark
Alluxio
Apache Flink
Apache Geode
Apache HBase
Apache Hive
Apache Ignite
Domino Enterprise AI Platform
Elasticsearch
Java
JavaScript
Markdown
OctoData
Python
R
Scala
Timbr.ai
Warp 10
Yandex Data Proc
Zepl

Pricing Details

No price information available.
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Apache

Founded

1999

Country

United States

Website

zeppelin.apache.org

Vendor Details

Company Name

Deequ

Website

github.com/awslabs/deequ

Product Features

IDE

Code Completion
Compiler
Cross Platform Support
Debugger
Drag and Drop UI
Integrations and Plugins
Multi Language Support
Project Management
Text Editor / Code Editor

Alternatives

JupyterLab (Jupyter)
Spark Streaming (Apache Software Foundation)
MLlib (Apache Software Foundation)
Apache Spark (Apache Software Foundation)
Apache Mahout (Apache Software Foundation)