Average Ratings 0 Ratings

Total
ease
features
design
support

No user reviews.

Description

Azure Databricks lets you build artificial intelligence (AI) solutions on your own data: you can set up an Apache Spark™ environment in minutes, enable autoscaling, and collaborate on shared projects in an interactive workspace. The platform supports Python, Scala, R, Java, and SQL, along with data science frameworks and libraries such as TensorFlow, PyTorch, and scikit-learn. It provides recent versions of Apache Spark and integrates with open-source libraries. Clusters launch quickly in a fully managed Apache Spark environment that draws on Azure's global scale and availability, and they are set up, configured, and tuned automatically for reliability and performance, so they do not need constant monitoring. Autoscaling and auto-termination can also lower total cost of ownership (TCO) by releasing compute that is not in use.
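
Autoscaling and auto-termination are usually expressed as fields in a cluster specification. The following is a minimal sketch that only assembles such a specification as a Python dictionary; it does not contact any Azure service. The field names (`autoscale`, `min_workers`, `max_workers`, `autotermination_minutes`) follow the Databricks Clusters API as commonly documented and should be checked against the current reference. The helper name `build_cluster_spec` and every value (cluster name, runtime version string, node type) are hypothetical placeholders.

```python
import json


def build_cluster_spec(name, min_workers, max_workers, idle_minutes):
    """Return a cluster specification dict with autoscaling and auto-termination.

    Hypothetical helper for illustration; it only assembles and validates data.
    """
    if min_workers < 1 or max_workers < min_workers:
        raise ValueError("need 1 <= min_workers <= max_workers")
    if idle_minutes <= 0:
        raise ValueError("idle_minutes must be positive")
    return {
        "cluster_name": name,
        # Placeholder values: check which runtime versions and VM sizes
        # are available in your own workspace.
        "spark_version": "13.3.x-scala2.12",
        "node_type_id": "Standard_DS3_v2",
        # The worker count moves between these bounds as load changes.
        "autoscale": {"min_workers": min_workers, "max_workers": max_workers},
        # The cluster shuts down after this many idle minutes.
        "autotermination_minutes": idle_minutes,
    }


spec = build_cluster_spec("example-cluster", min_workers=2, max_workers=8, idle_minutes=30)
print(json.dumps(spec, indent=2))
```

A narrow worker range with a short idle timeout favors cost; a wider range with a longer timeout favors responsiveness for interactive work.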

Description

Deequ is a library built on top of Apache Spark for defining "unit tests for data" that measure data quality in large datasets. The maintainers welcome feedback and contributions. Deequ depends on Java 8. Deequ 2.x runs only with Spark 3.1, and Spark 3.1 in turn requires Deequ 2.x. For earlier Spark versions, use a Deequ 1.x release, which is maintained in the legacy-spark-3.0 branch; legacy releases are available for Apache Spark 2.2.x through 3.0.x. The Spark 2.2.x and 2.3.x releases depend on Scala 2.11, while the Spark 2.4.x, 3.0.x, and 3.1.x releases depend on Scala 2.12. The goal of Deequ is to "unit-test" data so that errors are found early, before the data is fed to consuming systems or machine learning models.
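
The idea of "unit tests for data" can be sketched without Spark. The example below is plain Python and is not Deequ's API: the function names (`completeness`, `is_unique`, `run_checks`), the column names, and the sample rows are all hypothetical. It only illustrates the pattern of declaring named constraints on a dataset and collecting the ones that fail before the data moves downstream.

```python
def completeness(rows, column):
    """Fraction of rows where `column` is present and not None."""
    if not rows:
        return 1.0
    present = sum(1 for row in rows if row.get(column) is not None)
    return present / len(rows)


def is_unique(rows, column):
    """True if no value of `column` appears more than once."""
    values = [row.get(column) for row in rows]
    return len(values) == len(set(values))


def run_checks(rows):
    """Return a dict mapping each check name to True (passed) or False (failed)."""
    return {
        "has_rows": len(rows) > 0,
        "id_is_complete": completeness(rows, "id") == 1.0,
        "id_is_unique": is_unique(rows, "id"),
        "price_is_non_negative": all(
            row["price"] >= 0 for row in rows if row.get("price") is not None
        ),
        "name_mostly_present": completeness(rows, "name") >= 0.5,
    }


# Hypothetical sample data: one missing name, otherwise clean.
rows = [
    {"id": 1, "name": "widget", "price": 9.5},
    {"id": 2, "name": None, "price": 0.0},
    {"id": 3, "name": "gadget", "price": 12.0},
]

results = run_checks(rows)
failed = [name for name, ok in results.items() if not ok]
print("failed checks:", failed)
```

A pipeline would typically stop, or quarantine the batch, when `failed` is non-empty, which is how problems are caught before consuming systems see the data.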

API Access

Has API

Integrations

Artie
Axonius
DQOps
Databricks
Embeddable
Genesis Computing
Harbr
Indent
Kedro
LynxCare
Mage Sensitive Data Discovery
Microsoft Intelligent Data Platform
Nuvento
Quaeris
StarfishETL
Syntho
The Asset Guardian EAM (TAG)
ZenML
definity

Pricing Details

No price information available.
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name: Microsoft
Founded: 1975
Country: United States
Website: azure.microsoft.com/en-us/services/databricks/

Vendor Details

Company Name: Deequ
Website: github.com/awslabs/deequ

Product Features

Big Data

Collaboration
Data Blends
Data Cleansing
Data Mining
Data Visualization
Data Warehousing
High Volume Processing
No-Code Sandbox
Predictive Analytics
Templates

Alternatives

Spark Streaming
Apache Software Foundation

MLlib
Apache Software Foundation

Apache Spark
Apache Software Foundation

Apache Mahout
Apache Software Foundation