Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Description

BeakerX is an extensive suite of kernels and enhancements designed for the Jupyter interactive computing platform. It offers support for the JVM, Spark clusters, and polyglot programming, alongside features like interactive visualizations, tables, forms, and publishing capabilities. Each of BeakerX's supported JVM languages, in addition to Python and JavaScript, is equipped with APIs for generating interactive time-series, scatter plots, histograms, heatmaps, and treemaps. The interactive widgets retain their functionality in both saved notebooks and those shared online, featuring specialized tools for managing large datasets, nanosecond precision, zooming capabilities, and export options. Additionally, BeakerX's table widget seamlessly integrates with pandas data frames, enabling users to easily search, sort, drag, filter, format, select, graph, hide, pin, and export data to CSV or clipboard, facilitating quick connections to spreadsheets. Furthermore, BeakerX includes a Spark magic interface, complete with graphical user interfaces for managing configuration, monitoring status and progress, and interrupting Spark jobs, allowing users the flexibility to either utilize the GUI or programmatically create their own SparkSession. In this way, it significantly enhances the efficiency and usability of data processing and analysis tasks within the Jupyter environment.

Description

Deequ is an innovative library that extends Apache Spark to create "unit tests for data," aiming to assess the quality of extensive datasets. We welcome any feedback and contributions from users. The library requires Java 8 for operation. It is important to note that Deequ version 2.x is compatible exclusively with Spark 3.1, and the two are interdependent. For those using earlier versions of Spark, the Deequ 1.x version should be utilized, which is maintained in the legacy-spark-3.0 branch. Additionally, we offer legacy releases that work with Apache Spark versions ranging from 2.2.x to 3.0.x. The Spark releases 2.2.x and 2.3.x are built on Scala 2.11, while the 2.4.x, 3.0.x, and 3.1.x releases require Scala 2.12. The primary goal of Deequ is to perform "unit-testing" on data to identify potential issues early on, ensuring that errors are caught before the data reaches consuming systems or machine learning models. In the sections that follow, we will provide a simple example to demonstrate the fundamental functionalities of our library, highlighting its ease of use and effectiveness in maintaining data integrity.

API Access

Has API

API Access

Has API

Screenshots View All

Screenshots View All

Integrations

Apache Spark

Integrations

Apache Spark

Pricing Details

No price information available.
Free Trial
Free Version

Pricing Details

No price information available.
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Two Sigma Open Source

Website

BeakerNotebook.com

Vendor Details

Company Name

Deequ

Website

github.com/awslabs/deequ

Product Features

Application Development

Access Controls/Permissions
Code Assistance
Code Refactoring
Collaboration Tools
Compatibility Testing
Data Modeling
Debugging
Deployment Management
Graphical User Interface
Mobile Development
No-Code
Reporting/Analytics
Software Development
Source Control
Testing Management
Version Control
Web App Development

Product Features

Alternatives

Alternatives

Spark Streaming Reviews

Spark Streaming

Apache Software Foundation
Apache Spark Reviews

Apache Spark

Apache Software Foundation
MLlib Reviews

MLlib

Apache Software Foundation
Apache Spark Reviews

Apache Spark

Apache Software Foundation
Spark Streaming Reviews

Spark Streaming

Apache Software Foundation
Apache Mahout Reviews

Apache Mahout

Apache Software Foundation