Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Description

A framework for distributed data integration that streamlines essential functions of Big Data integration, including data ingestion, replication, organization, and lifecycle management, is designed for both streaming and batch data environments. It operates as a standalone application on a single machine and can also function in an embedded mode. Additionally, it is capable of executing as a MapReduce application across various Hadoop versions and offers compatibility with Azkaban for initiating MapReduce jobs. In standalone cluster mode, it features primary and worker nodes, providing high availability and the flexibility to run on bare metal systems. Furthermore, it can function as an elastic cluster in the public cloud, maintaining high availability in this setup. Currently, Gobblin serves as a versatile framework for creating various data integration applications, such as ingestion and replication. Each application is usually set up as an independent job and managed through a scheduler like Azkaban, allowing for organized execution and management of data workflows. This adaptability makes Gobblin an appealing choice for organizations looking to enhance their data integration processes.

Description

IBM Analytics Engine offers a unique architecture for Hadoop clusters by separating the compute and storage components. Rather than relying on a fixed cluster with nodes that serve both purposes, this engine enables users to utilize an object storage layer, such as IBM Cloud Object Storage, and to dynamically create computing clusters as needed. This decoupling enhances the flexibility, scalability, and ease of maintenance of big data analytics platforms. Built on a stack that complies with ODPi and equipped with cutting-edge data science tools, it integrates seamlessly with the larger Apache Hadoop and Apache Spark ecosystems. Users can define clusters tailored to their specific application needs, selecting the suitable software package, version, and cluster size. They have the option to utilize the clusters for as long as necessary and terminate them immediately after job completion. Additionally, users can configure these clusters with third-party analytics libraries and packages, and leverage IBM Cloud services, including machine learning, to deploy their workloads effectively. This approach allows for a more responsive and efficient handling of data processing tasks.

API Access

Has API

API Access

Has API

Screenshots View All

Screenshots View All

Integrations

Hadoop
Acquia CDP
Apache Spark
Galileo
IBM Cloud Object Storage
MINT
RadiantOne
Switch Automation
ZARUS

Integrations

Hadoop
Acquia CDP
Apache Spark
Galileo
IBM Cloud Object Storage
MINT
RadiantOne
Switch Automation
ZARUS

Pricing Details

No price information available.
Free Trial
Free Version

Pricing Details

$0.014 per hour
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Apache Software Foundation

Country

United States

Website

gobblin.apache.org

Vendor Details

Company Name

IBM

Founded

1911

Country

United States

Website

www.ibm.com/cloud/analytics-engine

Product Features

Big Data

Collaboration
Data Blends
Data Cleansing
Data Mining
Data Visualization
Data Warehousing
High Volume Processing
No-Code Sandbox
Predictive Analytics
Templates

Product Features

Data Discovery

Contextual Search
Data Classification
Data Matching
False Positives Reduction
Self Service Data Preparation
Sensitive Data Identification
Visual Analytics

Data Visualization

Analytics
Content Management
Dashboard Creation
Filtered Views
OLAP
Relational Display
Simulation Models
Visual Discovery

Alternatives

Apache Spark Reviews

Apache Spark

Apache Software Foundation

Alternatives

Hadoop Reviews

Hadoop

Apache Software Foundation
E-MapReduce Reviews

E-MapReduce

Alibaba
E-MapReduce Reviews

E-MapReduce

Alibaba
Apache Sentry Reviews

Apache Sentry

Apache Software Foundation
MLlib Reviews

MLlib

Apache Software Foundation