Apache Gobblin Description

A distributed data integration framework which simplifies common Big Data integration tasks such as data ingestion and replication, organization, and lifecycle management. It can be used for both streaming and batch data ecosystems. It can be run as a standalone program on a single computer. Also supports embedded mode. It can be used as a mapreduce application on multiple Hadoop versions. Azkaban is also available for the launch of mapreduce jobs. It can run as a standalone cluster, with primary and worker nodes. This mode supports high availability, and can also run on bare metals. This mode can be used as an elastic cluster in the public cloud. This mode supports high availability. Gobblin, as it exists today, is a framework that can build various data integration applications such as replication, ingest, and so on. Each of these applications are typically set up as a job and executed by Azkaban, a scheduler.

Integrations

Reviews

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Company Details

Company:
Apache Software Foundation
Headquarters:
United States
Website:
gobblin.apache.org

Media

Apache Gobblin Screenshot 1
You Might Also Like
Secure your business by securing your people. Icon
Secure your business by securing your people.

Over 100,000 businesses trust 1Password

Take the guesswork out of password management, shadow IT, infrastructure, and secret sharing so you can keep your people safe and your business moving.

Product Details

Platforms
Windows
Linux
Type of Training
Documentation
Customer Support
Online

Apache Gobblin Features and Options

Big Data Software

Collaboration
Data Blends
Data Cleansing
Data Mining
Data Visualization
Data Warehousing
High Volume Processing
No-Code Sandbox
Predictive Analytics
Templates

Apache Gobblin Lists