Apache Gobblin Description

A framework for distributed data integration that streamlines essential functions of Big Data integration, including data ingestion, replication, organization, and lifecycle management, is designed for both streaming and batch data environments. It operates as a standalone application on a single machine and can also function in an embedded mode. Additionally, it is capable of executing as a MapReduce application across various Hadoop versions and offers compatibility with Azkaban for initiating MapReduce jobs. In standalone cluster mode, it features primary and worker nodes, providing high availability and the flexibility to run on bare metal systems. Furthermore, it can function as an elastic cluster in the public cloud, maintaining high availability in this setup. Currently, Gobblin serves as a versatile framework for creating various data integration applications, such as ingestion and replication. Each application is usually set up as an independent job and managed through a scheduler like Azkaban, allowing for organized execution and management of data workflows. This adaptability makes Gobblin an appealing choice for organizations looking to enhance their data integration processes.

Integrations

Reviews

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Company Details

Company:
Apache Software Foundation
Headquarters:
United States
Website:
gobblin.apache.org

Media

Apache Gobblin Screenshot 1
Recommended Products
Comprehensive Cybersecurity to Safeguard Your Organization | SOCRadar Icon
Comprehensive Cybersecurity to Safeguard Your Organization | SOCRadar

See what hackers already know about your organization – and stop them from getting in.

Protect your organization from cyber threats with SOCRadar’s cutting-edge threat intelligence. Gain 360° visibility into your digital assets, monitor the dark web, and stay ahead of hackers with real-time insights. Start for free and transform your cybersecurity today.
Free Trial

Product Details

Platforms
Windows
Linux
Types of Training
Training Docs
Customer Support
Online Support

Apache Gobblin Features and Options

Big Data Platform

Collaboration
Data Blends
Data Cleansing
Data Mining
Data Visualization
Data Warehousing
High Volume Processing
No-Code Sandbox
Predictive Analytics
Templates

Apache Gobblin Lists

Apache Gobblin User Reviews

Write a Review
  • Previous
  • Next