Apache Hudi Description

Apache Hudi is a platform for building streaming data lakes with incremental data pipelines on a self-managing database layer. It can also be optimized for regular batch processing and lake engines. Hudi maintains a timeline of all actions performed on a table at different instants of time, which provides instantaneous views of the table and supports efficient retrieval of data in the order it arrived. A Hudi instant consists of an action type, an instant time, and a state. Hudi provides efficient upserts by consistently mapping a given Hoodie key to a file ID through an indexing mechanism. Once the first version of a record has been written to a file, the mapping between the record key and its file group (file ID) never changes. The mapped file group therefore contains all versions of a group of records.
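The key-to-file mapping described above can be illustrated with a small sketch. This is a toy in-memory model, not Hudi's actual index or API: the class ToyTable, its methods, the file ID format, and the string instant times are all hypothetical names chosen for illustration. It only shows the stated property that a record key, once assigned to a file group, stays there, and that the file group accumulates every version of the record.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class ToyTable:
    """Toy model of a key-to-file-group index (hypothetical, not Hudi code)."""

    # record key -> file ID; an entry is never reassigned once created
    index: Dict[str, str] = field(default_factory=dict)
    # file ID -> list of (instant_time, record_key, payload) versions
    file_groups: Dict[str, List[Tuple[str, str, dict]]] = field(default_factory=dict)
    next_file_number: int = 0

    def upsert(self, instant_time: str, key: str, payload: dict) -> str:
        """Write a record version and return the file ID it landed in."""
        file_id = self.index.get(key)
        if file_id is None:
            # First write for this key: assign a new file group and remember it.
            file_id = f"file-{self.next_file_number}"
            self.next_file_number += 1
            self.index[key] = file_id
            self.file_groups[file_id] = []
        # Later writes for the same key are routed to the same file group.
        self.file_groups[file_id].append((instant_time, key, payload))
        return file_id

    def latest(self, key: str) -> dict:
        """Return the payload of the newest version of a record."""
        versions = [v for v in self.file_groups[self.index[key]] if v[1] == key]
        # Instant times are assumed to be strings that sort chronologically.
        return max(versions, key=lambda v: v[0])[2]


table = ToyTable()
first_file = table.upsert("20240101000000", "user-1", {"name": "old"})
second_file = table.upsert("20240102000000", "user-1", {"name": "new"})
```

In this sketch both writes for user-1 land in the same file group, so an update only has to touch the file that already holds the record rather than scan the whole table.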

Company Details

Company:
Apache Software Foundation
Year Founded:
1999
Headquarters:
United States
Website:
hudi.apache.org

Product Details

Platforms
SaaS
Type of Training
Documentation
Customer Support
Online

Apache Hudi Features and Options

Data Warehouse Software

Ad hoc Query
Analytics
Data Integration
Data Migration
Data Quality Control
ETL - Extract / Transform / Load
In-Memory Processing
Match & Merge
