Apache Hudi Description
Hudi is a rich platform for building streaming data lakes using incremental data pipelines on a self managing database layer. It can also be optimized for regular batch processing and lake engines. Hudi keeps a timeline of all actions on the table at different times. This allows for instantaneous views and efficient retrieval of data in the order they were received. The following components make up a Hudi instant. Hudi provides efficient upserts by mapping a given Hoodie key consistently with a file ID, via an indexing mechanism. Once a record is written to a file, the mapping between record key/file group/file ID never changes. The mapped file group includes all versions of a group record.
Integrations
Company Details
Product Details
Apache Hudi Features and Options
Apache Hudi User Reviews
Write a Review- Previous
- Next