lakeFS Description

lakeFS allows you to manage your data lake in the same way as your code. Parallel pipelines can be used for experimentation as well as CI/CD of your data. This simplifies the lives of data scientists, engineers, and analysts who work in data transformation. lakeFS is an open-source platform that provides resilience and manageability for object-storage-based data lakes. lakeFS allows you to build repeatable, atomic, and versioned data lakes operations. This includes complex ETL jobs as well as data science and analysis. lakeFS is compatible with AWS S3, Azure Blob Storage, and Google Cloud Storage (GCS). It is API compatible to S3 and seamlessly integrates with all modern data frameworks like Spark, Hive AWS Athena, Presto, AWS Athena, Presto, and others. lakeFS is a Git-like branching/committing model that can scale to exabytes by using S3, GCS, and Azure Blob storage.

Integrations

API:
Yes, lakeFS has an API

Reviews

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Company Details

Company:
Treeverse
Year Founded:
2020
Headquarters:
Israel
Website:
lakefs.io

Media

lakeFS Screenshot 1
Recommended Products
Open source. Easy to use. Proven. Complete. Icon
Open source. Easy to use. Proven. Complete.

End to end big data that enables you to spend less time formatting data and more time analyzing it.

Discover HPCC Systems - the truly open source big data solution that allows you to quickly process, analyze and understand large data sets, even data stored in massive, mixed-schema data lakes. Designed by data scientists, HPCC systems is a complete integrated solution from data ingestion and data processing to data delivery. The free online introductory courses and a robust developer community allow you to get started quickly.

Product Details

Platforms
SaaS
Type of Training
Documentation
Webinars
Customer Support
Online

lakeFS Features and Options

Data Management Software

Customer Data
Data Analysis
Data Capture
Data Integration
Data Migration
Data Quality Control
Data Security
Information Governance
Master Data Management
Match & Merge