lakeFS 

GitHub Support CommunityData Management

lakeFS is an open source platform that delivers resilience and manageability to object-storage based data lakes.

With lakeFS you can build repeatable, atomic and versioned data lake operations – from complex ETL jobs to data science and analytics.

Features

1. Exabytes scale version control
2. Git-like operations: branch,
commit, merge, revert
3. Zero copy branching for
frictionless experiments
4. Full reproducibility of
data and code
5. Pre-commit/merge hooks for
data CI/CD
6. Instantly revert changes to data

Official website

Tutorial and documentation

Enter your contact information to continue reading