lakeFS is an open source platform that delivers resilience and manageability to object-storage based data lakes.
With lakeFS you can build repeatable, atomic and versioned data lake operations – from complex ETL jobs to data science and analytics.
1. Exabytes scale version control
2. Git-like operations: branch,
commit, merge, revert
3. Zero copy branching for
4. Full reproducibility of
data and code
5. Pre-commit/merge hooks for
6. Instantly revert changes to data