lakeFS is an open source platform that delivers resilience and manageability to object-storage based data lakes.
With lakeFS you can build repeatable, atomic and versioned data lake operations – from complex ETL jobs to data science and analytics.
Features
1. Exabytes scale version control 2. Git-like operations: branch, commit, merge, revert 3. Zero copy branching for frictionless experiments 4. Full reproducibility of data and code 5. Pre-commit/merge hooks for data CI/CD 6. Instantly revert changes to data