Marquez is an open source metadata service for the collection, aggregation, and visualization of a data ecosystem’s metadata. It maintains the provenance of how datasets are consumed and produced, provides global visibility into job runtime and frequency of dataset access, centralizes dataset lifecycle management, and much more. Marquez was released and open sourced by WeWork.


1. Centralized metadata management
2. Precise and highly dimensional data model
3. Easily collect metadata via an opinionated Metadata API
4. Datasets as first-class values
5. Enforcement of job and dataset ownership
6. Simple operation and design with minimal dependencies
7. RESTful API enabling sophisticated integrations with other systems
8. Designed to promote a healthy data ecosystem where teams within an organization can seamlessly share and safely depend on one another’s datasets with confidence
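
To make the idea of dataset provenance concrete, here is a minimal sketch of how recording which datasets each job consumes and produces lets you answer "what does this dataset depend on?". This is a toy in-memory model, not Marquez's actual data model or API: `LineageRegistry`, `record_run`, `upstream`, and the job and dataset names are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


class LineageRegistry:
    """Toy in-memory registry (hypothetical; NOT Marquez's data model or API)."""

    def __init__(self):
        # dataset name -> names of the jobs that produce it
        self._producers = defaultdict(set)
        # job name -> names of the datasets it consumes
        self._inputs = defaultdict(set)

    def record_run(self, job, inputs, outputs):
        """Record that `job` consumed `inputs` and produced `outputs`."""
        self._inputs[job].update(inputs)
        for dataset in outputs:
            self._producers[dataset].add(job)

    def upstream(self, dataset):
        """Return every dataset that `dataset` transitively depends on."""
        seen = set()
        stack = [dataset]
        while stack:
            current = stack.pop()
            # Walk from the dataset to its producing jobs, then to their inputs.
            for job in self._producers.get(current, ()):
                for parent in self._inputs.get(job, ()):
                    if parent not in seen:
                        seen.add(parent)
                        stack.append(parent)
        return seen


# Example with made-up jobs and datasets.
registry = LineageRegistry()
registry.record_run("clean_orders", inputs=["raw_orders"], outputs=["orders"])
registry.record_run("daily_revenue", inputs=["orders", "fx_rates"], outputs=["revenue"])

print(sorted(registry.upstream("revenue")))  # ['fx_rates', 'orders', 'raw_orders']
```

Because every run records both sides of the relationship, a team that owns `revenue` can see that it ultimately depends on `raw_orders`, even though a different job reads that dataset directly.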

