Data Management Archives

Pinecone

BOOK A MEETING FR Pinecone is a fully managed vector database that makes it easy to add vector search to production applications. It combines vector search libraries, capabilities such as filtering, and distributed infrastructure to provide high performance and reliability at any scale. Features 1. Similarity search 2. Image similarity search 3. Audio similarity search …

Continue reading “Pinecone “

Milvus

BOOK A MEETING FR Milvus is an open-source vector database built to power AI applications and vector similarity search. It is available in: Milvus standalone Milvus cluster Features 1. Millisecond search on trillion vector datasets 2. Simplified unstructured data management 3. Reliable, always on vector database 4. Highly scalable and elastic Hybrid search 5. Unified …

Continue reading “Milvus “

Marquez

BOOK A MEETING FR Marquez is an open source metadata service for the collection, aggregation, and visualization of a data ecosystem’s metadata. It maintains the provenance of how datasets are consumed and produced, provides global visibility into job runtime and frequency of dataset access, centralization of dataset lifecycle management, and much more. Marquez was released …

Continue reading “Marquez”

lakeFS

BOOK A MEETING FR lakeFS is an open source platform that delivers resilience and manageability to object-storage based data lakes. With lakeFS you can build repeatable, atomic and versioned data lake operations – from complex ETL jobs to data science and analytics. Features 1. Exabytes scale version control 2. Git-like operations: branch, commit, merge, revert …

Continue reading “lakeFS “

Intake

BOOK A MEETING FR Intake is a lightweight set of tools for loading and sharing data in data science projects. Intake helps you: Features 1. Load data from a variety of formats (see the current list of known plugins) into containers you already know, like Pandas dataframes, Python lists, NumPy arrays, and more. 2. Convert …

Continue reading “Intake”

DVC

BOOK A MEETING FR Data Version Control is a new type of data versioning, workflow, and experiment management software, that builds upon Git (although it can work stand-alone). DVC reduces the gap between established engineering tool sets and data science needs, allowing users to take advantage of new features while reusing existing skills and intuition. …

Continue reading “DVC”

Dolt

BOOK A MEETING FR Dolt is a version controlled relational database. Dolt implements a superset of MySQL. It is compatible with MySQL, and provides extra constructs exposing the version control features, which are closely modeled on Git. Features 1. Compatible 2. Lineage & Time Travel 3. Collaboration Official website Link Tutorial and documentation Click here …

Continue reading “Dolt “

Delta Lake

BOOK A MEETING FR Delta Lake is an open source project that enables building a Lakehouse architecture on top of data lakes. Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing on top of existing data lakes, such as S3, ADLS, GCS, and HDFS. Features 1. ACID Transactions 2. …

Continue reading “Delta Lake”

Arrikto

BOOK A MEETING FR A complete machine learning platform that simplifies, accelerates, and secures model development through production Features 1. Simplified deployment 2. ML monitoring 3. Life cycle management 4. Compliance Official website Link Tutorial and documentation Click here to view See more MLOps tools and solutions Montreal 1275 Av. des Canadiens-de-Montréal, Montréal, QC H3B …

Continue reading “Arrikto”

MLOP Category Archives:

Pinecone

Milvus

Marquez

lakeFS

Intake

DVC

Dolt

Delta Lake

Arrikto

Montreal

Los Angeles

Dubai

Doha

Follow us:

Subscribe to our newsletter

Enter your contact information to continue reading