Hi Kodai, great questions.
1a (CBOX): during our beta phase, there are no HW changes. Existing Ice Lake and AMD EPYC (CPU-only) systems will be used. For services during beta which require GPUs, partners and customers will allocate/provision one or more of the following:
a. GPU systems which can be added to a Kubernetes (k8s) cluster. VAST is deploying a series of services on k8s which allow bi-directional communication with a VAST cluster, so that the VAST control plane can monitor and manage certain types of services (this is evolving as we iterate on our codebase).
b. GPU systems which are separate from k8s and are 100% customer-managed. Interaction with models deployed on those GPU servers will occur via configuration on the pipelines customers define on their VAST cluster. For example, if an NVIDIA NIM model is required for inference, and the model is hosted on an existing, non-managed GPU server, a customer could set an environment variable on their VAST-managed pipeline to send inference calls to a defined model endpoint (e.g. https://mygpu.client.com/v1/...)
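To make option (b) concrete, here is a minimal sketch of the env-var pattern. The variable name `MODEL_ENDPOINT` and the helper function are assumptions for illustration only; the actual variable names depend on how a given pipeline is configured. (NIM servers expose an OpenAI-compatible API, so the routed calls would typically go to paths like `/v1/chat/completions`.)

```python
import os

def resolve_model_endpoint(default="http://localhost:8000/v1"):
    """Return the inference endpoint the pipeline should call.

    If MODEL_ENDPOINT is set (e.g. pointing at a customer-managed GPU
    server), inference calls are routed there; otherwise fall back to
    a default. Variable name is hypothetical, for illustration.
    """
    return os.environ.get("MODEL_ENDPOINT", default).rstrip("/")

# Example: point the pipeline at an externally managed NIM server.
os.environ["MODEL_ENDPOINT"] = "https://mygpu.client.com/v1"
base = resolve_model_endpoint()
chat_url = f"{base}/chat/completions"  # OpenAI-compatible path on NIM
```

The key point is that the VAST-managed pipeline never needs to manage the GPU server itself; it only needs a reachable endpoint URL.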
- AI engineering → It seems your question is more about vector databases than the broader scope of ‘AI engineering’. VAST has already implemented a large-scale database platform. What’s missing in the current GA code is support for the data structures and query/search optimizations typically associated with searching vector embeddings. We are building these as extensions to our existing database, and will launch initial support for using VAST as a native vector store later this year.
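For context on what those query/search optimizations replace, here is the brute-force baseline every vector store starts from: exact top-k cosine-similarity search over stored embeddings. This is a generic illustration of the concept, not VAST's API; dedicated vector indexes (e.g. HNSW, IVF) exist to answer this same query without scanning every row.

```python
import numpy as np

def top_k_cosine(query, vectors, k=3):
    """Exact nearest-neighbor search over embedding vectors.

    Normalizes everything, scores each stored vector by cosine
    similarity to the query, and returns the k best matches.
    """
    q = query / np.linalg.norm(query)
    v = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    sims = v @ q                   # cosine similarity per stored vector
    idx = np.argsort(-sims)[:k]    # indices of the k most similar rows
    return idx, sims[idx]

rng = np.random.default_rng(0)
db = rng.normal(size=(1000, 64))          # 1000 stored embeddings, dim 64
q = db[42] + 0.01 * rng.normal(size=64)   # a query very close to row 42
idx, scores = top_k_cosine(q, db)
```

A scan like this is O(n·d) per query, which is exactly why a database needs purpose-built index structures before it can serve vector search at scale.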
The ‘short’ answer is “yes, VAST could potentially replace Milvus, Chroma, etc.”…once we complete our R&D effort.