A few questions on a simple RAG app

jonasrosland · February 26, 2025, 5:58pm

@mrunal-modi asked in Discord:

Does vast data allow for vector search index too? So we can avoid both external vector db and search index solutions?
Context Assembly i.e. process of gathering relevant data, such as retrieved vector search results and other contextual information, before sending a query to the LLM for response generation can be hosted inside the VAST data platform too or is on the API server?

jonasrosland · February 26, 2025, 5:58pm

@brian.verkley responded:
There is actually a range of answers here. First, as you appear to have a good handle on, VAST can host the data, and will process the vector embedding creation based on triggers as data is added or changed in the system. These vectors can be stored in our vector database. When we come to RAG, VAST doesn’t make any of the search, index, or context assembly software, but we use and can run NVIDIA NIMS in our kubernetes cluster. Our VAST InisghtEngine with NVIDIA product is an example of this fully managed solution, where those NIMS are available to you out of the box. But if you have specific needs where you need different software for any component, we’re happy to adapt.

Topic		Replies	Views
Does VAST Data have native vector search capabilities similar to MongoDB Atlas? Ask a Question	1	25	June 10, 2025
Does VAST have a vector database? Ask a Question	1	28	June 2, 2025
Vectors, Vision, Velocity: VAST RAG Pipelines in Action VAST News	0	18	May 16, 2025
Introducing VAST Vector Search: Real-Time AI Retrieval Without Limits VAST News	0	25	May 8, 2025
What is Insight Engine? Use Cases vast-customer	7	258	January 29, 2025

A few questions on a simple RAG app

Related topics