Software

1 minute read

Vector Databases: Search by Meaning, at Scale

June 23, 2026

Embeddings turn meaning into vectors (last post). But if you have a million of them, how do you find the right ones for a query — fast? That’s what a vector database does, and it’s the retrieval engine behind every RAG app. Here’s a live semantic search demo.

🗂️ Search by meaning (not keywords): https://dev48v.infy.uk/ai/days/day14-vector-databases.html

Search becomes “find the nearest vectors”

Embed your query into the same space as your documents, then find the document vectors closest to it (by cosine similarity). Because closeness = meaning, the query “how do I reset my password” matches a doc about “recovering account access” — even with zero shared keywords. The demo shows this beating a keyword search that returns nothing.

Why you need a database, not a for-loop

Comparing your query to every vector (brute-force kNN) is fine for hundreds, hopeless for millions. Vector DBs use ANN (approximate nearest neighbour) indexes like HNSW to find the closest vectors in milliseconds — trading a tiny bit of accuracy for huge speed.

What a vector DB actually stores

Vectors + the original text + metadata, behind an ANN index. Pipeline: chunk your docs → embed → upsert. Query: embed the question → search top-k → (often) filter by metadata or combine with keyword search (hybrid).

This is the retrieval half of RAG. Real options: Pinecone, Weaviate, Chroma, pgvector, FAISS.

🔨 Build it (embed → upsert → similarity search → top-k → RAG) on the page: https://dev48v.infy.uk/ai/days/day14-vector-databases.html

Part of AIFromZero. 🌐 https://dev48v.infy.uk

tl.extend — Register Custom CSS Variants Anywhere in Your Codebase, No Central Config Required

June 23, 2026

Quality Assurance

AI in the Quality Department: A Practical Path Forward for Small Manufacturers

June 23, 2026

M	T	W	T	F	S	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Hand-Picked Top-Read Stories

AI in the Quality Department: A Practical Path Forward for Small Manufacturers

Vector Databases: Search by Meaning, at Scale

tl.extend — Register Custom CSS Variants Anywhere in Your Codebase, No Central Config Required

Trending Tags

Vector Databases: Search by Meaning, at Scale

Search becomes “find the nearest vectors”

Why you need a database, not a for-loop

What a vector DB actually stores

Leave a Reply Cancel reply

Previous Post

tl.extend — Register Custom CSS Variants Anywhere in Your Codebase, No Central Config Required

Next Post

AI in the Quality Department: A Practical Path Forward for Small Manufacturers

Vector Databases: Search by Meaning, at Scale

Search becomes “find the nearest vectors”

Why you need a database, not a for-loop

What a vector DB actually stores

Leave a Reply Cancel reply

Previous Post

Next Post

Related Posts