Browsing Tag
rag
39 posts
How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
Most “chat with your documents” demos work in an afternoon. Then you hit the last 20%: retrieval that…
Extract Plain Text from Medium Posts for RAG and Search Indexes
Chunk clean article content for embeddings, summarization, and full-text search—skip nav, clap bars, and scripts. Extract Plain Text…
Why output-stage PII masking is the wrong protective surface for data exfiltration in RAG
“The output filter runs after the LLM has already seen the confidential data. By then, three classes of…
How to Evaluate Vector Databases in 2026
In 2026, a synthetic performance crisis challenges the vector database market. A GitHub search for “vector database benchmark”…
Knowledge Base Software for B2B Support: Architecture, API Design, and AI Readiness
Most teams don’t struggle to choose a knowledge base. They struggle to choose the right one for how…
Parametric Hubris: Empirical Evidence That Tool Availability Does Not Equal Tool Usage in Frontier Language Models
Parametric Hubris: Empirical Evidence That Tool Availability Does Not Equal Tool Usage in Frontier Language Models Frontier large…
Teaching Machines to Understand Documents with Docling
Docling Exploration Introduction In this experiment I have explored the Docling CLI and used it to parse a…
Most GenAI chatbot tutorials stop at “call an LLM get an answer.”
That’s fine… until you try to build something real. While some people turn this gap into multi-hour courses…
I Built an MCP Server to Search Documentation from Claude (So You Don’t Have to Web Search)
Ever been coding with Claude and needed to check the docs for a library? You either: Open a…
Retrieval Strategy Design: Vector, Keyword, and Hybrid Search
This article explains how to design a modern retrieval strategy for AI systems, especially Retrieval-Augmented Generation (RAG). The…