Been running Llama 3.3 70B via Groq for coding tasks and kept losing architectural decisions across sessions. “We use PostgreSQL” — forgotten. “Auth is JWT” — re-debated. Every new chat starts from zero.
So I built steerhead — a proxy that sits between you and any OpenAI-compatible API and manages context via SQLite instead of accumulated chat history.
The trick: every message is a single-shot API call. Steerhead assembles the system prompt from stored constraints + file history, fires one clean call, then auto-extracts any decisions the model made (via a second LLM pass) and stores them for next time.
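A rough sketch of that flow — the names, schema, and helpers here are hypothetical stand-ins for illustration, not steerhead's actual internals:

```python
import sqlite3

def assemble_system_prompt(db: sqlite3.Connection, project: str) -> str:
    """Build a compact system prompt from stored constraints (hypothetical schema)."""
    rows = db.execute(
        "SELECT text FROM constraints WHERE project = ? ORDER BY created_at",
        (project,),
    ).fetchall()
    return "Project constraints:\n" + "\n".join(f"- {text}" for (text,) in rows)

def handle_message(db, project, user_message, call_llm, extract_decisions):
    """Single-shot flow: assemble stored context, fire one clean call,
    then persist any decisions the model made for future sessions."""
    system = assemble_system_prompt(db, project)
    reply = call_llm(system=system, user=user_message)   # one API call, no history
    for decision in extract_decisions(reply):            # second LLM pass
        db.execute(
            "INSERT INTO constraints (project, text, created_at) "
            "VALUES (?, ?, datetime('now'))",
            (project, decision),
        )
    db.commit()
    return reply
```

The point is that the next call to `assemble_system_prompt` already includes whatever was decided this turn, so no conversation history needs to be replayed.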
Result: 146 tokens of surgical context instead of 80K tokens of degrading conversation history. New session? The model still knows your entire project’s decisions.
Works with:
- Groq (free tier, tested with Llama 3.3 70B)
- Ollama (local)
- OpenRouter (free models)
- Any OpenAI-compatible endpoint
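All of those backends speak the same wire format, so supporting them is mostly a matter of swapping the base URL. A stdlib-only sketch of what an OpenAI-compatible chat request looks like (model name and URLs are examples, not steerhead's config):

```python
import json
import urllib.request

def build_chat_request(base_url: str, model: str, system: str, user: str):
    """Build an OpenAI-compatible /chat/completions request.
    The same shape works for Groq, Ollama, OpenRouter, etc."""
    payload = {
        "model": model,
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": user},
        ],
    }
    return urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Swapping providers is just swapping base_url, e.g.:
#   Groq:   https://api.groq.com/openai/v1
#   Ollama: http://localhost:11434/v1
```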
What’s there: project-scoped DBs, session persistence, auto constraint extraction, React UI
What’s next: git diff capture, drift detection, memory classification (inspired by Cloudflare’s Agent Memory announcement)
Stack: FastAPI + SQLite + React. Fully local, MIT licensed.
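"Fully local" here means the whole memory layer is one SQLite file per project. A hypothetical minimal schema to give a feel for it (the real one is in the repo):

```python
import sqlite3

SCHEMA = """
CREATE TABLE IF NOT EXISTS sessions (
    id         INTEGER PRIMARY KEY,
    project    TEXT NOT NULL,
    started_at TEXT DEFAULT (datetime('now'))
);
CREATE TABLE IF NOT EXISTS constraints (
    id         INTEGER PRIMARY KEY,
    project    TEXT NOT NULL,
    text       TEXT NOT NULL,
    source     TEXT,             -- e.g. 'user' or 'extracted'
    created_at TEXT DEFAULT (datetime('now'))
);
"""

def open_project_db(path: str) -> sqlite3.Connection:
    """One SQLite file per project keeps memory local, portable, and inspectable."""
    db = sqlite3.connect(path)
    db.executescript(SCHEMA)
    return db
```

Because it is plain SQLite, you can inspect or edit your project's stored decisions with any SQLite client.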
Looking for contributors — especially around constraint extraction accuracy and drift detection.
GitHub: https://github.com/josephmjustin/steerhead