Dev Team
I built the first open benchmark for federal contracting AI. Here’s what it shows about frontier LLMs.
If you ask GPT-4o or Claude to extract Federal Acquisition Regulation clause numbers from a federal solicitation, a…