Brady Tengberg
11 posts
Reducing LLM Cost and Latency Using Semantic Caching
Running large language models in production quickly exposes two operational realities: every request costs money, and every request…
How to Hide API Endpoints in Next.js
In Next.js, securing sensitive data and preventing the exposure of API endpoints is critical for building secure web…
Waterfall-Model software engineering. Does it still make sense nowadays?
Back in the 1960s, when computers were still a relatively new frontier, the term “software engineering” was first…
AI agents for marketing — I talked to experts about the benefits
I love the Back to the Future series, especially Part II where we see “the future.” Of course,…
Linux Firewall: Blocking a lot with a little
I have a need to block a large list of ever changing IP addresses from servers and systems…
Create a QR Code Generator Using ToolJet and Python in 5 Minutes! 🛠️
This quick tutorial will guide you through the steps to create a QR Code Generator application using ToolJet.…
How to package and deploy a Lambda function as a container image
Introduction: AWS Lambda allows you to run your code inside Docker containers! This feature opens up a lot…
Kubernetes fail compilation: but they keep getting worse
I’ve never been a big gambler. I might have placed a small bet in the past to spice…
Balancing budget for user testing in product management: three case studies
Katarzyna Malecka shares user testing strategies for all budgets, including 3 case studies and key insights to improve…