Blog

May 31, 2026

Agent-First Development in VS Code: The 5 Variables That Actually Determine Your Results

Most developers running VS Code agent mode optimize 3 of 5 variables that determine output quality. The VS Code team finally named all five. Here is what each one controls.

May 30, 2026

Claude Code in Production: What Senior Engineers Actually Need to Know

Most teams using Claude Code are running it at 20% of its actual capability. Here is what teams actually getting leverage are doing differently — from CLAUDE.md to multi-agent file coordination.

May 29, 2026

Google's New Lighthouse Audit Scores How AI Agents See Your Site

Lighthouse now has an Agentic Browsing category. It audits llms.txt, WebMCP integration, and AI accessibility — not for humans, but for the agents navigating your site.

May 28, 2026

What the Top 1% of Developers Build With Claude Code Dynamic Workflows

Multi-agent orchestration is here. HoneyBook collapsed 10 Jira round-trips to one. incident.io runs 4-7 concurrent agents. Anthropic engineers no longer write code. Here is what they are all doing.

May 27, 2026

Should You Actually Use Redis Iris? An Honest Builder's Verdict

Redis Iris solves real production agent problems. It also requires real operational commitment. Here is the 5-question checklist I use to decide.

May 26, 2026

Redis Iris vs. Pinecone Nexus vs. Naive RAG: When to Use What

The only question that matters when choosing a retrieval architecture: how fast does your data change? Here is the full comparison with three concrete scenarios.

May 25, 2026

Why Your RAG Pipeline Fails in Production

Most production RAG failures are not model problems. They are runtime data problems. Here is what actually breaks and why better retrieval does not fix it.

May 24, 2026

How Redis Iris Actually Works: A Builder's Breakdown

Redis Iris has four moving parts. Here is how RDI, the Context Retriever, Agent Memory, and LangCache actually work, with honest tradeoffs on each.

May 22, 2026

The Best Agent Evals Come From Production Failures, Not Design Sessions

Most teams spend weeks designing agent evals from scratch. The ones that build better agents discover them from real traces and real failures. Here is what that actually looks like.

May 17, 2026

OpenAI Just Went Full Palantir

OpenAI's Tomoro acquisition is not a consultancy buy. It is the Palantir model applied to frontier AI, and it is the smartest enterprise move OpenAI has made.