Tag

AI

35 posts on AI

All (83)AI (3)Architecture (1)Developer Experience (1)Developer Productivity (1)Engineering Management (3)Metrics (1)Platform Engineering (1)Product (2)Software Development (1)agents (6)agile (1)ai (35)analytics (1)architecture (23)benchmarks (1)business (4)career (1)claude-code (3)cloud (1)communication (1)cost (3)culture (9)database (2)databases (1)developer-experience (6)devex (9)devops (12)docker (1)elasticsearch (1)embeddings (1)engineering (25)engineering management (1)engineering-management (23)finops (1)gpu (1)guide (1)hiring (2)infrastructure (16)interviews (1)kafka (1)kubernetes (3)leadership (3)llm (10)mcp (1)meta (1)monorepo (1)navigation (1)networking (1)observability (3)on-call (1)patterns (1)performance (1)postgres (1)process (1)product (17)product-management (2)productivity (8)prompt-engineering (2)qdrant (1)queues (1)rag (3)reliability (2)saas (1)scaling (1)search (1)security (7)software-engineering (17)startups (2)strategy (4)teams (6)technical-debt (1)testing (5)threat-intelligence (1)tooling (1)tools (4)typescript (1)vector-search (3)vibe-coding (1)web (2)wordpress (1)

May 20, 20266 min read
Context Window Management Is a New Engineering Discipline
LLMs have finite context. Managing what goes in — and when — is now a first-class engineering concern, not a prompt hack.
ai engineering architecture
May 20, 20266 min read
LLM Output Is Not Data
Engineers pipe LLM output into downstream systems as if it were structured data. It isn't. That mismatch is a whole class of production bugs.
ai engineering architecture
May 17, 20265 min read
Your AI Agent Is a Privileged Insider
When you give an AI agent access to your tools, you've created a privileged insider. The threat model is different from a compromised service — because the agent acts non-deterministically, at scale, on your behalf.
ai security engineering devops architecture
May 17, 202610 min read
Your AI Agent Has a 90% Step Score. Here's Why It's Failing 65% of Runs.
A 10-step AI agent pipeline at 90% per-step reliability succeeds only 35% of the time. This is the compounding reliability math that explains why 78% of companies run pilots but only 14% ship agents to production — and the architecture that closes the gap.
ai agents architecture engineering reliability
May 17, 20266 min read
Your CTI Pipeline Is Already Contaminated
Threat intelligence was built on the assumption that your analysis layer is neutral. LLMs trained on public CTI reports aren't neutral — they've absorbed adversarial narratives, attribution biases, and threat actor disinformation before you wrote a single query.
ai security threat-intelligence engineering
May 17, 20266 min read
Observability Is Broken for AI Systems
Traces, metrics, and logs were designed for deterministic systems. When an agent makes 40 tool calls across three services to complete a task, your existing observability stack tells you almost nothing useful.
ai engineering observability architecture devops
May 17, 20266 min read
Prompt Injection Is the New SQL Injection
In 2002, SQL injection was a known attack that most developers dismissed as someone else's problem. By 2010 it was the top cause of data breaches. Prompt injection is at the 2002 stage. The trajectory is the same.
ai security engineering architecture
May 17, 20266 min read
Your Security Policy Wasn't Written for AI Agents
IAM roles, network policies, secrets rotation schedules — all designed for humans or static services. AI agents are neither. They're dynamic, non-deterministic actors with legitimate credentials, and your current policy model doesn't account for them.
ai security engineering devops
May 16, 20268 min read
Product Management Is the New Engineering Bottleneck. Andrew Ng Already Said It.
AI made engineers 10x faster. PMs didn't keep up. Andrew Ng named it. LinkedIn already restructured around it. Here's what your team should actually do.
product engineering-management ai strategy product-management
May 15, 20266 min read
Your Roadmap Was Built for a World Where Shipping Was Hard
AI just cut engineering cycle time by 80%. Your feature-decision process still takes three weeks. You didn't solve delivery. You exposed discovery.
product ai engineering-management strategy product-management
May 14, 20269 min read
81% Is Marketing. AI Coding Benchmarks Are Contaminated — Here's the Real Score.
SWE-bench Verified is broken. OpenAI officially stopped using it. The same models scoring 80%+ on Verified score only 23% on the contamination-resistant version. Here's what happened, why it matters, and how to actually evaluate AI coding tools.
ai engineering tools benchmarks product
May 13, 20268 min read
Clean Code Is Your AI Tax Rate
AI agents don't make your messy codebase invisible — they make it expensive. When 78% of Claude Code sessions involve multi-file edits, your architecture quality is no longer a code-quality concern. It's a cost and velocity concern.
architecture ai engineering technical-debt product
May 12, 20268 min read
Your AI Agent Has Amnesia. Here's the Architecture That Fixes It.
Long-running agents fail 90% more often without state persistence. This is the memory architecture — working, episodic, semantic, procedural — that makes stateful AI production-ready.
ai architecture agents engineering product
May 11, 20268 min read
The SaaSPocalypse Wasn't a Tech Story — It Was a Pricing Model Reckoning
$285 billion disappeared from SaaS valuations in 48 hours in February 2026. Most analysis blamed AI agents. The real mechanism was a 25-year pricing assumption that everyone forgot was an assumption.
product ai strategy saas business
May 10, 20268 min read
The Delegation Gap: You're Using AI Like a Junior Dev When You Could Run a Whole Team
Anthropic's 2026 Agentic Coding Trends Report shows devs use AI in 60% of their work but fully delegate only 0–20% of tasks. Here's the exact playbook to close that gap with Claude Code Agent Teams.
ai engineering productivity claude-code agents
May 3, 202615 min read
The Spec Is Now the Code: Why Spec-Driven Development Is the Skill Nobody's Talking About
AI agents can execute from a precise spec. The real bottleneck shifted from writing code to writing what you want — clearly. Here's what changed, why it matters for engineers, PMs, and managers, and how to actually do it.
ai software-engineering product engineering-management
May 2, 202612 min read
The One-Person Company Is Real. Here's What It Actually Takes.
Base44 sold for $80M. Medvi hit $401M with one employee. The one-person company isn't a thought experiment anymore — but the playbook everyone's selling you is missing the hard parts.
ai startups product engineering management
May 1, 202615 min read
How to Structure an Engineering Team When AI Writes 41% of the Code
The org chart most teams run was designed when humans wrote all the code. Anthropic's 2026 data says that assumption is gone. Here is what the structure should look like now — and what roles actually matter.
engineering-management ai teams software-engineering product
April 30, 202610 min read
93% of Developers Use AI. Your Team Is Still Missing Deadlines. Here's Why.
Faros AI tracked 22,000 developers and found individual AI gains evaporate at the org level. PR merge times are down 20%. Incidents are up 23.5%. Here is the mechanism — and what actually fixes it.
ai engineering-management software-engineering productivity teams
April 30, 20264 min read
Embedding Models: Which One, and Why It Matters Less Than You Think
Embedding model choice is a 5% problem for most RAG systems. Your chunking strategy is the 50% problem. Here's how to pick anyway.
ai llm rag embeddings vector-search
April 30, 20265 min read
Prompts Are Code: How to Version, Test, and Deploy Them
Your AI feature has a 200-line system prompt living in a string in app.py. That's tech debt. Here's how to treat prompts like first-class artifacts.
ai llm software-engineering prompt-engineering
April 30, 20264 min read
Prompt Caching: The Cost Math Most Teams Get Wrong
Prompt caching is not a 90% discount. It's a 90% discount on the static parts only. Here's how to actually compute your cache savings.
ai llm cost claude-code prompt-engineering
April 30, 20265 min read
Testing AI Features: Why Unit Tests Lie and What to Do Instead
Your AI feature passes 100% of unit tests and ships broken to users every other week. Here's why, and how to actually test LLM-powered systems.
ai llm testing software-engineering
April 29, 20264 min read
How to Prep for a Tech Interview Using AI (Without Looking Clueless)
AI can boost your interview odds by 40%. Here is how to use Claude to prepare—and exactly what to do (and not do) in the room.
ai career interviews
April 29, 20263 min read
Why Your AI Product Feels Broken (Even Though the Model Is Good)
Claude 4 didn't get stupider. Your safety layer is failing. How to identify when the problem is your architecture, not the LLM.
ai product architecture llm
April 29, 20265 min read
Why Your Company's AI Strategy Isn't One (And What You're Actually Missing)
Every company says they have an AI strategy. Most are just feature roadmaps with AI stickers on them. Here is the difference that matters.
ai business strategy product
April 29, 20267 min read
The PM Who Ships: AI Agents Just Collapsed the Distance Between Idea and Production
The 6-week sprint was invented because execution was expensive. AI coding agents just made execution cheap. Here's what that means if you're a product manager.
product ai engineering tools
April 28, 20267 min read
MCP Is Not a Better Function Calling. It's a Different Layer Entirely.
Ten months after MCP went multi-vendor, most teams are still treating it as a nicer function-calling wrapper. That's the wrong mental model — and it's quietly producing architectures that don't scale.
ai architecture mcp agents llm software-engineering
April 27, 202613 min read
86% of Multi-Agent Systems Die Before Production. Here's Why.
A MAST taxonomy of 1,600+ execution traces maps 14 failure modes across 3 root causes. The model is almost never the problem. The orchestration architecture almost always is.
ai architecture agents software-engineering llm
April 26, 202611 min read
Context Engineering Is Just Systems Design (And Most Teams Are Starting Over)
82% of AI teams say prompt engineering alone isn't enough. The ones succeeding in production are treating context design the same way they treat database indexes — as an architectural decision, not a prompt trick.
ai architecture software-engineering llm agents
April 20, 20267 min read
The Security Bill for Vibe Coding Is Coming Due
Georgia Tech tracked 35 CVEs from AI-generated code in March 2026 alone — more than all of 2025 combined. Here's what the data says, why it's happening, and what a secure AI workflow actually looks like.
ai security software-engineering vibe-coding
April 15, 20265 min read
The junior hiring trap
Every team quietly raising the bar on junior reqs thinks it's being smart. They're building a talent debt that won't show up on any dashboard until it's already too late.
engineering-management hiring ai
April 10, 20266 min read
Self-Hosting an LLM on Kubernetes
Managed inference APIs are convenient until they are not. Here is the full picture of running your own LLM on Kubernetes: GPU scheduling, model storage, vLLM vs Ollama, and the operational tradeoffs.
kubernetes llm ai gpu infrastructure
April 5, 20263 min read
Reading code is the bottleneck now
AI agents made writing code cheap. The skill that actually matters shifted to reading what they produced and deciding whether to keep it.
ai software-engineering claude-code
March 28, 202610 min read
RAG in Production: How Retrieval-Augmented Generation Actually Works
LLMs don't know your data. RAG fixes that by turning your documents into a searchable knowledge base. Here is the full pipeline: chunking strategies, dense vs hybrid retrieval, re-ranking, and when to reach for graph-based RAG with LightRAG.
ai llm rag vector-search infrastructure

Context Window Management Is a New Engineering Discipline

LLM Output Is Not Data

Your AI Agent Is a Privileged Insider

Your AI Agent Has a 90% Step Score. Here's Why It's Failing 65% of Runs.

Your CTI Pipeline Is Already Contaminated

Observability Is Broken for AI Systems

Prompt Injection Is the New SQL Injection

Your Security Policy Wasn't Written for AI Agents

Product Management Is the New Engineering Bottleneck. Andrew Ng Already Said It.

Your Roadmap Was Built for a World Where Shipping Was Hard

81% Is Marketing. AI Coding Benchmarks Are Contaminated — Here's the Real Score.

Clean Code Is Your AI Tax Rate

Your AI Agent Has Amnesia. Here's the Architecture That Fixes It.

The SaaSPocalypse Wasn't a Tech Story — It Was a Pricing Model Reckoning

The Delegation Gap: You're Using AI Like a Junior Dev When You Could Run a Whole Team

The Spec Is Now the Code: Why Spec-Driven Development Is the Skill Nobody's Talking About

The One-Person Company Is Real. Here's What It Actually Takes.

How to Structure an Engineering Team When AI Writes 41% of the Code

93% of Developers Use AI. Your Team Is Still Missing Deadlines. Here's Why.

Embedding Models: Which One, and Why It Matters Less Than You Think

Prompts Are Code: How to Version, Test, and Deploy Them

Prompt Caching: The Cost Math Most Teams Get Wrong

Testing AI Features: Why Unit Tests Lie and What to Do Instead

How to Prep for a Tech Interview Using AI (Without Looking Clueless)

Why Your AI Product Feels Broken (Even Though the Model Is Good)

Why Your Company's AI Strategy Isn't One (And What You're Actually Missing)

The PM Who Ships: AI Agents Just Collapsed the Distance Between Idea and Production

MCP Is Not a Better Function Calling. It's a Different Layer Entirely.

86% of Multi-Agent Systems Die Before Production. Here's Why.

Context Engineering Is Just Systems Design (And Most Teams Are Starting Over)

The Security Bill for Vibe Coding Is Coming Due

The junior hiring trap

Self-Hosting an LLM on Kubernetes

Reading code is the bottleneck now

RAG in Production: How Retrieval-Augmented Generation Actually Works