Loop Engineering, Chapter 4: From Laptop Loops to LLMOps

This is Part 4 of a series walking through my book Loop Engineering: Scaling and Governing Agentic AI. In the previous chapter, we chose an orchestration framework. This one is about graduating from a script that works to a system you can operate.

Almost anyone can build what I call a laptop loop: a script on a personal machine that wakes up, does agentic work, and produces something useful. The laptop loop is a genuine achievement and a terrible production system at the same time, and holding both of those thoughts at once is the start of this chapter. It runs under a personal credential that should never have touched a server. Its spending is unmonitored, so a loop stuck retrying a $0.10 call can become a $10,000 surprise before anyone notices. It crashes mid-run and leaves the world half-finished. It has no memory of yesterday and no record an auditor could inspect.

The scale multiplier is the part people underestimate. On your laptop, under your eye, every one of those failure modes is a tolerable annoyance. The moment the same loop runs unattended for someone else, each one becomes an operational emergency, because nobody is watching and the loop is faster than the humans who would.

Why LLMOps is not just MLOps

It is tempting to assume the discipline that managed machine-learning systems will carry over, and some of it does. But managing a language system differs in four ways that matter. The model is external and mutable — it changes underneath you on a vendor's schedule. The output is language, which resists the clean numerical metrics MLOps was built around. The cost is a perpetual, variable token bill rather than a fixed training run. And the system takes actions in the world, which carries a kind of risk a prediction never did. LLMOps is the practice that grows up around those four differences, and pretending it is just MLOps with a new model is how teams get surprised.

💡 Key idea: The agent should never hold a provider key. Policy, budgets, and rate limits belong in one place that every agent passes through — not scattered, piecemeal, through application code where each one can be forgotten.

That one place is the AI gateway: a single door between every agent and every model provider. It centralizes model fallbacks so one provider's outage does not stop the system, enforces rate limits and spend ceilings centrally, vaults credentials so they never live in agent code, and gives you observability across every model, agent, and tool call at once. The chapter pairs it with a treatment of observability as a flight recorder — cost and token usage attached to every span — so that when something goes wrong you can reconstruct exactly what happened and what it cost.

Tomorrow: operating a loop safely is not the same as trusting it. Chapter 5 opens the governance section with the security principle most agentic systems get wrong.

📖 Get the book

The full chapter — the laptop-loop failure catalog, the LLMOps-versus-MLOps breakdown, a declarative gateway config, and an observability sketch with cost accounting — in one place.

Get Loop Engineering on Amazon →

2026-06-18

loop-engineering

llmops

ai-gateway

observability

mlops

agentic-ai

book-series

Sho Shimoda

I share and organize what I’ve learned and experienced.

Search Logs

Deploy Teams bot to Azure 1404 IT assistant bot 1401 Hello World bot 1373 bot for sprint updates 1285 Teams production bot 1275 Teams bot development 1238 Microsoft Bot Framework 1235 Zendesk Teams integration 1197 Teams app zip 1194 Microsoft Teams Task Modules 1187 Bot Framework Adaptive Card 1186 Teams chatbot 1182 Teams bot tutorial 1172 Teams bot packaging 1164 Bot Framework example 1160 Task Modules 1134 Bot Framework proactive messaging 1127 Graph API token 1121 Bot Framework CLI 1119 Bot Framework prompts 1115 C 1107 sideload bot in Teams 1081 Azure App Service bot 1079 Azure CLI webapp deploy 1063 Adaptive Card Action.Submit 1053 Azure Bot Services 1046 Microsoft Graph 1024 identity in Teams 1017 Azure bot registration 1003 Adaptive Cards 1000

Development & Technical Consulting

Working on a new product or exploring a technical idea? We help teams with system design, architecture reviews, requirements definition, proof-of-concept development, and full implementation. Whether you need a quick technical assessment or end-to-end support, feel free to reach out.

Loop Engineering, Chapter 4: From Laptop Loops to LLMOps

Why LLMOps is not just MLOps

Sho Shimoda

Categories

Tags

Search Logs

Development & Technical Consulting