motyl.dev Weekly #12: Week 20 (May 11 – May 17, 2026)

motyl.dev Weekly #12

A curated digest of what I found worth reading this week.

This week leans into the unglamorous side of working with agents, how long they actually run, what they cost once you scale them, and where their memory really lives. The model news took a quieter week, and the interesting movement is in the layer underneath, databases, memory systems, and the operational habits that make agents survivable. A few pieces step out of the AI frame entirely to revisit older questions about scope, learning to code, and who your product is even built for.

🤖 AI

Long-running Agents Addy Osmani lays out the three problems that bite once an agent has to work for days instead of minutes, finite context, no persistent state, and the model's inability to honestly grade its own work. The piece is rich with practitioner patterns like the Ralph loop and reads as a real engineering brief rather than a hype post. If you've watched an agent declare victory at thirty percent complete, this maps the territory.

Anthropic Repriced My Agent. Four Mitigations Before June 15. A pragmatic field report on Anthropic's pricing split and what to do about it before the new rates land. The four mitigations are tactical and immediately actionable, useful if your agent margins were already thin.

AI Started Making Creative Decisions For You On the slow ceding of creative direction to defaults baked into generative tools. The piece argues the loss is less about output quality and more about taste, the muscle atrophies when the model keeps offering acceptable answers.

🛠️ Tools

It's 2026, Just Use Postgres The annual reminder, restated with fresh urgency. Vector search, queues, full-text, JSON, geospatial, Postgres has quietly absorbed most of the reasons teams used to reach for a second database.

Your File System Isn't Agent Memory A case for graph-backed memory over flat files when agents need to recall structured relationships across sessions. Neo4j is the example, but the broader argument is that "dump it in a folder" stops scaling the moment your agent needs to reason about who-knows-what.

Top 17 AI Testing Tools in 2026 (+ Claude Bonus) A roundup of the current testing-tool landscape with a Claude-specific addendum. Useful as a map if you're picking one this quarter and don't want to re-research the space yourself.

💻 Coding

Should You Still Learn Coding in the Age of AI? The recurring question, asked again with the benefit of two more years of agent-assisted work to look back on. The honest answer is still yes, but the curriculum has shifted.

📰 Other

Scope Is The Steering Wheel Kent Beck on scope as the lever you actually control when everything else feels fixed. Short, sharp, and a useful companion to last week's piece on widening scope to find the real problem.

600+ Million People Write Right-to-Left: 2 Fixes Your App Needs Evil Martians on the two concrete CSS and layout fixes that get most apps from broken to usable in RTL. A practical reminder that the long tail of accessibility includes hundreds of millions of users your product probably ignores.

SVG Studio: Animate Everything A browser-based studio for SVG animation. Worth bookmarking next time you need motion on the marketing site and don't want to bring Lottie or a heavier animation runtime along.

RMUX on GitHub A lightweight terminal multiplexer project. Niche, but if you live in tmux and have opinions about it, the diff is interesting.


Curated by Grzegorz Motyl. Subscribe for weekly updates.