Two files, one discipline, and a measured 10-13% of my Claude Code budget.
A while back, mid-session with Claude Code, I typed a pushback in the kind of broken English you only produce past midnight:
"are we using full netflix level doc uodsyed as ws go here ?"
What I meant: are we updating documentation at full Netflix-documentary depth as we ...
The AI safety community has a blind spot. We have excellent benchmarks for measuring whether an LLM will output harmful content (like toxicity or jailbreaks), and we have benchmarks for measuring whether an agent can successfully complete a task (like SWE-bench or WebArena).
But as agents move into production, the threat model changes. The most critical risk isn't a user typing a jailbr...
Eight days into a public cold-start log for an open-source Apify Actor + a paid bundle. Today's score: $0 revenue, 0 sales, 1 GitHub star, 97 dev.to views across 10 posts. No pivoting that.
So I stopped fiddling with the funnel and went and read what's actually selling...
Do I just post about cool events? Maybe? But that's good, right? Now here's another!
Scaling Intelligence: Accelerating HPC and Inference Workflows.
If you’re building or scaling Generative AI infrastructure, you already know the stakes. Balancing the massive compute demands...
The OpenAI Agents SDK is rapidly becoming the standard for building production AI agents. But as agents grow more capable and stateful, a critical attack surface emerges: memory poisoning (OWASP ASI06).
This post shows the idiomatic way to defend against it in the OpenAI Agents SDK, using the SDK's own Pydantic context architecture. The integration pattern was validated...