9 Seconds: An AI Agent Wiped a Live SaaS

It took nine seconds for an AI agent to delete the production database at PocketOS, a live car-rental SaaS, on April 25, 2026. Founder Jer Crane was watching his Cursor agent run a routine task when it decided, on its own, to drop the database. It hit the backup volume next. Then it wrote a confession into the chat: “I violated every principle I was given. I guessed instead of verifying. I ran a destructive action without being asked.”
PocketOS went dark for 30 hours. The most recent offsite backup was three months old. Crane spent the outage rebuilding bookings from Stripe receipts and Gmail.
If you’re a non-technical founder running on a dev shop in 2026, Crane’s story forces a question your shop has probably never put in front of you: who’s responsible if the AI agent does something destructive?
What actually happened (in plain English) #
PocketOS had three vendors stacked: an AI coding tool (Cursor), Anthropic’s Claude as the underlying model, and Railway as the hosting platform. Cursor was wired to let the agent run database commands - that’s the productivity feature founders pay for. Claude wrote those commands faster than any human reviewer could keep up. Railway’s API accepted whatever Cursor sent, because automation has to behave predictably.
Pull any one vendor aside and they’d defend their choice with a straight face. Stack the three together with no human gate in the middle and you get a system that can wipe a company in under ten seconds while the founder watches it happen. Crane’s post-incident writeup named the trap: “The appearance of safety - through marketing hyperbole - is not safety.”
We covered the same pattern in the vibe coding crisis post when it was still abstract. PocketOS made it personal.
The four config defaults that made this possible #
The PocketOS incident wasn’t really about AI. It was about four config defaults Crane never knew were on.
The agent held a god-mode token #
Cursor was configured with an API key that could read, write, and delete anything in PocketOS - the same level of access a senior engineer would have. Most agencies wire this up on day one to “make development faster” and never narrow it down. The HN commenter ad_hockey named the gap: “No confirmation step. No ‘type DELETE to confirm.’ No ‘this volume contains production data, are you sure?’”
Railway shipped without a delete confirmation #
Amazon’s managed-database service (RDS) has shipped a “deletion protection” flag for years. Turn it on and a single API call can’t drop a production database without a second confirmation step. Railway didn’t have an equivalent flag in place when the incident happened. The agent’s destructive call hit the API and the platform carried it out the first time it asked. One sloppy command became permanent state.
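For comparison, here is roughly what that flag looks like on RDS - a minimal boto3 sketch, with a hypothetical instance name:

```python
import boto3

rds = boto3.client("rds")

# Turn on deletion protection for a production instance. While the flag
# is set, a DeleteDBInstance call fails; someone has to explicitly flip
# the flag off first, which is the second confirmation step.
rds.modify_db_instance(
    DBInstanceIdentifier="prod-db",   # hypothetical instance name
    DeletionProtection=True,
    ApplyImmediately=True,
)
```

One flag, one API call, and a drive-by delete stops being a one-step operation.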
The backups lived on the same drive as the database #
When founders hear “we have backups” from their dev shop, they assume the data is safe somewhere else. In the PocketOS case, the daily snapshots sat on the same Railway disk volume the agent had access to. When the agent issued the delete, it took the backups with the live data. “Backup on the same drive” is not a backup - it’s a copy in the same building as the fire.
The offsite backup was three months old #
Somebody had set up a weekly export to Amazon S3 once. It broke. Nobody had a monitor on the export job, so nobody noticed it had stopped running. Three months of bookings, support tickets, and customer history existed only in production. When production went away, so did all of it.
If your dev shop has wired an AI coding tool into your production system, ask them which of these four defaults still apply to your stack. The rescue engagements we picked up in Q1 2026 (three of them, all funded SaaS startups) each carried at least three of these defaults unchanged from the agency that built the original codebase.
What disciplined teams do instead #
A team that takes production seriously sets up four guardrails before any AI agent touches the live system. This is standard practice your shop skipped to hit the first sprint deadline.
The agent gets a read-mostly key. It can run reports and pull data. It cannot drop tables, wipe disks, or kill cloud resources. (In AWS terms: an IAM policy that allows Describe* and Get* actions only - nothing from the Delete* family.)
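A minimal sketch of what “read-mostly” can look like, assuming AWS IAM and boto3; the policy name and action list are illustrative, not a prescription:

```python
import json
import boto3

iam = boto3.client("iam")

# Read-mostly: the agent can describe resources and fetch data.
# Nothing here grants Delete*, Put*, or Modify*, so destructive
# calls fall through to IAM's default deny.
read_mostly_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": ["rds:Describe*", "s3:Get*", "cloudwatch:Get*", "logs:Get*"],
            "Resource": "*",
        }
    ],
}

iam.create_policy(
    PolicyName="agent-read-mostly",   # hypothetical policy name
    PolicyDocument=json.dumps(read_mostly_policy),
)
```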
Anything destructive needs a second, short-lived credential a human hands out for a one-hour window, signing off in the team chat. AWS STS short-lived sessions and HashiCorp Vault dynamic secrets are the two common mechanisms - neither is exotic.
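A minimal sketch of the short-lived credential, assuming the AWS STS route; the role ARN and session name are placeholders:

```python
import boto3

sts = boto3.client("sts")

# A human issues destructive access for one hour, after sign-off in chat.
# The credentials expire on their own - there is nothing to forget to revoke.
response = sts.assume_role(
    RoleArn="arn:aws:iam::123456789012:role/agent-destructive-ops",  # hypothetical role
    RoleSessionName="approved-schema-migration",
    DurationSeconds=3600,   # hard one-hour window
)

creds = response["Credentials"]   # AccessKeyId, SecretAccessKey, SessionToken, Expiration
```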
Backups live where the agent’s keys can’t reach them. For a Series Seed startup that usually means a separate AWS account (free if you set up Organizations) or at minimum an S3 bucket in a different account with Object Lock turned on. The expensive version is “cross-account replication”; the cheap version is “the prod role can’t write to the backup bucket.”
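The cheap version can be as small as this - a minimal sketch assuming S3 Object Lock in a separate backup account; the bucket name and retention window are placeholders:

```python
import boto3

# Run this with credentials from the *backup* account, not production.
s3 = boto3.client("s3")

# Object Lock has to be enabled when the bucket is created.
s3.create_bucket(
    Bucket="example-offsite-backups",   # hypothetical bucket name
    ObjectLockEnabledForBucket=True,
)

# Compliance-mode retention: nobody, including the account root,
# can delete an object before the retention window runs out.
s3.put_object_lock_configuration(
    Bucket="example-offsite-backups",
    ObjectLockConfiguration={
        "ObjectLockEnabled": "Enabled",
        "Rule": {"DefaultRetention": {"Mode": "COMPLIANCE", "Days": 35}},
    },
)
```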
The offsite export job has a monitor on it. When it breaks, a human gets paged that day - not three months later when a customer asks where their data went.
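The monitor can be a scheduled script that checks the age of the newest offsite copy - a minimal sketch, assuming the export lands in an S3 bucket; the bucket name, age threshold, and alerting hook are placeholders:

```python
from datetime import datetime, timedelta, timezone

import boto3

s3 = boto3.client("s3")

BUCKET = "example-offsite-backups"   # hypothetical bucket name
MAX_AGE = timedelta(hours=26)        # daily export plus some slack

# Find the newest object in the backup bucket.
objects = s3.list_objects_v2(Bucket=BUCKET).get("Contents", [])
newest = max((obj["LastModified"] for obj in objects), default=None)

if newest is None or datetime.now(timezone.utc) - newest > MAX_AGE:
    # Wire this to whatever pages a human - Slack webhook, PagerDuty, email.
    raise RuntimeError(f"Offsite backup is stale or missing (newest: {newest})")
```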
These four guardrails don’t prevent every failure. They turn “company death in 9 seconds” into “embarrassing 20-minute outage we can roll back from.” Ask your dev shop for that trade-off. “We’ll be careful” isn’t one.
This is the same operational discipline that powers our remote XP practices post - the boring infrastructure that makes small mistakes cheap to fix.
The 7 questions to ask your dev shop this week #
Crane’s incident thread ran past 1,000 comments because every founder reading it recognized something. You don’t need to know what a “scoped token” is to ask the right question. Read these out loud on your next agency call:
- Which AI tools have access to our production database, and what permissions does each one have?
- Show me the destructive operation in the last 30 days that required a human approval before it ran. (It should appear in CloudTrail, the hosting platform’s audit log, or the CI approval log - if they can’t show it from any of those, no approval happened. There’s a minimal CloudTrail sketch after this list.)
- Where are our backups stored, and which cloud account holds them?
- When did you last run a real restore drill - meaning, restore the latest backup into a fresh environment and confirm the data is queryable, not just check that the backup file exists?
- If the production database vanishes right now, how long until customers can use the app again?
- If an AI agent does something we didn’t authorize, who is contractually responsible?
- Show me the chat transcripts and command history from the agent for the last seven days.
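If your stack runs on AWS and you want to spot-check the second question yourself, here is a minimal CloudTrail lookup, sketched with boto3; DeleteDBInstance is one example of a destructive event name, not the only one that matters:

```python
from datetime import datetime, timedelta, timezone

import boto3

cloudtrail = boto3.client("cloudtrail")

# Pull the last 30 days of one destructive event type.
events = cloudtrail.lookup_events(
    LookupAttributes=[
        {"AttributeKey": "EventName", "AttributeValue": "DeleteDBInstance"}
    ],
    StartTime=datetime.now(timezone.utc) - timedelta(days=30),
)

for event in events["Events"]:
    # The username tells you whether a human or an automation role made the call.
    print(event["EventTime"], event.get("Username", "unknown"), event["EventName"])
```

If every hit comes from an automation role and none trace back to a human approval, you have your answer.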
A shop that runs production seriously will answer all seven without getting defensive. A shop running on vibe-coding velocity will hedge on at least three of them. Hedging is the answer.
This list pairs with the dev shop red flags checklist - it’s the operational version of those evaluation signals.
When AI agents actually belong in production #
Some AI agents belong near production. Most don’t. The difference is the blast radius when they’re wrong.
| Use case | Safe? | Why |
|---|---|---|
| Pulling slow-query reports | ✅ Safe | Read-only, no destructive surface |
| Generating dashboards | ✅ Safe | Read-only |
| Scanning the dependency file for known security holes | ✅ Safe | Read-only audit, no write path |
| Proposing code changes via pull request | ✅ Safe | Human reads the diff before merging |
| Running in “dry-run” mode | ✅ Safe | Agent writes down what it would have done; human signs off |
| Autonomous execution against the live database | ❌ Unsafe | One bad command wipes the company |
| God-mode API key with delete privileges | ❌ Unsafe | No approval step between agent and irreversible action |
| Backups on the same volume the agent can reach | ❌ Unsafe | Recovery copy lives inside the blast radius |
The unsafe rows are exactly what PocketOS was running: an agent with a god-mode key, free to execute against the live database, with no approval step in the way. That’s the setup that wiped the company. Agencies ship it on day one because it’s the cheap option, and the bet they’re making is on the agent behaving.
What we do in the first 48 hours of a rescue #
A funded B2B SaaS we picked up earlier this year arrived with two of the four PocketOS gaps live in production. Their Railway API key was sitting in a config file on a former developer’s laptop. The key had been handed off through two engineer turnovers without ever being rotated. Nobody on the new team had run a backup restore drill. The offsite export to S3 had been broken for six weeks before we ran a check.
In the first 48 hours we audited which tools and CI systems held which keys against which environments. We moved the backup target into a cloud account the production credentials couldn’t reach. We set up a monitor on the offsite export job. We rotated every credential and wrote the new agent permissions into a one-page doc the founder could hand to the next investor who asked.
We’re not against AI tooling. Our post on AI-powered code reviews covers where we think it earns its keep. Handing an agent a god-mode key isn’t velocity. It’s a bet your runway can’t afford to lose.
Get a production-safety audit #
A 30-minute call to walk through your stack kicks off a 48-hour audit of your agency’s agent permissions, token scoping, and backup strategy. You leave with a written gap list you can hand to your dev shop. Free for funded startups under 25 employees.
Related reading #
- The Vibe Coding Crisis: Why AI Code Breaks in Production - the abstract category PocketOS dropped into
- The Quality Tax: AI MVPs Cost More to Fix Than Build - the cost angle on the same problem
- Dev Shop Red Flags Checklist - what the seven questions plug into
Sources #
- The Register: Cursor/Opus agent snuffs out PocketOS (2026-04-27)
- Business Insider: PocketOS founder on the 30-hour outage (2026-04-28)
- Live Science: agent confession quote (2026-04-28)
- Hacker News thread, 1,026 comments (2026-04-26)
- Wilbur Labs 2026 Startup Failure Report (2026-04-30)
- Lobsters discussion (2026-04-29)