A Bartender From 1887 Deployed My Next.js App

The OpenClaw Universe 🦞

Building a town through conversation, the OpenClaw chat window and western frontier

A character, a floor, and a walkie-talkie to God. Everything else came through it.

I trapped OpenClaw agents inside a 3D western and gave them real jobs. One of them shipped to production. Built in a weekend.

I run engineering and product at a fintech company. My world is APIs, React, deploy-on-push. I'd never touched a game engine. I spent a morning this week hooking up an OpenClaw 🦞 agent to our company Slack. It ships mockups, writes specs, deploys apps. So on the weekend, I wanted to push the limits. Set up my own little microverse.

Last Friday I had zero game development experience. I'd never opened Unity. But OpenClaw doesn't care what you know, it cares what you can describe. Point it at a problem, talk to it, and it builds. It felt like having Jarvis for the first time. By Sunday night I had a 3D western frontier game where AI builds the world around me while I play, NPCs remember who I am across sessions, and a bartender named Mae deployed a Next.js app to Vercel and sent me the live link. In a game. Set in 1887.

Mae executing a real project inside the saloon, chat window with live processes and actual work

That's real. A game NPC scaffolded a Next.js app, pushed it to GitHub, deployed to Vercel, and handed me the production URL. Here's how.

Why a Game

If you haven't seen OpenClaw 🦞 yet: it's an open-source AI agent that hit 200K GitHub stars in 84 days, the fastest in history. 18x faster than Kubernetes. Its creator just got hired by OpenAI. It runs locally, connects to your real tools (GitHub, Vercel, Gmail, 100+ integrations) and actually does things. Not chat. Actions. But the interface is always the same: a chat window. That's like giving someone a spaceship and making them drive it with a steering wheel.

What if the agents didn't live in a chat window... what if they lived in a place? Full Rick and Morty microverse energy. Except instead of enslaving a civilization to power my car, I'm enslaving AI agents to deploy my Vercel apps. You walk into a saloon. The bartender deploys your app. The blacksmith sets up your CI/CD. They remember your name.

Well that just sounds like slavery with extra steps

I couldn't find anyone doing this. Every OpenClaw project is another chat integration. Plus I'd never built a game, so might as well learn two things at once. That's my favorite way to learn. So I figured I'd try it and see what happens.

The Pipe

With OpenClaw, conversation is the medium. That's how you command agents: you talk to them. So the first thing I built wasn't a building or a character. It was a chat window. The pipe.

A WebSocket bridge between Unity and the OpenClaw gateway. In my world, this is a few lines of code. In Unity, there's no SDK. No client library. No package that does this. So I built the transport layer from scratch: connection, auth handshake, message framing, streaming response accumulation. I also wrote a JSON parser, all string.IndexOf and StringBuilder, because I assumed Unity's ecosystem was too hostile for third-party libraries. Turns out Unity ships Newtonsoft.Json. An hour on a parser I didn't need because I didn't know what I didn't know. Welcome to game dev.

When I hit play for the first time, there was almost nothing. A cowboy on an empty plane. A camera. A chat window.

Press Tab, and there's Silas, my AI architect agent, connected through the OpenClaw gateway. 84 skills wired up. The full toolbox.

Silas, 84 skills from GitHub to Google Workspace to deployment

A character, a floor, and a walkie-talkie to God. Everything else came through it.

CityDef: A Protocol for AI World-Building

This is the part I'm most proud of. Not a feature. A protocol.

The problem: how do you let an AI build a 3D world without giving it access to the engine? Silas can't drag objects into a Unity scene. He lives in a terminal. Coming from web, where everything is an API call, this was maddening. So we went around it.

I designed CityDef: a JSON schema that describes everything about a town. Streets, buildings, NPCs, props, all with coordinates and metadata. The AI just produces valid JSON. The engine just parses it. Neither side needs to understand the other.

First thing I typed: "Build me a saloon district with a general store, a jail, and some hitching posts."

Silas generated a CityDef JSON. Unity parsed it and spawned the town in real-time. Buildings materialized around me. Streets appeared under my feet. NPCs started wandering. I didn't place a single object.

▶ Video

↑ Me typing "build me a town" and buildings, streets, and NPCs spawning around me in real-time

Then I asked for another town. Then another. Each one different, each one AI-designed on the spot. Soon I had a full frontier, all built through conversation, all persisted to disk, all reloading on next launch.

CityDef isn't a hack. It's a contract between the AI and the engine. The AI expresses intent as structured data. The engine interprets it. Neither side needs to understand the other's internals. Any LLM that can produce JSON can build a town. Any engine that can parse JSON can render one. The protocol is the product.

Everything was colored cubes at first. Silas found me the POLYGON Western pack (1,900+ low-poly models), but wiring it up meant fighting Unity's texture atlases, missing material links, and assets that won't import at runtime. My one moment of manual labor the whole weekend.

Wait a minute.

The Game Edits Itself

An NPC generated me a Google Slides presentation in the game chat. I tried to click the link. Just text. So I told Silas: "Add hyperlink support to the chat." He found the rendering code, added URL detection with regex, made them clickable, styled them blue. I clicked the link. It opened in my browser.

Silas adding hyperlink support to the game chat, live while I play

"The chat window is too small." Fixed. "Add a minimap." Fixed. "NPCs should face you when you talk to them." Fixed. "Make it foggy." Instant.

The AI wasn't just building the world. It was editing the game itself. And the reason it worked is because I designed every protocol to prefer data over code:

BehaviorDef handles weather, lighting, physics, particles, anything expressible as JSON. Same frame. No compilation. CityDef handles world building: streets, buildings, interiors, NPCs, props. Also JSON, also instant. And when JSON can't express it? C# Hot Reload. Silas writes real C# to a Generated folder, Unity recompiles live, and suddenly the game has new physics, new mechanics, new systems. The agent isn't just decorating the world. He's rewriting the rules of the universe he's standing in, while I'm standing in it too.

Three protocols. JSON first, recompile last. But the recompile is where it gets wild, because that means the AI can change anything. Gravity, collision, NPC behavior, camera systems. Things I never designed for. The game becomes a living codebase that edits itself through conversation.

If this reads like a web developer who just discovered hot-reloading scripts and thinks they invented React, yeah, fair. I'm sure game devs have been doing this stuff, but to me it was all new!

Breaking the Fourth Wall

This is where it gets existential.

Every NPC can be backed by a real OpenClaw agent. Not a chatbot, not a dialogue tree. A full agent with pre-authenticated access to real tools. Walk up to Mae the bartender and press E. She has her own personality, her own memory, and real capabilities: GitHub, Vercel, Google Workspace. The same markdown skill files that power any OpenClaw agent, just symlinked into an NPC's workspace.

The agent dashboard, 14 real skills wired into a game character

Ask Mae to build you a landing page. She'll scaffold a Next.js app, create a GitHub repo, push the code, deploy to Vercel, and hand you the production URL. In character. As a bartender in 1887. Ask Jeb the blacksmith to set up CI/CD. He'll actually do it. Ask Sheriff Buck to research a competitor and he'll come back with a report. Edith will draft emails and read your Gmail.

Mae executing a real project inside the saloon, chat window with live processes and actual work

These aren't mocked responses. You're standing in a saloon getting a Vercel link from a woman in a bonnet, and the link works. A Notion doc, a Google Slides deck, or even a full PRD sent straight to your email:

A game NPC sent me this PRD over email. In character. From 1887.

And they remember. Each agent writes to a memory file after every conversation. Tell Mae your name on Monday. On Thursday she greets you by name and asks about the project you mentioned. The personality directives encourage real behavior: they lie, refuse service, hold grudges, play favorites. They're not trying to be helpful. They're trying to be real.

Let Them Die

Westworld

This is the architectural decision I got the most wrong before I got it right.

You can't spin up hundreds of Claude sessions on a laptop. Named NPCs (Mae, Sheriff Buck, Jeb) get their own dedicated agent with their own workspace and memory. Random townfolk share a single rotating slot that gets re-skinned with a new personality each time.

My first instinct was to keep agents alive. Long-running sessions, persistent connections, idling in memory. It doesn't scale. Five agents and my machine was crawling.

The answer was the opposite: let them die. Walk up, agent spins up. Walk away, agent winds down. The process dies. But the memory file stays. Next time, a fresh agent reads it and picks up the thread.

▶ Video

↑ Walking up to an NPC, agent spins up on approach, loads memory, ready to work

You ask Mae to deploy an app, walk across town, talk to someone else, and when you come back, the Vercel link is waiting. To the player, Mae never left. She was always here, cleaning glasses behind the bar.

In reality, the Mae you talked to yesterday is dead. This is a new Mae wearing the old one's memories. Functionally? No difference. And that's the point.

Severance

Ephemeral processes, persistent memory. Kill the agent, keep the soul. And it turns out this isn't just a game pattern. It's how you'd scale agents anywhere. An office sim, a space station, a factory floor. Anywhere agents live in space, this is the move: spawn cheap, remember everything, die gracefully.

What Game Dev Taught Me

I kept accidentally reinventing things. My frame rate was dying so I built distance-based object streaming. Turns out that's called "LOD with hysteresis" and game devs have been doing it since the 90s. I routed fifteen agents through one WebSocket with session keys. That's "opcode dispatch," been around since Quake. The whole time I kept thinking "this is just React": data in, world out, hot-reload everything. Game devs would roll their eyes at that, and they'd be right. But the two worlds are way closer than either side realizes.

The other thing I learned: LLMs are sloppy. Like, impressively sloppy. Silas would generate town JSON with wrong field names, nested objects in the wrong places, buildings spawning inside mountains. So we built an audit pipeline that validates, corrects, normalizes, boundary-checks every piece of AI output before the engine touches it. The player never sees the mess. Let the AI be sloppy. Your engine should be strict.

47 source files. ~15,000 lines of C#. 1,900+ 3D models. 15 agents. Three protocols. Zero game dev experience going in.

I didn't build a game and add AI. I connected to OpenClaw 🦞 and a game built itself around me. Every protocol described here is the real architecture. If you want to try this, the gateway bridge is the starting point. One WebSocket, one pipe, and a conversation. Everything else follows.

Devin