How I Run My
One-Man Company
with Hermes Agent

Auxten Wang · Technical Director @ ClickHouse

PyCon Singapore 2026 · The Agent Track

Auxten Wang

@auxten · auxten.com

  • 🏢 Technical Director @ ClickHouse
  • 🚀 Creator of chDB — acquired by ClickHouse 2024
  • 🧬 Maintaining a pile of side projects… alone
  • 📝 Shipping 3 iOS apps as a one-man company
  • Ex-Shopee · CovenantSQL · Baidu · Qihoo 360

How We Got Here

Four years of AI coding, compressed. It moved absurdly fast.

  • 2022 ChatGPT. Describe a function, copy the answer, paste into your editor.
  • 2024 Cursor + Claude Code. The AI moves inside the editor — reads your repo, runs your tests.
  • 2025 CLI agents & multi-agent pipelines. Codex and Claude Code in the terminal: test → fix → review → benchmark.
  • 2026 Autonomous, always-on agents. The question stops being "can it code?" and becomes "can it run without me, 24/7?"

From Copilot to Coworker

Same models underneath. Completely different relationship.

IDE Copilot

  • You drive, every keystroke
  • Lives in your editor session
  • Forgets everything when the tab closes
  • Stops the moment you close the laptop

Autonomous Agent

  • Lives on a server, runs 24/7
  • Persistent memory across sessions
  • Reaches you on chat — Discord, Telegram…
  • Learns its own skills, gets better over time

Think of it as hiring an ops teammate, not buying a faster autocomplete.

What is Hermes Agent?

Open-source autonomous agent by Nous Research · MIT · shipped Feb 2026 · ~195k★ on GitHub

🏠Self-hostedRuns on a $5 VPS or your Mac mini. Your data stays with you.
📱Talk to it anywhereOne gateway to Discord, Telegram, Slack, WhatsApp…
🧠Persistent memoryMulti-layer memory carries context across every session.
🔨Self-improvingBuilt-in learning loop: writes its own reusable skills.
🔒Sandboxed executionRuns commands in Docker / SSH / serverless backends.
Always onCron, background jobs, long-running tasks — no babysitting.

Not a chatbot. Not a copilot. An agent that lives on your machine and gets smarter every day.

My Use Stack

Deliberately boring. One chat app in, the agent does the rest.

💬Discordphone + desktop
Hermes Agenton Mac mini · 24/7
🧠Personal Codexmy own sub · ~$200/mo
↓ failover
🔁MiniMax M3via OpenRouter · backup
tmux foreman MCP tools imported skills background cron ClickMem memory

No new SaaS, no per-seat fees. Recurring cost: ~$200/mo — the Codex subscription I already pay for.

Migrating My Brain in One Sentence

A year of teaching Claude Code, Codex & OpenClaw my skills and conventions. Moving it all to Hermes took… one message.

general
Auxten May 14
Learn all the skills I defined over on OpenClaw.
H
Hermes BOT May 14
Done. Synced 27 custom skills from ~/.openclaw/skills/ to ~/.hermes/skills/openclaw-imports/.
Verified: source 27 → target, no mismatch; skill_view loads fine. New session: /reload-skills and you're set.

It also reads my CLAUDE.md / AGENTS.md — the conventions I taught Cursor and Codex just carry over.

Why I Trust It — #1

It never silently drops a task

  • Instant ack. The second I send a task, it drops a ✅ on my message — before any LLM call. Backend model flaky? Doesn't matter; I know it landed.
  • Closed loop. Every message gets a final result. Success or failure, it always comes back and tells me.
  • No more "did it even get my message?" No more staring at a dead chat.
general
Auxten 08:00
Check the online status of my 3 apps.
✅ received · instant
H
Hermes BOT 08:17
Verdict: nothing critical. Checked Langfuse / GA4 / RevenueCat / ASA… full report ↓

17 minutes later — it came back, like it always does.

Why I Trust It — #2

Background cron that actually runs my ops

ScheduleWhat it does (and reports to Discord)
Sun 08:00–09:30Weekly Apple Search Ads optimization for 3 appsper-app keyword/bid tuning, then one roll-up report
Daily 08:00Cross-app anomaly inspectioninstalls / trials / revenue / LLM errors — read-only, pings me only if something's off
Daily 10:00Emby "pick of the day"one title, tuned to my taste
Every 14 daysApp Store keyword + metadata refresh, then ship a new version
Sat 15:00Download the week's new anime episodes to my NAS

I optimize ad spend every week and inspect every online task every day — without lifting a finger.

Why I Trust It — #3

Hermes as my foreman (监工)

Hermescc-supervisor
↓ drives tmux
Claude Codepane 0:0
Codexpane 0:1
summarize output chase progress catch fake tests
dev-ops
Auxten
Be a foreman: open a tmux and manage my pile of Claude Code sessions — summarize key output, chase progress, and quality-review (catch laziness / hallucinations / fake tests).
H
Hermes BOT
Spun up cc-supervisor; the dashboard refreshes every pane every 30s. Pane 0:0 (Claude Code) is idle at the prompt with commit typed but not run — I won't press Enter to commit on my own unless you explicitly ask.

An agent… supervising other agents. And it knows where to stop and ask.

Use Case 1: Analyzing My Own Apps

I ship 3 iOS apps solo. Ads, users, revenue, releases, crashes — scattered across 6 dashboards. So I stopped opening dashboards.

Apple Search Ads RevenueCat GA4 AppsFlyer D1 / Supabase Langfuse
Hermes+ my ad-ops skills
💬One reportin Discord

Ad spend, user activity, releases — one agent, one place, plain language.

It Doesn't Just Dump Numbers

app-ads
H
Hermes BOT Sun 09:30
Daily ASA report
Yesterday: spend $106.63 · installs 32 · CPI $3.33
Last 7d: spend $440.16 · installs 104 · CPI $4.23
Since re-enabling: 6/14 2 → 6/15 27 → 6/16 32 installs — clear recovery ✅

Weekly ad optimization, per app

daily-watch
H
Hermes BOT 08:17
Verdict: nothing critical, but 3 medium anomalies.
One app's GA4 engagedSessions yesterday 10, 7-day avg ≈ 35.8, ↓ ~72%; but sessions / users / events all normal → likely a tracking artifact, not the app being down.

Daily anomaly inspection

It tells me whether to panic — and usually, not to.

Use Case 2: What Should I Watch Tonight?

Not everything has to be work. I taught it my taste once…

Breaking Bad Homeland Game of Thrones Westworld House of Cards Black Mirror Severance Fargo Silicon Valley EVA A Mortal's Journey

…and every day at 10am it gives me one pick from my own Emby server — with a reason, and never something I've already seen.

One Pick a Day, With Taste

emby-daily
H
Hermes BOT
The Wire
Peels back the drug trade, the police, politicians and the press, layer by layer — the criminal ecosystem of Breaking Bad + the power games of House of Cards. S01E01 is on your Emby.
H
Hermes BOT
Ozark
An ordinary man swallowed by a criminal system, with Billions-style money games — colder and more oppressive.
emby-daily
Auxten
Watched Billions — loved it.
H
Hermes BOT
Got it — marked Billions as "watched & liked." I won't recommend it again, but I'll use it as a taste signal to find similar ones.
Auxten
Stop recommending Day of the Jackal.
H
Hermes BOT
Noted. Updated three places: the daily-rec exclusion list + the Emby skill's watched list + your screen-taste memory.

Bonus: every Saturday it grabs the latest episodes of my favorite cultivation anime and uploads them to my NAS. A feedback loop, for my couch.

First, the Memory Trap

Build on my laptop (Cursor / Claude Code / Codex) → hand off to Hermes on the mini. Context evaporates.

The obvious fix: "just remember everything." That's the trap.

Auto-capture pollutes: a one-week vegetarian trip becomes my "permanent diet"; a Firebase idea I dropped months ago still haunts every design.

Bad memory is worse than no memory.

ClickMem — a memory safe

Remember what matters, forget what stopped being true, never treat a passing detail as your identity. A safe, not a firehose.

Expandadd a belief
Revisereplace when better info arrives
Contractforget — with a reason
Reinforcepin as authoritative
Refuseblacklist, never store again
  • Explicit-only. Two ways in: an agent commits a refined conclusion (clickmem_remember), or I import curated docs (AGENTS.md / CLAUDE.md / rules). Raw chat stays cold — never auto-injected.
  • Conflicts surfaced, not hidden. A new memory that contradicts an old one gets flagged for me to resolve — not silently stacked.

One Local Server, Zero Server-Side LLM

Embedding-only recall, scoped by project / privacy / source · chDB or ClickHouse.

The Button Software Couldn't Click

Hermes pushed an update → a pile of macOS permission dialogs.

macOS explicitly prevents software from clicking security dialogs.

The agent that controls my computer… can't click "Allow" to control my computer.

💡 What if I use hardware?

$182 Rock 5B + IP-KVM

  • USB HID emulation (keyboard + mouse)
  • HDMI input — real video capture
  • RK3588 hardware video decoding
  • Agent helps agent: mutual passwordless SSH — if one dies, the other revives it

Total system draw: 7 watts

Open-sourced as HandsOn — one MCP interface, many backends (macOS, Rock 5B, PiKVM, NanoKVM).

Submitting a PR to Itself

  • My agent found a bug — in its own platform's code.
  • Plugin reload → restart loop → gateway death.
  • It diagnosed root cause, wrote a one-line fix + 2 tests.
  • Opened the PR under its own GitHub account.
  • Three human reviewers approved it.

An AI agent. Finding a bug in the platform it runs on.
Submitting a patch. Approved by humans.

The One Rule

Treat your agent like a new hire. Give it its own email, its own user account, its own GitHub, its own machine.

Mine literally runs as a separate macOS user on the Mac mini. Blast-radius containment — if it does something destructive (and it will), it's in its own sandbox.

Also: use Time Machine.

The Stack Today

  • Daily dev: Cursor + Claude Code + Codex
  • 24/7 agent: Hermes Agent on Mac mini
  • Comms: Discord (phone + desktop)
  • Backend LLM: my personal Codex sub + OpenRouter MiniMax M3 backup
  • Memory: ClickMem — chDB + Qwen3
  • Hardware: HandsOn IP-KVM (Rock 5B)

Recurring cost: ~$200/month — the Codex subscription I already pay for.

Five Things I'd Tell You

  1. Infrastructure > prompts. — System engineering is the moat.
  2. One sentence can migrate your whole brain. — Portable skills + memory beat any lock-in.
  3. Bad memory is worse than no memory. — A real partner has to be able to forget.
  4. Reliability earns trust. — Instant ack + a closed loop beat raw capability.
  5. Treat it like a coworker, not a tool. — Containment = safety + freedom.

Keep Building,
Keep Fresh.

Hermes Agent · chDB · ClickMem · HandsOn

@auxten · auxten.com

Q&A