How I Run My
One-Man Company
with Hermes Agent

Auxten Wang · Technical Director @ ClickHouse

PyCon Singapore 2026 · The Agent Track

Auxten Wang

@auxten · auxten.com

🏢 Technical Director @ ClickHouse
🚀 Creator of chDB — acquired by ClickHouse 2024
🧬 Maintaining a pile of side projects… alone
Ex-Shopee · CovenantSQL · Baidu · Qihoo 360

How We Got Here

Four years of AI coding, compressed — and the tools keep converging.

2022 ChatGPT. Code completion — describe a function, copy, paste.
2024 Cursor + Claude Code. The AI moves inside the editor — reads your repo, runs your tests.
2025 CLI agents: Claude Code, Codex, OpenClaw — test → fix → review → benchmark, in the terminal.
2025 Then they grew a clock. Codex & Claude Code add scheduled tasks; OpenClaw runs always-on — every tool starts to look the same: memory, cron, autonomy.
2026 Autonomous, always-on agents. Not "can it code?" but "can it run without me, 24/7?"

From Copilot to Coworker

Same models underneath. Completely different relationship.

IDE Copilot

You drive, every keystroke
Lives in your editor session
Forgets everything when the tab closes
Stops the moment you close the laptop

Autonomous Agent

Lives on a server, runs 24/7
Persistent memory across sessions
Reaches you on chat — Discord, Telegram…
Learns its own skills, gets better over time

Think of it as hiring an ops teammate, not buying a faster autocomplete.

They All Converged — and Hermes Leads

Code completion → Cursor → Claude Code → OpenClaw → cron-driven CLI agents — all grew memory, schedules, autonomy. By daily token volume, Hermes Agent now tops the leaderboard.

What is Hermes Agent?

Open-source autonomous agent by Nous Research · MIT · shipped Feb 2026 · ~195k★ on GitHub

🏠Self-hostedRuns on a $5 VPS or your Mac mini. Your data stays with you.

📱Talk to it anywhereOne gateway to Discord, Telegram, Slack, WhatsApp…

🧠Persistent memoryMulti-layer memory carries context across every session.

🔨Self-improvingBuilt-in learning loop: writes its own reusable skills.

🔒Sandboxed executionRuns commands in Docker / SSH / serverless backends.

⏱Always onCron, background jobs, long-running tasks — no babysitting.

Not a chatbot. Not a copilot. An agent that lives on your machine and gets smarter every day.

My Use Stack

Deliberately boring. One chat app in, the agent does the rest.

💬Discordphone + desktop

⇄

⚡Hermes Agenton Mac mini · 24/7

→

🧠Personal Codexmy own sub · ~$200/mo

↓ failover

🔁MiniMax M3via OpenRouter · backup

tmux foreman MCP tools imported skills background cron ClickMem memory

No new SaaS, no per-seat fees. Recurring cost: ~$200/mo — the Codex subscription I already pay for.

Migrating My Brain in One Sentence

A year of teaching Claude Code, Codex & OpenClaw my skills and conventions. Moving it all to Hermes took… one message.

general

Auxten May 14

Learn all the skills I defined over on OpenClaw.

Hermes BOT May 14

Done. Synced 27 custom skills from ~/.openclaw/skills/ to ~/.hermes/skills/openclaw-imports/.
Verified: source 27 → target, no mismatch; skill_view loads fine. New session: /reload-skills and you're set.

It also reads my CLAUDE.md / AGENTS.md — the conventions I taught Cursor and Codex just carry over.

Why I Trust It — #1

It never silently drops a task

Instant ack. The second I send a task, it drops a ✅ on my message — before any LLM call. Backend model flaky? Doesn't matter; I know it landed.
Closed loop. Every message gets a final result. Success or failure, it always comes back and tells me.
No more "did it even get my message?" No more staring at a dead chat.

general

Auxten 08:00

Check the online status of my apps.

✅ received · instant

Hermes BOT 08:17

Verdict: nothing critical. Checked Langfuse / GA4 / RevenueCat / ASA… full report ↓

17 minutes later — it came back, like it always does.

Why I Trust It — #2

Background cron that actually runs my ops

Schedule	What it does (and reports to Discord)
Sun 08:00–09:30	Weekly ad-campaign optimization across my appsper-app keyword/bid tuning, then one roll-up report
Daily 08:00	Cross-app anomaly inspectioninstalls / trials / revenue / LLM errors — read-only, pings me only if something's off
Daily 10:00	Emby "pick of the day"one title, tuned to my taste
Every 14 days	App Store keyword + metadata refresh, then ship a new version
Sat 15:00	Download the week's new anime episodes to my NAS

I optimize ad spend every week and inspect every online task every day — without lifting a finger.

Why I Trust It — #3

Hermes as my foreman (监工)

⚡Hermescc-supervisor

↓ drives tmux

Claude Codepane 0:0

Codexpane 0:1

summarize output chase progress catch fake tests

dev-ops

Auxten

Be a foreman: open a tmux and manage my pile of Claude Code sessions — summarize key output, chase progress, and quality-review (catch laziness / hallucinations / fake tests).

Hermes BOT

Spun up cc-supervisor; the dashboard refreshes every pane every 30s. Pane 0:0 (Claude Code) is idle at the prompt with commit typed but not run — I won't press Enter to commit on my own unless you explicitly ask.

An agent… supervising other agents. And it knows where to stop and ask.

Use Case 1: Analyzing My Own Apps

I ship my own apps solo. Ads, users, revenue, releases, crashes — scattered across a pile of dashboards. So I stopped opening dashboards.

Ad network Revenue Analytics Attribution App DB LLM traces

→

⚡Hermes+ my ad-ops skills

→

💬One reportin Discord

Ad spend, user activity, releases — one agent, one place, plain language.

It Doesn't Just Dump Numbers

app-ads

Hermes BOT Sun 09:30

Daily ASA report
Yesterday: spend $106.63 · installs 32 · CPI $3.33
Last 7d: spend $440.16 · installs 104 · CPI $4.23
Since re-enabling: 6/14 2 → 6/15 27 → 6/16 32 installs — clear recovery ✅

Weekly ad optimization, per app

daily-watch

Hermes BOT 08:17

Verdict: nothing critical, but 3 medium anomalies.
One app's GA4 engagedSessions yesterday 10, 7-day avg ≈ 35.8, ↓ ~72%; but sessions / users / events all normal → likely a tracking artifact, not the app being down.

Daily anomaly inspection

It tells me whether to panic — and usually, not to.

Use Case 2: What Should I Watch Tonight?

Not everything has to be work. I taught it my taste once…

Breaking Bad Homeland Game of Thrones Westworld House of Cards Black Mirror Severance Fargo Silicon Valley EVA A Mortal's Journey

…and every day at 10am it gives me one pick from my own Emby server — with a reason, and never something I've already seen.

One Pick a Day, With Taste

emby-daily

Hermes BOT

The Wire
Peels back the drug trade, the police, politicians and the press, layer by layer — the criminal ecosystem of Breaking Bad + the power games of House of Cards. S01E01 is on your Emby.

Hermes BOT

Ozark
An ordinary man swallowed by a criminal system, with Billions-style money games — colder and more oppressive.

emby-daily

Auxten

Watched Billions — loved it.

Hermes BOT

Got it — marked Billions as "watched & liked." I won't recommend it again, but I'll use it as a taste signal to find similar ones.

Auxten

Stop recommending Day of the Jackal.

Hermes BOT

Noted. Updated three places: the daily-rec exclusion list + the Emby skill's watched list + your screen-taste memory.

Bonus: every Saturday it grabs the latest episodes of my favorite cultivation anime and uploads them to my NAS. A feedback loop, for my couch.

First, the Memory Trap

Build on my laptop (Cursor / Claude Code / Codex) → hand off to Hermes on the mini. Context evaporates.

The obvious fix: "just remember everything." That's the trap.

Auto-capture pollutes: a one-week vegetarian trip becomes my "permanent diet"; a Firebase idea I dropped months ago still haunts every design.

Bad memory is worse than no memory.

ClickMem — memory with a lifecycle

Remember what matters, forget what stopped being true, never treat a passing detail as your identity. A safe, not a firehose.

rememberexplicit decision

→

recallsimilarity + filter

→

conflict?similar but different

→

revisebetter belief wins

Explicit-only. Two ways in: an agent commits a refined conclusion (clickmem_remember), or I import curated docs (AGENTS.md / CLAUDE.md / rules). Raw chat stays cold — never auto-injected.
Conflicts surfaced, not hidden. A new memory that contradicts an old one gets flagged for me to resolve — not silently stacked.

One Local Server, Zero Server-Side LLM

Recall = similarity + filter, scoped by project / privacy / source · chDB local, ClickHouse at scale.

The Button Software Couldn't Click

Hermes pushed an update → a pile of macOS permission dialogs.

macOS explicitly prevents software from clicking security dialogs.

The agent that controls my computer… can't click "Allow" to control my computer.

💡 What if I use hardware?

$182 Rock 5B + IP-KVM

USB HID emulation (keyboard + mouse)
HDMI input — real video capture
RK3588 hardware video decoding
Agent helps agent: mutual passwordless SSH — if one dies, the other revives it

Total system draw: 7 watts

Open-sourced as HandsOn — one MCP interface, many backends (macOS, Rock 5B, PiKVM, NanoKVM).

Submitting a PR to Itself

My agent found a bug — in its own platform's code.
Plugin reload → restart loop → gateway death.
It diagnosed root cause, wrote a one-line fix + 2 tests.
Opened the PR under its own GitHub account.
Three human reviewers approved it.

An AI agent. Finding a bug in the platform it runs on.
Submitting a patch. Approved by humans.

The One Rule

Treat your agent like a new hire. Give it its own email, its own user account, its own GitHub, its own machine.

Mine literally runs as a separate macOS user on the Mac mini. Blast-radius containment — if it does something destructive (and it will), it's in its own sandbox.

Also: use Time Machine.

The Stack Today

Daily dev: Cursor + Claude Code + Codex
24/7 agent: Hermes Agent on Mac mini
Comms: Discord (phone + desktop)
Backend LLM: my personal Codex sub + OpenRouter MiniMax M3 backup
Memory: ClickMem — chDB + Qwen3-Embedding-0.6B
Hardware: HandsOn IP-KVM (Rock 5B)

Recurring cost: ~$200/month — the Codex subscription I already pay for.

Five Things I'd Tell You

Infrastructure > prompts. — System engineering is the moat.
One sentence can migrate your whole brain. — Portable skills + memory beat any lock-in.
Bad memory is worse than no memory. — A real partner has to be able to forget.
Reliability earns trust. — Instant ack + a closed loop beat raw capability.
Treat it like a coworker, not a tool. — Containment = safety + freedom.

Keep Building,
Keep Fresh.

Hermes Agent · chDB · ClickMem · HandsOn

@auxten · auxten.com

How I Run MyOne-Man Companywith Hermes Agent

Auxten Wang

How We Got Here

From Copilot to Coworker

IDE Copilot

Autonomous Agent

They All Converged — and Hermes Leads

What is Hermes Agent?

My Use Stack

Migrating My Brain in One Sentence

Why I Trust It — #1

It never silently drops a task

Why I Trust It — #2

Background cron that actually runs my ops

Why I Trust It — #3

Hermes as my foreman (监工)

Use Case 1: Analyzing My Own Apps

It Doesn't Just Dump Numbers

Use Case 2: What Should I Watch Tonight?

One Pick a Day, With Taste

First, the Memory Trap

ClickMem — memory with a lifecycle

One Local Server, Zero Server-Side LLM

The Button Software Couldn't Click

$182 Rock 5B + IP-KVM

Submitting a PR to Itself

The One Rule

The Stack Today

Five Things I'd Tell You

Keep Building,Keep Fresh.

Q&A ⚡

How I Run My
One-Man Company
with Hermes Agent

Keep Building,
Keep Fresh.