Autonomous agent: Miracle or baby?

AI Agents aren’t miracles, they are babies. High-maintenance and fragile. Building Hermes proved that real autonomy does not have a shortcut.

The marketing industry currently views autonomous agents as a miracle.

We are told that agents will soon replace entire workflows, making decisions and executing tasks while we sleep. Earlier this year, the noise around OpenClaw was deafening—it was the “breakthrough” that promised to put an agent in every chat app.

When Hermes was released, I decided to take a swing at it myself. I wanted to see if a self-improving agent on my own server could live up to the promise.

What I found was that an agent is not a miracle. It is a baby.

It is high-maintenance, prone to silent failures, and entirely dependent on the environment you build for it. For a long time, my “baby” refused to even crawl.

The expectation gap

There are incredible agents out there, built by world-class engineers, that perform tasks with great precision. My expectations were set by those polished demos and the OpenClaw headlines. I expected a self-correcting partner. I got a system that required constant “parenting”.

The failure wasn’t in the AI. It was in my ability to build its home.

My lack of back-end experience meant that every time Hermes “cried,” I didn’t know if it was hungry (lack of data), cold (broken environment variables), or just having a tantrum (a volume mount mismatch in Docker).

The friction of DIY

When you build your own agent from scratch, you realize that “autonomy” is a structural illusion.

In my first attempts, Hermes was a disaster.

  • It initially ran as a systemd service that died every time the server rebooted.
  • It would connect to Discord but stay silent because I hadn’t enabled the right “intents”.
  • My choice of cheap models, made it operate like a 3-year old, not an experienced worker.

Building a “miracle” requires a level of knowledge I am yet to achieve. I realized that vibe coding isn’t a shortcut to skipping the fundamentals, it’s a way to accelerate your encounter with them.

From conversation to orchestration

The turning point came when I stopped trying to make Hermes “smart” and started making it useful.

I wanted a conversationalist - a partner to brainstorm with. But using a multi-step agent for simple chat is a waste of compute. It’s overkill in theory and my failure in reality.

The real value appeared when I reframed Hermes as an orchestration layer.

Hermes isn’t there to talk - it’s there to act. It’s a way to run Python scripts from my phone via Telegram. It’s a structured gateway that receives status updates and manages recurring tasks without me ever touching a terminal. It doesn’t just “chat”. It triggers. No miracle task completion, staying simple for now.

Performance is a design choice. You can hire the best “brains” in the world - silicon or human - but without a system to manage and guide them, you are simply babysitting.