Nous Research Ships Hermes Agent v0.14 Foundation Release as NVIDIA Calls It the Most-Used Agent in the World on OpenRouter

Overview

Nous Research shipped Hermes Agent v0.14.0 — the Foundation Release on May 16, bundling 808 commits, 633 merged pull requests, and 215 community contributors into a build that finally lets users install the open-source agent with one pip command, run it natively on Windows, and turn any OAuth-authenticated Claude Pro, ChatGPT Pro, or SuperGrok subscription into a local OpenAI-compatible endpoint. The release arrives three days after NVIDIA’s May 13 blog post described Hermes as “the most used agent in the world according to OpenRouter” and noted the repository had “crossed 140,000 GitHub stars in under three months” by the time of writing.

What We Know

What Hermes Agent actually is

Hermes Agent is described by Hugging Face’s documentation as “an open-source AI agent CLI by Nous Research for coding, research, and development tasks in the terminal” with native support for Hugging Face’s Inference Providers giving it access to hundreds of open models from 17+ providers through one interface. The project’s GitHub page characterizes it as the agent that “creates skills from experience, improves them during use” and that runs on infrastructure ranging from a low-cost VPS to a GPU cluster, with seven supported terminal backends (local, Docker, SSH, Singularity, Modal, Daytona, Vercel) and access to 200+ models via OpenRouter and other providers. It is released under the MIT License.

The Tenacity release that anchored the ranking

The story behind the v0.14 release begins with v0.13.0 — “The Tenacity Release” — shipped May 7, which Nous Research’s release notes describe as the build where “Hermes Agent now finishes what it starts.” The release packed 864 commits, 588 merged pull requests, and 295 community contributors, the release notes say, and introduced multi-agent Kanban for delegating durable tasks, a /goal primitive that locks the agent onto a target across turns, and Google Chat as the 20th supported messaging platform. The same release closed eight P0 security issues, the release notes add, including a guild-scoped fix for a Discord role-allowlist bug previously scoring CVSS 8.1 as a cross-guild DM bypass.

What v0.14.0 “Foundation” ships

The May 16 v0.14.0 release is framed as a packaging and platform-reach milestone rather than a feature blitz. The release notes summarize the build as “808 commits · 633 merged PRs · 1393 files changed · 165,061 insertions · 545 issues closed (12 P0, 50 P1) · 215 community contributors” and pin its theme as “Hermes installs and runs anywhere, ships with the things you actually want to use, and stops shipping the things you don’t.”

Four headline items stand out:

PyPI distribution. Until v0.14.0, installing Hermes required cloning the repo or running a shell installer. The new release announces that “pip install hermes-agent works from PyPI.” The release notes add that the wheel bundles the terminal UI and shell launcher.
OpenAI-compatible local proxy. A new hermes proxy command exposes a http://localhost:port endpoint that speaks the OpenAI API and is backed by whichever OAuth provider the user is signed into — Claude Pro, ChatGPT Pro, or SuperGrok. The release notes state that downstream tools like “Codex CLI, Aider, Cline, Continue, your custom scripts” can then hit the user’s existing subscription without an API key.
xAI Grok via SuperGrok OAuth. Grok lands as a SuperGrok-authenticated provider, and grok-4.3 is wired in with a 1M-token context window, the release notes confirm.
Native Windows beta. Hermes now runs on cmd.exe and PowerShell without WSL — a “first-class native Windows path across CLI / gateway / TUI / tools” per Nous Research, though the release notes flag it as early beta.

The Foundation release also lands a debloating wave that moves heavyweight messaging adapters and media SDKs to lazy install, a 180x speedup on browser console evaluations by reusing one persistent Chrome DevTools connection, per the release notes, and a cold-start wave that shaves roughly 19 seconds off the hermes launch path. The release notes also say the hermes tools All-Platforms screen dropped “from 14 seconds to under 1.5 seconds.” Two new messaging platforms — LINE and SimpleX Chat — bring the total to 22.

The self-improvement loop

The technical narrative driving Hermes is that the agent learns from its own past work. NVIDIA’s blog post describes the mechanism plainly: “Hermes writes and refines its own skills. Every time the agent encounters a complex task or receives feedback, it saves its learnings as a skill so it can adapt and improve over time.” That is how NVIDIA describes the mechanism. The blog frames Hermes as an exemplar of self-improving agents on NVIDIA RTX PCs, RTX PRO workstations, and the company’s DGX Spark personal-AI workstation.

What We Don’t Know

How widely the OpenRouter ranking will translate into measured productivity gains. NVIDIA’s blog confirms Hermes is the most-used agent on OpenRouter at the time of writing, but the post stops short of providing independently verified benchmarks for the skill-learning loop. The user-reported productivity claims circulating in third-party coverage do not appear in either Nous Research’s release notes or NVIDIA’s published account.
Whether the OpenAI-compatible local proxy will draw a response from Anthropic, OpenAI, or xAI. Routing Claude Pro, ChatGPT Pro, or SuperGrok consumer subscriptions through a local proxy and feeding them into third-party code-agent CLIs is exactly the workflow each provider has historically governed through API terms. The release ships the capability; none of the surveyed primary sources document how the providers will respond.
How durable Hermes’s lead is. OpenRouter daily rankings can swing significantly week to week, and the primary sources surveyed here do not publish OpenRouter daily totals for the week of the v0.14.0 release.

Analysis

Hermes Agent’s rise on OpenRouter and the back-to-back v0.13 and v0.14 releases together mark a real inflection point for self-improving open-source agents. The Tenacity release closed the durability gap — multi-agent Kanban, /goal, security wave — that had kept users tethered to managed enterprise agents. The Foundation release closes the distribution gap: one pip install away, with a local proxy that turns any consumer LLM subscription into a programmable endpoint.

That posture is a different bet from the cloud-only agent stacks the major model providers have been pushing throughout 2026. The structural critique of self-improving agents — that skills which embed a model’s idiosyncrasies become harder to audit as they accumulate — remains. But what v0.14 demonstrates is that the conversation in the agent space is no longer about which assistant has the most connectors or the longest context window. It is about which one remembers what it did yesterday.