AI: Models in Hermes Agent

capote · June 25, 2026, 11:59am

In Hemres Agent, it is possible to assign agents specifically to
AUXILIARY TASKS models in addition to the main model (deepseek v4 pro).
I have now selected the following options—which are not permitted—based on cost considerations.
What do you think?
@Stll0 @oneitonitram

Vision Image analysis
opencode-go • kimi-k2.7-code

Web Extract Page summarization
openrouter • openrouter/owl-alpha

Compression Context compaction
deepseek • deepseek-v4-flash

Skills Hub Skill search
opencode-go • minimax-m3

Approval Smart auto-approve
Imstudio • 1lama-3.1-8b-instant

MCP MCP tool routing
opencode-go • minimax-m3

Title Gen Session titles
Imstudio • 1lama-3.1-8b-instant

Triage Specifier Kanban spec fleshing
openrouter • openrouter/owl-alpha

Kanban Decomposer Task decomposition
opencode-go• glm-5.2

Profile Describer Auto profile descriptions
openrouter • openrouter/owl-alpha

Stll0 · June 25, 2026, 12:35pm

I’m experimenting as well, I’m trying to use free nvidia nemotron for some task. It seems to work fine, but I can’t say how good it is

capote · June 25, 2026, 4:35pm

I hadn’t even had Nemotron on my radar!

This model was just recently released (in early June 2026) by NVIDIA. It’s a massive 550-billion-parameter model (with 55B active parameters per token via MoE) featuring a hybrid Mamba/Transformer architecture. Most importantly: It’s specifically trained for agents, deep logical reasoning, and orchestration.

Unlike “Owl-Alpha” (which is an anonymous stealth model with privacy concerns and high latency), Nemotron-3-Ultra comes from NVIDIA, is extremely high-performing (high throughput), and reliable.

Since this model is free, has a context size of one million tokens, and (unlike Owl-Alpha) offers massively intelligent reasoning for agent flows, it once again completely upends my previous “Smart Economy” table in an extremely positive way. I can now replace expensive OpenCode Go flat-rate models with the free NVIDIA model!

Thanks for the tip.

What I won’t replace:
Title Gen / Approval / Compression: Groq: Llama 3.1 8B (These tasks require millisecond response times for the UI. Although Nemotron is fast, nothing beats Groq’s LPU chips for these micro-tasks).

just my one:

oneitonitram · June 26, 2026, 8:05pm

how about mimo 2.5 pro, its same price as deepseek, and abit more capable