Files

T

History

sleepy 1a1522266c Final batch of structure unification

Restructured to unified template:
- hermes/feedback/localllm/gemma-models-feedback.md
- hermes/feedback/frontier/openai-gpt-feedback.md

All key feedback files now follow FEEDBACK_TEMPLATE.md structure

2026-04-09 16:16:15 +02:00

frontier

Final batch of structure unification

2026-04-09 16:16:15 +02:00

general

Initial commit: coding harness feedback analysis

2026-04-09 15:13:45 +02:00

localllm

Final batch of structure unification

2026-04-09 16:16:15 +02:00

README.md

Initial commit: coding harness feedback analysis

2026-04-09 15:13:45 +02:00

README.md

Hermes Agent Feedback Collection

Last Updated: 2026-04-09
Purpose: Community feedback and performance data for the hermes-agent harness

Folder Structure

feedback/
├── localllm/          # Community feedback for local/smaller models
├── frontier/          # Community feedback for frontier models
├── general/           # General feature feedback, issues, benchmarks
└── README.md          # This file

Feedback Format

Each feedback file includes:

Model/Feature used (name, size, provider)
Benchmark results or task performance
Issues encountered
What worked well
Source reference: URL or site where feedback came from

Local/Small Models

File	Topic
localllm/qwen-models-feedback.md	Qwen 3.5 performance
localllm/gemma-models-feedback.md	Gemma 4 comparison
localllm/local-setup-issues.md	Setup challenges
localllm/general-local-llm-feedback.md	Overview

Frontier Models

File	Topic
frontier/claude-sonnet-feedback.md	Claude performance
frontier/openai-gpt-feedback.md	OpenAI integration
frontier/budget-providers-feedback.md	Kimi, DeepSeek, MiniMax
frontier/general-frontier-feedback.md	Overview

General

File	Topic
general/bug-reports-and-issues.md	Known issues
general/feature-feedback.md	Features & UX
general/terminal-bench-benchmarks.md	Benchmarks

Key Findings Summary

Token Overhead (All Models)

Critical: 73% of every API call is fixed overhead (~13.9K tokens)

Component	Tokens
Tool definitions (31 tools)	8,759
System prompt	5,176
Fixed overhead	~13,935

Impact: Even simple queries can cost 15K-20K tokens

Best Local Models

Model	VRAM	Rating
Qwen 3.5 27B	24GB	⭐⭐⭐⭐⭐
Qwen 3.5 14B	16GB	⭐⭐⭐⭐
Qwen 3.5 8B	8GB	⭐⭐⭐

Cost-Effective Providers

Provider	Cache Discount	Use Case
DeepSeek	90%	Maximum savings
Kimi K2.5	75%	Daily driver
MiniMax	None	Fast, capable

Critical Issues

Issue	Severity	Status
#4146 - Sandbox bypass	Critical	Open
#1071 - llama-server compatibility	Critical	Fix ready
#4469 - Message queue bug	High	Open

Contributing

To add feedback:

Create a new file in appropriate folder
Follow the feedback format
Include source URLs
Update this README if needed

External Resources

GitHub: https://github.com/NousResearch/hermes-agent
Docs: https://hermes-agent.nousresearch.com/
Discord: Community discussions
Reddit: r/LocalLLaMA, r/LocalLLM

README.md

Hermes Agent Feedback Collection

Folder Structure

Feedback Format

Quick Navigation

Local/Small Models

Frontier Models

General

Key Findings Summary

Token Overhead (All Models)

Best Local Models

Cost-Effective Providers

Critical Issues

Contributing

External Resources