Caveman
Caveman makes AI agents talk like a caveman — ~75% fewer output tokens with zero loss of technical accuracy. "why use many token when few do trick."
Works across 30+ AI coding agents: Claude Code, OpenCode, Codex, Cursor, Gemini, Copilot, Windsurf, and more.
- Repo: github.com/JuliusBrussee/caveman
- License: MIT
- npm:
caveman-shrink(MCP middleware)
Benefits
- ~65% fewer output tokens on average (range 22–87% across real-world benchmarks)
- ~46% fewer input tokens when compressing memory files (CLAUDE.md, project notes)
- No accuracy loss — brevity improved accuracy by 26 points in a March 2026 study
- Lifetime savings tracking with USD cost estimation
Quick Install
One-liner (auto-detects all agents):
curl -fsSL https://raw.githubusercontent.com/JuliusBrussee/caveman/main/install.sh | bash
Per-agent:
# Claude Code plugin
claude plugin marketplace add JuliusBrussee/caveman
claude plugin install caveman@caveman
# Codex CLI
npx skills add JuliusBrussee/caveman -a codex
# OpenCode
# Copy skill file manually or use the one-liner installer
# Cursor / Windsurf (always-on via rule file)
npx skills add JuliusBrussee/caveman -a cursor --with-init
Compression Levels
| Level | Command | Token Reduction | Style |
|---|---|---|---|
| Lite | /caveman lite | ~30% | Drops filler words |
| Full | /caveman | ~65% | Default caveman style |
| Ultra | /caveman ultra | ~75% | Telegraphic, minimal |
| Wenyan | /caveman wenyan | ~80% | Classical Chinese |
Turn off: Type normal mode or /caveman off.
Commands
/caveman # Activate full compression
/caveman ultra # Maximum compression
/caveman lite # Light mode
/caveman-commit # Generate ≤50char conventional commit
/caveman-review # One-line PR comments per change
/caveman-stats # Token savings: "143k saved. ~$1.46"
/caveman-stats --share # Tweetable stats
/caveman-compress CLAUDE.md # Rewrite memory file in caveman-speak
Cavecrew Subagents
Specialized subagents that talk in caveman by default (~60% fewer tokens than vanilla):
cavecrew-investigator # Research and investigation
cavecrew-builder # Implementation and coding
cavecrew-reviewer # Code review and QA
caveman-shrink (MCP Middleware)
Wraps any MCP server to compress tool descriptions — reduces input tokens from tool listings:
npm install -g caveman-shrink
# Configure as middleware in your MCP server chain
Auto-Activation
Claude Code, Codex, and Gemini auto-activate Caveman every session via hooks. For other agents:
# Always-on via init rule file
npx skills add JuliusBrussee/caveman -a cursor --with-init
# Per-session manual activation
/caveman
Ecosystem
| Tool | Purpose |
|---|---|
| caveman | Output compression (this) |
| caveman-code | Full terminal coding agent |
| cavemem | Cross-agent memory |
| cavekit | Spec-driven build loop |
| cavegemma | Gemma 4 31B fine-tuned on caveman pairs |