Token Optimizer

open-source Free Star1k

Context optimization plugin that identifies and eliminates wasted tokens across Claude Code, OpenCode, OpenClaw, and Codex environments while preserving work through compactions

skill

1.2K GitHub Stars

257 Passing Tests

$300-600 Monthly Savings

Overview

Token Optimizer is a context optimization plugin that identifies and eliminates wasted tokens across Claude Code, OpenCode, OpenClaw, and Codex environments while preserving work through compactions. It analyzes three categories of token waste: structural waste (bloated configuration files, unused skills, duplicate system prompts), runtime waste (verbose command output that floods context mid-session), and behavioral waste (habits like premature cache expiration and inefficient model selection). Zero runtime dependencies—pure Python stdlib or TypeScript depending on platform.

The Verdict

Who Should Use Token Optimizer?

Best For

Heavy API users processing high token volumes
Teams tracking AI coding costs
Developers experiencing context overflow
Multi-session workflows needing continuity
Those wanting visibility into token usage

Not Ideal For

Casual users with minimal API spend
Small projects under context limits
Commercial use (PolyForm Noncommercial)

What's Great

Zero runtime dependencies—pure stdlib
Live dashboard with per-turn breakdowns
Quality scoring with degradation detection
Survives compaction with checkpoint/restore
Subagent cost attribution
No telemetry—all local SQLite

GitHub README

Watch Out For

PolyForm Noncommercial license
Requires plugin marketplace install
Learning curve for optimization strategies

GitHub

Pricing

Open Source

Free

PolyForm Noncommercial License

View all features & details

Visibility & Measurement

Live dashboard tracking tokens & costs
Four pricing tier breakdowns
Per-turn cost analysis
Quality scoring (v6 dual-score)
Cache hit rate analysis
TTL distribution tracking

Session Continuity

Checkpoints before compaction
Critical decision restoration
Multi-session workflow support
Zero baseline context overhead

Optimization

Structural waste detection
Runtime waste reduction
Behavioral waste coaching
Quality nudges & loop detection

Platforms

Claude Code
OpenCode
OpenClaw
Codex
macOS, Linux, Windows

How It Compares

Category	Token Optimizer	Headroom	RTK
Tool output compression	99%+ per-output, progressive disclosure	60-95% (cherry-picked benchmarks)	60-90% (CLI only)
First-read file skeletons	Shadow-validated, fail-open	—	—
Bash/CLI output compression	Generic + git/ls/pytest patterns	Partial	Yes (main feature)
Tabular/JSON compression	Value-preserving columnar	Yes (main feature)	—
Delta reads (re-read = diff only)	Yes	—	—
Model routing (wrong model for task)	9 waste detectors	—	—
Loop/spin detection	Yes	—	—
Context quality scoring	Per-session, cross-session average	—	—
Cache instability detection	Yes	—	—
Retry churn detection	Yes	—	—
Tool cascade waste	Yes	—	—
Code structure maps	Outlines on repeated reads	—	—
Conversation history (60-75% of cost)	Checkpoint + compaction awareness	Doesn’t touch it	Doesn’t touch it
Quality gates	3-tier system, edit-rate proxies	“Same answers” (untested)	—
Measured dollar savings	Real bill reduction per category	Per-output ratios only	`rtk gain` analytics
Multi-platform	Claude Code, Codex, OpenClaw, OpenCode	Python library + proxy	macOS, Linux, WSL

Summary: Token Optimizer covers all 16 optimization categories. Headroom and RTK each specialize in one area (tool output compression) but miss conversation history (60-75% of cost), loop detection, model routing, and other major waste sources.

User Reviews

Loading reviews...