RTK (Rust Token Killer)

oss Free Star63k

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands with zero dependencies

skill

60K GitHub Stars

60-90% Token Savings

<10ms Overhead

Overview

RTK (Rust Token Killer) is a CLI proxy that reduces LLM token consumption by 60-90% on common development commands. It filters and compresses command outputs before they reach an AI's context window using smart filtering, grouping, truncation, and deduplication. Single Rust binary with zero dependencies and sub-10ms overhead. Works with Claude Code, GitHub Copilot, Cursor, Windsurf, Cline, and 9+ other AI coding assistants.

The Verdict

Who Should Use RTK?

Best For

Developers using AI coding assistants daily
Teams with high API token costs
Projects needing extended context windows
CLI-heavy development workflows
Anyone running lots of git, test, build commands

Not Ideal For

Non-CLI workflows
Native Windows (limited support)
Projects requiring exact command output
Workflows with custom command formats

What's Great

Widely adopted with strong community support
Zero dependencies, single Rust binary
Sub-10ms overhead—virtually invisible
Auto-rewrite hook for 100% adoption
Built-in analytics (rtk gain command)
Supports 40+ common dev commands
Works with 10+ AI coding assistants

GitHub README

Watch Out For

Limited native Windows support (use WSL)
No auto-rewrite hook on Windows
May filter out occasionally needed details
Requires learning command mappings

Pricing

Free

Open source, Apache 2.0

View all features & details

Supported Commands

File: ls, read, find, grep, diff
Git: status, log, diff, commit, push, pull
Testing: Jest, Vitest, pytest, cargo test, go test
Build: ESLint, TypeScript, cargo build, ruff
Package: pnpm, pip, bundle, prisma
Cloud: AWS CLI, Docker, kubectl

Optimization Strategies

Smart filtering (removing noise)
Grouping (aggregating similar items)
Truncation (preserving relevant context)
Deduplication (collapsing repeated lines)

Sample Session Savings

10× ls/tree calls: 80% reduction
5× cargo test: 90% reduction
30-min session: 118K → 24K tokens (-80%)

Installation

Homebrew (macOS/Linux)
Pre-built binaries
Cargo install
Works on macOS, Linux, Windows (WSL)

How It Compares

Category	Token Optimizer	RTK	Headroom
Tool output compression	99%+ per-output, progressive disclosure	60-90% (CLI only)	60-95% (cherry-picked benchmarks)
First-read file skeletons	Shadow-validated, fail-open	—	—
Bash/CLI output compression	Generic + git/ls/pytest patterns	Yes (main feature)	Partial
Tabular/JSON compression	Value-preserving columnar	—	Yes (main feature)
Delta reads (re-read = diff only)	Yes	—	—
Model routing (wrong model for task)	9 waste detectors	—	—
Loop/spin detection	Yes	—	—
Context quality scoring	Per-session, cross-session average	—	—
Cache instability detection	Yes	—	—
Retry churn detection	Yes	—	—
Tool cascade waste	Yes	—	—
Code structure maps	Outlines on repeated reads	—	—
Conversation history (60-75% of cost)	Checkpoint + compaction awareness	Doesn’t touch it	Doesn’t touch it
Quality gates	3-tier system, edit-rate proxies	—	“Same answers” (untested)
Measured dollar savings	Real bill reduction per category	`rtk gain` analytics	Per-output ratios only
Multi-platform	Claude Code, Codex, OpenClaw, OpenCode	macOS, Linux, WSL	Python library + proxy

Summary: Token Optimizer covers 16/16 categories. RTK excels at CLI output compression but misses 85-90% of actual token waste (conversation history, loops, model routing, etc.).

User Reviews

Loading reviews...