
Kodwai is the first platform that scores how well you collaborate with AI coding agents (Claude Code, Cursor, Codex). Solve real challenges in your own terminal; the CLI captures your code, tests, git history, agent transcript, and time, then scores you across three axes: Direction, Outcome, and Lift, each citing its own evidence. Climb a public leaderboard, earn badges, and build a profile that shows how you engineer, not what you memorized. Free, bring your own agent.
Kodwai is the first platform that scores how well you collaborate with AI coding agents like Claude Code, Cursor, and Codex. Instead of testing what you memorized, it measures your ability to direct an agent through real coding challenges. You solve problems in your own terminal, and the CLI captures your code, tests, git history, agent transcript, and time. The platform then scores you across three axes—Direction, Outcome, and Lift—each backed by specific evidence. It's free, and you bring your own agent.
Kodwai provides ticket-sized problems across categories you actually ship in. You browse, pick one, and run a single CLI command that downloads the problem, starter files, and tests, inits a git repo, and starts the timer. No sandbox, no artificial constraints—just your own editor and agent.
Your session is scored on Direction (how well you steer the agent), Outcome (what actually shipped), and Lift (how much the agent improved over your baseline). Each axis lands with per-signal evidence, so you can see exactly why you scored the way you did.
Your scores place you on a public leaderboard where you can compare your vibe coding skills against other developers. Earn badges for your achievements and build a profile that shows how you engineer, not what you memorized.
Kodwai works with Claude Code, Cursor, and Codex. You choose the agent you're most comfortable with, and the platform scores your collaboration regardless of which tool you use.
"Passing tests is not skill. Kodwai reads the whole session, so the score rewards how you drive."
A careless one-shot prompt can make the test suite go green, but Kodwai catches the difference. It reads the entire session—your prompts, the agent's transcript, your verification steps, and how you recovered when the agent went confidently wrong. This means a developer who carefully decomposes a problem and verifies each step scores higher than someone who just ships a lucky prompt.
You're tired of LeetCode and whiteboard puzzles that an AI agent can clear in seconds. If you want a metric that actually reflects how you direct agents, catch hallucinations, and verify what ships, Kodwai gives you a score that proves your judgment. It's free, runs on your machine, and works with the agents you already use.
Other tools you might consider
Loading comments…
Maker
sleepyfox
Visit Website
kodwai.com
Project Info
Product Keywords