
Runsight is a YAML-first workflow engine designed specifically for AI agents, enabling developers to design, commit, run, and evaluate agent workflows with Git-native version control. Every workflow is stored as a YAML file in your repository, allowing you to branch, review, and merge changes just like any other code.
The platform offers real-time cost tracking per run with hard budget caps to prevent overspending, along with a built-in evaluation framework for assertions and regression testing.
"Ship agents like you ship code."
| Feature | Benefit |
|---|---|
| Canvas + YAML Editor | Dual visual and code views |
| Per-run Cost Tracking | Monitor spending to the cent |
| Git Integration | Version control for workflows |
It's completely self-hosted, runs on your machine with your API keys, and is 100% open source under Apache 2.0 license.
Runsight is a YAML-first workflow engine built specifically for AI agents. It lets developers design, commit, run, and evaluate agent workflows with Git-native version control. Every workflow is stored as a plain YAML file in your repository, so you can branch, review, and merge agent changes just like any other code. The platform runs entirely on your machine with your own API keys, is 100% open source under the Apache 2.0 license, and requires no cloud account or signup to get started.
Two views of the same workflow state. Edit YAML in the Monaco editor and the canvas updates automatically. Move nodes on the canvas and the YAML stays clean. Commit the YAML to Git like any other code, with full diff support for code reviews.
Every block and every run is tracked to the cent. Set a max_cost and max_tokens budget that kills execution before overspend occurs. No more surprise bills after batch runs β you see exactly what each step costs in real time.
Assertions on every block output let you validate results programmatically. Transform hooks enable structured extraction, and regression testing across runs ensures your agents behave consistently as workflows evolve.
Stop a running agent before it wastes your budget. Pause execution, inspect the current state, then either resume or kill the run. This gives you fine-grained control over long-running or expensive agent tasks.
"Ship agents like you ship code."
This philosophy is the core of Runsight. By treating workflows as Git-diffable YAML files, the platform brings software engineering best practices β branching, code review, version history β into the world of AI agent development. If Runsight disappears tomorrow, you still have readable, executable workflow configs. No vendor lock-in, no proprietary formats, no cloud dependency.
You're building AI agent workflows and want version control, cost transparency, and evaluation built into your toolchain from day one. Runsight is especially valuable if you're tired of debugging agents with print statements, managing scattered Python orchestration scripts, or getting surprised by API costs after batch runs. The one-command startup (uvx runsight) and self-hosted nature make it trivial to try β no signup, no cloud account, just your machine and your API keys.
Other tools you might consider
Loading commentsβ¦
Maker
Michael Rogov
Visit Website
runsight.ai
Project Info
Product Keywords
Alternatives