Why ReviewCue Exists
ReviewCue exists because AI code review needs a deterministic local packet before the model sees the diff: changed files, relevant context, risk cues, and reviewer questions.
reviewcue exists because AI code review has a context problem.
Most agent review workflows start with one of two bad defaults:
- give the model too much of the repo and hope it finds the right signal
- give the model a raw diff and hope it does not miss the surrounding risk
Neither default is good enough.
The first burns context and leaks more local state than necessary. The second is too thin. A diff without nearby tests, package files, docs, config, generated artifacts, and known risk cues is not a review packet. It is just a change.
reviewcue is an early-stage local-first CLI for building that packet before the review starts.
π
The review prompt should not be improvised from a raw diff. It should be compiled from local evidence.
That is the whole thesis.
The workflow pain
AI review agents can be useful. I use them. I want more of them in the workflow.
But the way context gets handed to them is often sloppy.
A human copies a diff into chat. An IDE plugin grabs whatever files it thinks matter. A hosted bot sees the pull request but not the local intent. A coding agent reviews its own work with the same context that produced the work in the first place.
That can catch obvious issues.
It can also miss the boring stuff that actually breaks teams:
- a package file changed but no install or packaging check ran
- a shell script gained a risky command
- a test file was not touched near a behavior change
- generated output got mixed into a human-authored diff
- docs changed without the command they document
- a secret-looking string appeared in a fixture
Those are review cues.
They do not require a model to notice. They require deterministic packaging before the model gets involved.
What ReviewCue is trying to become
The current repository is still in progress. The package metadata describes it plainly: a local-first review packet builder for git diffs.
The PRD is the useful statement of intent:
- read local git diffs and changed files
- detect related tests, docs, package metadata, and config files
- flag review cues like missing tests, secret-looking strings, broad dependency changes, generated files, and risky shell commands
- emit portable Markdown and JSON review packets
- work offline without model calls
The current implementation has the foundations: TypeScript CLI argument parsing, git diff collection, and a unified diff parser that turns changed files into structured hunks, line additions, deletions, status, binary flags, and rename metadata.
That sounds boring because it is the part that should be boring.
If you cannot parse the diff deterministically, everything above it becomes vibes.
The packet matters more than the reviewer
There is a tempting product mistake here: jump straight to the reviewer.
Pick a model. Write a prompt. Ask for security issues, bugs, missing tests, readability problems, and edge cases. Return a polished report.
That is useful until the input is wrong.
A good reviewer with the wrong packet still misses things. A cheaper reviewer with the right packet often catches the practical risks faster.
That is the same pattern I keep seeing across agentic engineering tools. The harness matters. The preflight matters. The proof matters. The prompt is only one part of the system.
reviewcue sits before the reviewer and asks a smaller question:
what should be in the review packet?
Not βwhat is the final judgment?β
Not βcan a model replace the reviewer?β
Just βwhat local evidence belongs next to this diff before anyone makes a decision?β
Local-first is the safety boundary
Review context is sensitive.
It can contain secrets, business logic, customer references, private file paths, experimental code, generated artifacts, or half-finished work. Sending all of it to a model because the packaging layer is lazy is a bad default.
Local-first tooling changes the posture.
reviewcue can inspect the repo, build a compact packet, and let the human decide what leaves the machine. Markdown is useful for human handoff. JSON is useful for another tool or agent. The important part is that the packet is explicit.
Raw review handoff
- βDiff copied into chat
- βContext chosen by memory
- βRisk cues depend on reviewer intuition
- βToo much or too little repo state leaves the machine
- βOutput is hard to reproduce
ReviewCue-style handoff
- βDiff parsed into structure
- βRelated context selected locally
- βRisk cues listed before model review
- βPacket can be inspected before sharing
- βMarkdown and JSON make handoff portable
This is not anti-cloud and not anti-model.
It is pro-boundary.
Models should review the packet you meant to send, not whatever context happened to be easiest to paste.
Why this belongs in the OSS stack
reviewcue fits a pattern across the rest of the stack.
taskbrief tries to make the task input explicit. proofdock tries to make the evidence output explicit. agent-qc tries to make the handoff quality explicit. runreceipt captures what actually ran. lockfilelens, actionpin, and guardrailmd each make one class of review risk easier to see.
reviewcue is the bridge into code review.
It does not need to know whether the next reviewer is Roger, Codex, Claude, Gemini, a GitHub bot, or a teammate. Its job is to package the change so the next reviewer starts closer to the truth.
That is the bigger agentic engineering thesis again:
do not ask the model to infer the operating system around the work.
Build the operating system around the model.
The deeper insight
Good review is not just a smart opinion.
Good review is the result of the right evidence arriving at the right moment in the right shape.
Agents can help with the opinion. They can summarize, question, compare, and notice patterns. But the evidence shape should be deterministic. The changed files, related tests, config deltas, package metadata, binary flags, generated artifacts, and risky strings should not depend on how charming the review prompt was.
That is why reviewcue is worth building even before it becomes impressive.
It moves review quality one layer earlier.
Before the model.
Before the comment.
Before the human burns attention reconstructing context that a local tool could have packaged.