Anyone else go through 5+ prompt versions and completely forget what actually changed?


I’ve got like 10 different versions of the same prompt in my Notes app because I keep tweaking it and saving a “just in case this one is better” copy.

Then I later look at them and have no clue what actually shifted between v2 and v4. Did I remove the example I/O? Did I add more context? Delete something?

I’d end up opening them side-by-side and manually scanning for differences.

So I made BestDiff – paste two prompts and instantly see what changed.

What it does:

  • Drop in prompt v1 and v2 → instant visual diff, track-changes style
  • Detects every word/punctuation change (compare runs at word/character level)
  • Spots moved text too
  • “Copy for LLM” button formats edits as {++added++} / {–removed–} so you can throw it back into ChatGPT and ask “which version works better?”
  • Works fully offline (100% private, nothing leaves your device)

When I actually use it:

  • Testing whether extra examples/context improved results
  • Comparing “short/concise” vs. “long/detailed” versions
  • Checking what I changed when I revert to an older draft
  • Seeing differences between prompts that performed well vs. poorly

Would love any feedback on what would make it more helpful for prompt-testing workflows !

Leave a Reply