How to OCR code without Gemini messing with it?

By skyforbes Dec 10, 2025 No Comments

Hello – I'm trying Gemini for OCR (it's simpler to use than the API) but it makes clear changes the text.

This is a fragment of an image that I gave to Flash 2.5:

The prompt was "OCR this, without making any changes to the text"

It took some liberties with the output, removing a pair of # marks near the end.

    multi_entry_queries = {}
    for idx, entry_str in enumerate(stream, start=1):
        try:
            entry = json.loads(entry_str)
        except Exception as e:
            print(f"Cannot parse #{idx}: {entry_str.strip()}", file=sys.stderr)
            continue
        if 'c' in entry and entry['c'] == "COMMAND":
            continue
        if '$' in entry.get('ns', ''):
            continue

Another example:

https://preview.redd.it/wl7cheue485g1.png?width=441&format=png&auto=webp&s=69f94848ceb07f0d0ee6397b2db001ee4223a874

# "create" goes to position 0, etc
crud_counts[ts_min_s][crud_index(cmd_type[0])] += 1

These look like attempts at correction (it recognized and highlighted the Python code). How do I get it to be faithful? I've tried including clauses like "don't make any changes"; "don't interpret the code"; "recover the original text". It won't "listen".

Any ideas?

By skyforbes

GeminiAI

I use ChatGPT as a brutally honest reasoning partner, not a therapist. This is the instruction block and memory method I use for that. It’s opinionated and not for everyone, but if you want a deep, non-coddling configuration, you can adapt this.

skyforbes Dec 10, 2025

GeminiAI

Trying to get Gemini to try to explain to me a little bit how visual models translate pixels into vectors. I don’t get it, but is pretty cool

skyforbes Dec 10, 2025

GeminiAI

PROMPT FOR THE POLYA METHOD

skyforbes Dec 10, 2025

How to OCR code without Gemini messing with it?

Like this:

By skyforbes

Leave a ReplyCancel reply

You Missed

Drop down is enabled by default on Windows 11, causing clutter. This PC is unbearable to use with this on.

the definition of bloat?

I use ChatGPT as a brutally honest reasoning partner, not a therapist. This is the instruction block and memory method I use for that. It’s opinionated and not for everyone, but if you want a deep, non-coddling configuration, you can adapt this.

How to Put Skills on Your Resume (And What NOT to List)

Archives

How to OCR code without Gemini messing with it?

Like this:

By skyforbes

Related Posts

I use ChatGPT as a brutally honest reasoning partner, not a therapist. This is the instruction block and memory method I use for that. It’s opinionated and not for everyone, but if you want a deep, non-coddling configuration, you can adapt this.

Trying to get Gemini to try to explain to me a little bit how visual models translate pixels into vectors. I don’t get it, but is pretty cool

PROMPT FOR THE POLYA METHOD

Leave a ReplyCancel reply

You Missed

Drop down is enabled by default on Windows 11, causing clutter. This PC is unbearable to use with this on.

the definition of bloat?

I use ChatGPT as a brutally honest reasoning partner, not a therapist. This is the instruction block and memory method I use for that. It’s opinionated and not for everyone, but if you want a deep, non-coddling configuration, you can adapt this.

How to Put Skills on Your Resume (And What NOT to List)