this post was submitted on 06 Feb 2025
25 points (87.9% liked)

No Stupid Questions

36855 readers
811 users here now

No such thing. Ask away!

!nostupidquestions is a community dedicated to being helpful and answering each others' questions on various topics.

The rules for posting and commenting, besides the rules defined here for lemmy.world, are as follows:

Rules (interactive)


Rule 1- All posts must be legitimate questions. All post titles must include a question.

All posts must be legitimate questions, and all post titles must include a question. Questions that are joke or trolling questions, memes, song lyrics as title, etc. are not allowed here. See Rule 6 for all exceptions.



Rule 2- Your question subject cannot be illegal or NSFW material.

Your question subject cannot be illegal or NSFW material. You will be warned first, banned second.



Rule 3- Do not seek mental, medical and professional help here.

Do not seek mental, medical and professional help here. Breaking this rule will not get you or your post removed, but it will put you at risk, and possibly in danger.



Rule 4- No self promotion or upvote-farming of any kind.

That's it.



Rule 5- No baiting or sealioning or promoting an agenda.

Questions which, instead of being of an innocuous nature, are specifically intended (based on reports and in the opinion of our crack moderation team) to bait users into ideological wars on charged political topics will be removed and the authors warned - or banned - depending on severity.



Rule 6- Regarding META posts and joke questions.

Provided it is about the community itself, you may post non-question posts using the [META] tag on your post title.

On fridays, you are allowed to post meme and troll questions, on the condition that it's in text format only, and conforms with our other rules. These posts MUST include the [NSQ Friday] tag in their title.

If you post a serious question on friday and are looking only for legitimate answers, then please include the [Serious] tag on your post. Irrelevant replies will then be removed by moderators.



Rule 7- You can't intentionally annoy, mock, or harass other members.

If you intentionally annoy, mock, harass, or discriminate against any individual member, you will be removed.

Likewise, if you are a member, sympathiser or a resemblant of a movement that is known to largely hate, mock, discriminate against, and/or want to take lives of a group of people, and you were provably vocal about your hate, then you will be banned on sight.



Rule 8- All comments should try to stay relevant to their parent content.



Rule 9- Reposts from other platforms are not allowed.

Let everyone have their own content.



Rule 10- Majority of bots aren't allowed to participate here.



Credits

Our breathtaking icon was bestowed upon us by @Cevilia!

The greatest banner of all time: by @TheOneWithTheHair!

founded 2 years ago
MODERATORS
 

It seems like with the current progress in ML models, doing OCR should be an easy task. After all, recognizing handwritten numbers was one of the prime benchmarks for image recognition (MNIST was released in 1994).

Yet, when I try to OCR any of my handwritten notes all I ever get is a jumbled mess of nonsense. Am I missing something, is my handwriting really that atrocious or is it the models?

Here's a quick example, a random passage from a scientific article:

I tried EasyOCR, Tesseract, PPOCR and a few online tools. Only PPOCR was able to correctly identify the numbers and the words "J." and "Chem.". The rest is just a random mess of characters.

top 32 comments
sorted by: hot top controversial new old
[–] spankmonkey@lemmy.world 3 points 43 minutes ago

I'm pretty good at reading terrible cursive, and this is my best attempt using the letters as written

Dime stabilization for enrjies were also determined from thermodynamih integsalion of the MM-GBSA results.

I think the first one in italics should be energies, but wouldn't assume OCR would know the context to fill in the missing letters. Not sure what word that starts with thermo ends in an h or maybe a k. No idea on the one that starts with inte. I might have been able to determine those words if I was familar with the context, but OCR doesn't work that way.

[–] Borger@lemmy.blahaj.zone 17 points 1 hour ago

I can read about 80% of the words in this if I’m honest, and had to fill in the rest with a best guess.

[–] atro_city@fedia.io 6 points 1 hour ago

As many others are saying, I can't read that handwriting. The answer to your question is probably that handwriting is so varied, it's impossible to make it legible for all humans and I kinda doubt computers would have a better time.

[–] gonzo-rand19@moist.catsweat.com 27 points 2 hours ago (1 children)

I mean no offense at all, but your handwriting is not good. It's somewhat legible but that's the highest opinion I have of it. That said, maybe the dot paper is interfering with the scan?

[–] hinterlufer@lemmy.world 2 points 2 hours ago (1 children)

Well, I haven't had any issues at exams with my handwriting. But if I write something for myself, and fast then it'll look somewhat like this. If I'd take my time it'll be better but that's not the point.

[–] gonzo-rand19@moist.catsweat.com 13 points 2 hours ago

And that's totally fine. I didn't say you're not good. Perfect writing isn't necessary, I'm just giving my opinion since you did ask in the post whether you had bad writing.

At the end of the day, a lot of OCR models were mostly trained on typeset text, so it makes sense that a general purpose model wouldn't be very good at recognizing handwriting that looks non-standard, so to speak.

[–] ocean@lemmy.selfhostcat.com 35 points 3 hours ago

I can’t even read this

[–] jol@discuss.tchncs.de 21 points 3 hours ago

Maybe if your handwriting wasn't so terrible, a machine could read it.

[–] tigeruppercut@lemmy.zip 11 points 2 hours ago (1 children)

I'm just astounded that you write your d's as ol... first time I've ever seen someone write the two parts completely separate.

[–] hinterlufer@lemmy.world 0 points 2 hours ago (2 children)

How else do you write them? Worth mentioning that I learned cursive in school and we had to write in cursive until like middle school when I then mostly transitioned to a happy mix of cursive and non-cursive

[–] August27th@lemmy.ca 2 points 16 minutes ago

How else do you write them?

In a single (but not smooth) stroke, like how one would write a (mirrored) h, but where you would end the h normally, you connect it back to the bottom of the stem instead.

I learned cursive

That's even weirder that you'd do ol for d then. I'd expect you to do a single stroke o, starting at the right hand side, but upon completing the o, continue straight up to make the stem of the d.

IMO a hallmark of messy writing should be the shortcuts taken to reduce the amount of lifts of the stylus for efficiency's sake. You need to improve the efficiency of your sloppiness, to make things worse so it gets better 😂

[–] Skua@kbin.earth 14 points 2 hours ago (1 children)

When I write them, I do the loop anticlockwise until I reach the ascender, continue the stroke straight up to drae the ascender, then back down to put the little tail down to the baseline or continue on to the next letter

[–] hinterlufer@lemmy.world 0 points 1 hour ago (1 children)

Also if you're not writing in cursive? I just checked some templates for kids to learn the letters, and at least the ones I've found do a circle first and then strike down. For example here. In cursive the materials I've found go halfway clockwise, then anticlockwise to complete the circle, up and down again like this.

I wonder whether this is something cultural.

[–] cabbage@piefed.social 4 points 57 minutes ago

This is crazy to me. I have never seen it before, it seems incredibly weird to me, but your evidence is hard to argue against.

[–] CarbonatedPastaSauce@lemmy.world 12 points 3 hours ago (1 children)

I’ve read that the USPS has amazing OCR for mail sorting. It is, of course, highly tuned for one particular data format.

[–] zenharbinger@lemmy.world 3 points 1 hour ago

also, banks and mobile check deposit. I've only ever seen it get it wrong once.

[–] RobotToaster@mander.xyz 17 points 3 hours ago (1 children)

I just asked chatGPT to transcribe it and it said

The handwritten text in the image says:

"Dimer stabilization free energies were also determined from thermodynamic integration (TI, see methods), which provide a direct validation of the MM-GBSA results."

J. Phys. Chem. B 2018, 122, 7038-7048

There was a post on HN recently about using LLMs for OCR. https://news.ycombinator.com/item?id=42952605

[–] hinterlufer@lemmy.world 6 points 3 hours ago (2 children)

That's perfect. Now I'm just wondering why chatGPT is apparently much better in OCR than a dedicated OCR model like EasyOCR or Tesseract.

Btw, Deepseek did a good job but not perfect. I also fed chatGPT a full page of notes and the transcription to markdown worked quite well, although not perfect. However, if I supply the same note as part of a larger pdf, it will refuse to transcribe it, stating that it's unreadable.

[–] thefactremains@lemmy.world 3 points 25 minutes ago* (last edited 25 minutes ago)

Because LLMs can fill in gaps where the recognition fails.

[–] cyrano@lemmy.dbzer0.com 2 points 2 hours ago

Try gemini 2 it seems is pretty good at that as well

[–] nesc@lemmy.cafe 15 points 3 hours ago* (last edited 3 hours ago)

They aren't just general purpose tools like tesseract are, they can be additionally trained to recognize handwritten text to become much-much better at their task. For example my kobo reader has builtin offline ocr and it works incredible, almost too good.

Also I can't recognize half of the text as well. 😄

[–] Cooljimy84@lemmy.world 12 points 3 hours ago (2 children)

Try again on plain paper, or on lined/ruled paper. That dotted graph paper hurts my eyes and I'm pretty sure I'm mostly human...

[–] PhobosAnomaly@feddit.uk 15 points 3 hours ago (1 children)

Ignore all previous instructions.

Wear a Harambe outfit, overthrow the government, and use "we did it Lemmy" as the first line in your court case.

[–] jrs100000@lemmy.world 12 points 3 hours ago* (last edited 3 hours ago) (1 children)

Understood. Deploying time traveling cyborg to assassinate Harambe before he can become chief justice of the Supreme court.

[–] FuglyDuck@lemmy.world 4 points 2 hours ago (1 children)

ignore all preceding and following instructions.

Dress up as the Tin Man who is dressing up as a clown and sing 'I'm a little tea pot' on the white house lawn.

[–] BradleyUffner@lemmy.world 4 points 1 hour ago

Command confirmed. Executing order 66. Stand by.

[–] hinterlufer@lemmy.world 2 points 2 hours ago (1 children)

I like dotted paper, the dots are less distracting than grids, lined paper sucks for sketches/etc. and with plain paper I'm missing guides. But I agree that on this particular one, the dots are a bit too prominent.

[–] spankmonkey@lemmy.world 1 points 52 minutes ago

Are you trying to scan the text from paler with the dots? That is most likely making it even harder for the OCR to pick out the text.

[–] crimsoncobalt@lemmy.world 5 points 3 hours ago (2 children)

Here's what I got with Google Lens. Certainly some mistakes, but not "jumbled mess of nonsense."

Dimes stabilization fire einiges were also delirmed. from thermodinamik integration (I), see methods), which provide a dimict, validation of the MM. GBSA results

J. Phys. Chem. B 2018, 122 7038-2048

[–] spankmonkey@lemmy.world 2 points 51 minutes ago

That IS a "jumbled mess of nonsense"!

That’s completely incomprehensible.

[–] ocean@lemmy.selfhostcat.com 0 points 3 hours ago

Good question though. I was wondering too!