submitted 1 year ago* (last edited 1 year ago) by BarterClub@sh.itjust.works to c/technology@lemmy.world
[-] whoamibro@lemmy.world -2 points 1 year ago

If the source is already in text (perfectly accessible), why should we make an image out of it? That's like saying let's email a document, but instead of the original doc file, let's print it out, scan it, and then send a PDF of those images instead.

That is not a correct analogy, because printing and scanning a document is less convenient than just forwarding the email. But here, most people are comfortable taking a screenshot and sharing it. That's what they've learned, so they keep doing it.

My man, you just don't know how crappy OCR can be with non-Latin writing systems, especially Chinese characters.

That's why OCR tools have to be improved. They should at least be able to read the 10 most used fonts in a language without issues.

[-] condenser@lemdro.id 2 points 1 year ago

The analogy is just an attempt at explaining, no need to argue about it. The main point is about giving preference to the original copy, not the lossy and inefficient copy. No matter how much image-to-text conversion tools improve, there will always be a gap.

this post was submitted on 21 Jul 2023
1429 points (99.0% liked)
