Hot take but PDFs became the primary form of document transfer because Microsoft made .doc, docx, docm, rtf, doc 2003-2020...
All those "It won't open" just forced everyone to say "Fuck it send me the PDF"
Hot take but PDFs became the primary form of document transfer because Microsoft made .doc, docx, docm, rtf, doc 2003-2020...
All those "It won't open" just forced everyone to say "Fuck it send me the PDF"
Pretty much. PDF was specifically designed to retain the same look across any device. The goal was that if you designed a document to look a certain way, that opening it on another device wouldn’t fuck your entire design. That’s also why editing PDFs is so damned frustrating, because they’re designed to not change. It largely started as a frustration with the “move an image 3 pixels to the left, and now all your text is in the wrong place” issue. But the EEE strategy by Microsoft directly contributed to pdf becoming the de facto way to share documents.
My Dad got frustrated with docs as people saw that as an invitation to edit the document, or cut and paste stuff he would write. So he switched to using PDF whenever other people got involved.
Yeah, that’s ironically what Microsoft has been moving towards. Collaborative editing is incredible when used properly. But that also means anyone with edit access can mess up your carefully crafted document. Luckily, things like Comments are becoming more commonplace, so people can suggest edits without actually being able to commit them.
It doesn’t solve the copy/pasting issue, but you can copy/paste from PDFs these days anyways. Realistically, even saving it as an image won’t solve that, since most devices can recognize text in images now.
Well, that and every time you touch a DOC/DOCX file it reformats itself to your local settings, fucking up the entire layout. PDF is a terrible, inefficient, poorly (or at least variably) implemented format which was proprietary for two decades but is now about the best option we have for a document to look the same at the recipient end as the sender and still include text, vector, bitmapped, semi-interactive, and certifiable/traceable contents.
I really, really hate that so many people still try to share ebooks as PDFs. Why that was ever a thing makes no sense to me. Yes, I absolutely wish to read a 500 page novel on portrait letter size pages with tiny font that completely ignores my screen size.
I've given up on trying to find certain books in sane formats. Thankfully Calibre is really good at converting PDFs to actual ebook formats.
There's a bit of a learning curve, and sometimes I have to do a little semi-automated cleanup -- but it works.
Really? I must have had a particularly troublesome PDF. It was almost like running it through OCR, generating hundreds of weird typos and formatting errors when I tried to convert with calibre.
The OCR struggles with some PDFs for whatever reasons: font, formatting, etc.
There are 3rd party PDF OCR websites/programs that work better. If I'm having issues I run it through one of those first.
Any suggestions? Even the good ones had error rates that might not matter for a couple of pages, but when scaled to a 500 page book, even a 1% error rate results in an annoying level of typos.
I use gImageReader + Tesseract, but that probably doesn't meet your criteria. Unfortunately OCR is very rarely perfect unless the input is perfectly clear and with a "OCR friendly" font/formatting. There are "AI powered" OCR out there, but I can't speak to how well they work and I don't know of any free ones.
What are more efficiente and better implemented formats for documents sharing?
Djvu, but it's toolset is proprietary.
Markdown is gaining traction. There's lots of tools that will edit and display Markdown consistently, and without a dedicated tool, it's just a very readable text file.
And, most importantly for today, it's easy to generate a PDF file from, haha.
Yes and No.
They were really designed to show the same output on the screen and printer.
Even if you are using the same word processor software and file format, a document can look vastly different when you send it to someone else who doesn't have the same screen resolution or the same fonts installed.
PDF started as just a print preview for the postscript printer language. They should have just stopped there instead of trying to make it do all sorts of other shit that can open security holes.
The constant parade of file formats drove popularity, but it was really about being the only popular format to look the same.
Instead of using .odt.
Maybe with more advertising? Most people don't know about the Open Document Format and that it's standardization sent MS to panicky rework their .doc & co. to pseudo-open OOXML (.docx etc.).
When you save an odt from Word and open it in OpenOffice, the formatting is usually all fucked. At least that used to be the case. A pdf comes out right on the other side.
It's intentionally fucked by MS. It doesn't matter that this non-MS software actually follows consistent standards. As long as its only the minority, they get away with it looking like it's the others not being consistent.
MS has a history of doing it. It's in the company ethos of "embrace, extend and extinguish". Imagine something as simple as storing the contents of a document being at the behest of a private company. Humanity is all the worse for it.
DOS isn't done until Lotus won't run.
Except for my local printing shop, which couldn't print my PDF poster for some reason so now they are asking for a PPT. WTF!
There's someone at your local print shop unqualified to be doing their job.
One thing I've learned over the years: the scruffier looking the IT guy, the more they should be listened to.
They don't bear the moniker "greybeard" without reason
I thought that was because they shout a lot.
Richard Stallman is rarely wrong
Except when he eats something from his foot.
Or talks about paedophilia...
"The nominee is quoted as saying that if the choice of a sexual partner were protected by the Constitution, 'prostitution, adultery, necrophilia, bestiality, possession of child pornography, and even incest and pedophilia' also would be. He is probably mistaken, legally--but that is unfortunate. All of these acts should be legal as long as no one is coerced. They are illegal only because of prejudice and narrowmindedness."
RMS on June 28th, 2003
"I am skeptical of the claim that voluntarily pedophilia harms children. The arguments that it causes harm seem to be based on cases which aren't voluntary, which are then stretched by parents who are horrified by the idea that their little baby is maturing. "
RMS on June 5th, 2006
"There is little evidence to justify the widespread assumption that willing participation in pedophilia hurts children.
Granted, children may not dare say no to an older relative, or may not realize they could say no; in that case, even if they do not overtly object, the relationship may still feel imposed to them. That's not willing participation, it's imposed participation, a different issue. "
RMS on Jan 4th, 2013
That said, when he's talking about the potential dangers of proprietary software, he's usually bang on.
Taking RMS's word as law outside of dev space is like asking Michael Jordan to solve all the geopolitical conflicts in the middle east. Why the fuck would you think he knows anything about that?
...I mean if Michael Jordon went around saying "we should just like, carpet bomb the place and take over" I think people would also be pretty horrified
I went from "he was probably defending pedophiles not acting on it" to "HOLY FUCK" real fast...
I understand this is not an easy topic, but what is the "HOLY FUCK" bit you're referring to here?
We're just too pedestrian to get it.
I have a great idea for a program! I should describe it in agonizing detail to an AI owned by some company so it will spit out working source code. Nothing can go wrong with my plan!
If you make it open source they can't steal your idea
I mean, they can still steal your idea, fork it, repackage it and charge for it while refusing to upstream their development. But now it's a licensing discussion and not a personal attack.
Apple and Google have stolen plenty of ideas. They don't care if it's closed or open.
If they can't prevent others from using it, seems fine to me.
If AI will stop people from telling me their "amazing" app ideas when they find out I'm a programmer then I'm all for it.
This is a flawless plan, especially since they pinky swore that they wouldn't keep around the information you put into the black box AI. So we're all safe!
I feel like that violin is probably written in C. Or maybe Go. Has anyone made this yet? I might have my next weekend project
It's being re-written in Rust.
Unsafe Zig if you wanna play fast..
This is a bit Mary Sue for my taste but lots of truth there lol
TIL the xkcd main character(s?) is (are?) called Cueball
Cueball is the main character; others have their own names; for example Megan (appearing a lot as the main Feminine character)
More recurring characters are listed on this page: https://www.explainxkcd.com/wiki/index.php/Characters
It's unofficial, but yeah. People call him that.
A community for a webcomic of romance, sarcasm, math, and language.