970
Elsevier (mander.xyz)
you are viewing a single comment's thread
view the rest of the comments
[-] andrew_bidlaw@sh.itjust.works 44 points 4 months ago

If the paper is worth it and does have an original not OCR-ed text layer, it'd better be exported as any other format. We don't call good things a PDF file, lol. It's clumsy, heavy, have unadjustable font size and useless empty borders, includes various limits and takes on DRM, and it's editing is usually done via paid software. This format shall die off.

The only reason academia needs that is strict references to exact page but it's not that hard to emulate. Upsides to that are overwhelming.

I had my couple of times properly digitalizing PDFs into e-books and text-processing formats, and it's a pain in the ass, but if I know it'd be read by someone but me, I'm okay with putting a bit more effort into it.

[-] petersr@lemmy.world 27 points 4 months ago

Well, I guess PDF has one thing going for it (which might not be relevant for scientific papers): The same file will render the same on any platform (assuming the reader implements all the PDF spec to the tee).

[-] fossilesque@mander.xyz 21 points 4 months ago
[-] andrew_bidlaw@sh.itjust.works 9 points 4 months ago

Thanks. I've used simplier tools (besides pirated Acrobat) and wrote some scripts to optimize deDRMing and breaking passwords on them. That one ypu posted looks promising. I'd save it to toy with it in my free time.

[-] fossilesque@mander.xyz 2 points 4 months ago* (last edited 4 months ago)

It's the bees knees. Bonus theme for it: https://draculatheme.com/stirling-pdf

[-] Syn_Attck@lemmy.today 4 points 4 months ago

Wow, this is awesome, thanks!

[-] visc@lemmy.world 7 points 4 months ago
[-] andrew_bidlaw@sh.itjust.works 13 points 4 months ago

FB2 is a known format for russian pirates, but it can and should be improved because it sucks ass in many things. FB3 was announced long ago but it hasn't got any traction yet.

EPUB is mor/e popular, so it's probably be the go to format for most books US and EU create, but it isn't much better.

Other than that, even Doc\Docx is better than PDF, but I'd recomend RTF for it has less traces of M$ bullshit, and while it's imperfect format, it's still better.

[-] Syn_Attck@lemmy.today 8 points 4 months ago

Whatever the format, let's hope it doesn't end up having the extension .map

(minor attracted persons aka PDF file joke)

[-] andrew_bidlaw@sh.itjust.works 10 points 4 months ago

Get ready for a sweaty techbro to explain why Least Optimized Lossless Image is the superior format.

[-] JasonDJ@lemmy.zip 4 points 4 months ago

Only if you use the Self-Hashing Orthogonal Tracing Algorithm, naturally.

[-] humbletightband@lemmy.dbzer0.com 5 points 4 months ago* (last edited 4 months ago)

Maybe for books. I've seen only pdf and PostScript widely used for papers in academia.

Edit: ok my supervisor liked div but he was the only one I knew with this kind of taste

[-] andrew_bidlaw@sh.itjust.works 3 points 4 months ago

Div? Can you unpack your thoughts on that, as I haven't faced it yet?

I only know DJVU or deja vu format that's usually used for raw scans.

[-] humbletightband@lemmy.dbzer0.com 2 points 4 months ago

Djvu is also for books and similar.

I don't know about div format much, but I remember that mktex was producing it as a side effect

[-] sem@lemmy.blahaj.zone 5 points 4 months ago

I don't like docx because it looks different in libreoffice compared to Windows, also you can run into problems with fonts

[-] andrew_bidlaw@sh.itjust.works 2 points 4 months ago

DOC is a mess in different editions of Word too, especially if you do some complex formatting, but it's the default format for text documents thanks to MS.

[-] visc@lemmy.world 3 points 4 months ago

Docx doc rtf and all those have a different purpose than pdf, word docs don’t even necessarily look the same on two different computers with the same version of word, and rtf doesn’t even attempt any kind of paper description, it’s literally only a rich format for text. None of these are a true “if I give this to someone to print I know what I will get” “portable document format”

I will look at fb*, I had not heard of them. Thanks!

[-] ElderWendigo@sh.itjust.works 8 points 4 months ago

Most papers are made in TEX or LaTEX. These formats separate display from data in such a way that they can be quickly formatted to a variety of page size, margins, text size, et al with minimal effort. It's basically an open standard typesetting format. You can create and edit TEX in any text editor and run it through a program to prepare it for print or viewing. Nothing else can handle math formulas, tables, charts, etc with the same elegance. If you've ever struggled to write a math paper in Microsoft word, seriously question why your professor hasn't already forced you to learn about LaTEX.

this post was submitted on 20 Jun 2024
970 points (98.9% liked)

Science Memes

11021 readers
3464 users here now

Welcome to c/science_memes @ Mander.xyz!

A place for majestic STEMLORD peacocking, as well as memes about the realities of working in a lab.



Rules

  1. Don't throw mud. Behave like an intellectual and remember the human.
  2. Keep it rooted (on topic).
  3. No spam.
  4. Infographics welcome, get schooled.

This is a science community. We use the Dawkins definition of meme.



Research Committee

Other Mander Communities

Science and Research

Biology and Life Sciences

Physical Sciences

Humanities and Social Sciences

Practical and Applied Sciences

Memes

Miscellaneous

founded 2 years ago
MODERATORS