this post was submitted on 24 Apr 2025
205 points (97.2% liked)

Comic Strips

16439 readers
1718 users here now

Comic Strips is a community for those who love comic stories.

The rules are simple:

Web of links

founded 2 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] jordanlund@lemmy.world 28 points 2 weeks ago (2 children)

It's sorting correctly.

11 comes after 1.

The problem is the data needs to be fixed. 01, 02, 03...

[–] logicbomb@lemmy.world 11 points 2 weeks ago (1 children)

It's not sorting correctly. Right above the listing it specifically says "(sorting by #)".

If it says it is sorting by number, but then it sorts alphabetically, then it isn't sorting correctly.

[–] jordanlund@lemmy.world 7 points 2 weeks ago (1 children)

Problem is it can't sort by number of there are text values in file name. "11" is a number. "11." is not. :) Fun with computers!

[–] logicbomb@lemmy.world 10 points 2 weeks ago (1 children)

Both "11" and "11." are strings, because the context is listing filenames. Filenames are not numbers. They are strings. If you sort filenames by number, you are asking the computer to interpret the string as having a number inside it. At worst, it might interpret "11" as an integer and "11." as a floating point number, because that syntax is often used to specify a floating point number in programming. But even then, it could still sort them correctly.

I don't mean to start an argument, but as a professional programmer, there are just some things that I know.

[–] fushuan@lemm.ee 5 points 2 weeks ago (2 children)

Are you intentionally ignoring that the actual names of the files are "11. EpisodeEleven.mp3"? There's whitespaces and a bunch of letters there.

I'm also a professional programmer, and assuming that sorting by numbers code would try to grab the first block of strings until the first whitespace is a big assumption I would not make. I'd say that after trying to convert everything but the extension to a number for sorting it failed so it defaulted to string sorting for everything else.

load more comments (2 replies)
[–] WrenFeathers@lemmy.world 6 points 2 weeks ago* (last edited 2 weeks ago) (4 children)

Fair point, but how does it reconcile that 11 and 12 comes before 2?

(Admittedly, I know nothing at all about this stuff. I’m a music major and I went all in on that, so I’m genuinely curious)

[–] Buffalox@lemmy.world 9 points 2 weeks ago (1 children)

In alphabetical sorting 11 comes before 2 For the same reason AA is before B.

[–] WrenFeathers@lemmy.world 4 points 2 weeks ago (7 children)

So that’s the joke then? That someone chose to alphabetically sort numbers?

[–] Buffalox@lemmy.world 5 points 2 weeks ago* (last edited 2 weeks ago)

Not just someone. It's the default when numbers are in text strings, they are treated as text, not as numerical values.
To account for numbers in text strings in any text listing system, requires quite a bit of extra work when programming it.
So the joke is that computers are pretty dumb in this regard, and they need a lot of help to do it right.

[–] knightly@pawb.social 4 points 2 weeks ago* (last edited 2 weeks ago) (2 children)

Basic sorting is always like this, the joke is that way too many people still number things badly. Alphanumerically sorting variable-length numbers without normalizing the number of digits will always result in situations like 02 < 1 < 109 < 11 < 2

[–] WrenFeathers@lemmy.world 4 points 2 weeks ago (1 children)

I’m assuming that normalizing digits is similar to normalizing audio? Where you take the numbers and assigning them all a blanket set of instructions?

Normalization in mixing audio essentially takes a track and adjusts the volume of the track’s peaks to keep them from clipping the volume threshold.

Is it like this?

[–] knightly@pawb.social 3 points 2 weeks ago* (last edited 2 weeks ago) (1 children)

Yeah! In general, Normalization refers to adjusting values measured on different scales to a common scale.

Consider 1 and 10. The value of the first digit of both numbers is 1, so a scale-invariant numerical sort sees both numbers as coming before 2.

Normalizing both numbers to a two digit scale gives us 01 and 10, which sort as expected with 02.

[–] WrenFeathers@lemmy.world 2 points 2 weeks ago (6 children)

Ahhh. So you have automated processes that will handle it as well. Just plug in the ranges and it does its thing.

load more comments (6 replies)
[–] abekonge@lemmy.ml 2 points 2 weeks ago

Wow. Reply guy party.

[–] zarkanian@sh.itjust.works 1 points 2 weeks ago

The joke is that computers work this way, and it's aggravating.

load more comments (4 replies)
[–] jordanlund@lemmy.world 4 points 2 weeks ago (1 children)

Filenames are text, not numbers, so 11 comes before 2. So does 111, 101, etc. Anything starting with 1 is before 2.

In order to sort "properly", you have to name your files properly.

00
01
02
up to...
99

If you have more than 100 files in the same folder, the you have to go back through and re-name them:

000
001
002
... 099
100

[–] WrenFeathers@lemmy.world 7 points 2 weeks ago (2 children)

I’m starting to understand why programmers are always so angry…

[–] Buffalox@lemmy.world 5 points 2 weeks ago (1 children)

Oh no this isn't it, this is extremely basic. As a programmer you will encounter way way more complex logic problems, so logic problems become 2nd nature. And it can be frustrating when "normies" don't understand what appear to a programmer to be a matter of pretty simple logic thinking.

[–] WrenFeathers@lemmy.world 3 points 2 weeks ago (1 children)

As a “normie” I’d never assume to understand even the most basic of programming. My head hurts to even try and come up with a joke involving the subject.

[–] Buffalox@lemmy.world 4 points 2 weeks ago (1 children)

It's so nice we have people like you, that allows us to feel superior, even if we are inferior in every other way imaginable. 👍

[–] WrenFeathers@lemmy.world 3 points 2 weeks ago (1 children)
[–] Buffalox@lemmy.world 4 points 2 weeks ago (1 children)

Because it's the same? Or because it's very different?
My wife was a professional musician for a few years.

PS:
Do you get this joke?
The wife sends the man for shopping and asks him to buy 1 liter milk and if they have eggs buy 12.
When he comes home, she looks at him astonished, why did you buy 12 liters of milk?
Because they had eggs he responds.

[–] WrenFeathers@lemmy.world 3 points 2 weeks ago

I do get that joke, but that’s probably because I’m of the ASD persuasion.

[–] jordanlund@lemmy.world 3 points 2 weeks ago (2 children)
[–] WrenFeathers@lemmy.world 2 points 2 weeks ago

Ahh. Makes sense that it can be automated. Same as with audio engineering. Though I like to normalize by hand sometimes as automated normalization tends to make a track sound lifeless and dead.

[–] tal@lemmy.today 1 points 2 weeks ago

A Perl program to convert the number of digits in the first numeric field that appears in a list of filenames.

source

#!/usr/bin/perl -w
# numberit.pl
# Converts between number formats (number of leading zeros) in numbers in title names
# Usage: <number of digits> filelist

$digits = shift (@ARGV);

if ($digits > -1)
{
    foreach $oldName (@ARGV)
    {
        $newName = $digitValue = $oldName;

        if ($digitValue =~ m/\//) {
          $digitValue =~ m/^(.*\/[^0-9\/]*)([0-9]+)([^\/]*)$/;
          $prefix = $1;
          $postfix = $3;
          if (!defined($prefix)) {
            $prefix = "";
          }

          $digitFormatted = sprintf("%0${digits}d", $2);

        } else {
          $digitValue =~ m/^([^0-9]*)([0-9]+)([^\/]*)$/;
          $prefix = $1;
          $postfix = $3;
          if (!defined($prefix)) {
            $prefix = "";
          }

          $digitFormatted = sprintf("%0${digits}d", $2);


        }

        if ($digitValue) {
          $newName = $prefix . $digitFormatted . $postfix;
          rename($oldName, $newName);
        }
      }
}

Looks something like:

$ touch a1.mp3
$ touch a2.mp3
$ numberit.pl 3 *
$ ls
a001.mp3  a002.mp3
$ numberit.pl 2 *
$ ls
a01.mp3  a02.mp3
$
[–] Skullgrid@lemmy.world 3 points 2 weeks ago (1 children)

Fair point, but how does it reconcile that 11 and 12 comes before 2?

it doesn't, it reconciles that 02 comes before 11

[–] WrenFeathers@lemmy.world 2 points 2 weeks ago (1 children)

Ahhh. Because it’s alphabetical? Like how “eleven” comes before “two” alphabetically?

load more comments (1 replies)
[–] konalt@lemmy.world 3 points 2 weeks ago

It's sorted alphabetically. All the numbers that start with "1" come first, then all the numbers that start with "2", regardless of how long the actual number is.

[–] rumschlumpel@feddit.org 27 points 2 weeks ago

That's just alphabetical sorting. There's other sorting styles that would put 11 after 2, but those aren't available everywhere.

[–] tal@lemmy.today 22 points 2 weeks ago* (last edited 2 weeks ago) (3 children)

Assuming that the leading number and period is part of the filename:

  1. If the directory-browsing thing supports natural sorting of numbers, you can use that; this will detect numbers within the filename and sort by them. For example, I use emacs's dired to browse directories and play movies. It can modify the flags passed to ls, and ls supports natural sorting of numbers in filenames with the -v flag. Doing this in dired means C-u s v RET, and the directory will be displayed sorted numerically.

  2. If the directory-browsing thing doesn't support that, rename to a "0"-padded format such that a lexicographic order has the same ordering as numeric order:

     $ for f in *; do
         n="$(printf %04d ${f%%.*})"
         mv "$f" "$(echo $f|sed s/^[0-9]*/$n/)"
     done 
    

    That way,

     1. Episode_One.mp4
    

    will become

     0001. Episode_One.mp4
    

    and so forth, and so software that can only do lexicographic orderings is happy.

If the leading number and period isn't part of the filename, then we need to parse human-language strings.

$ npm install words-to-numbers

$ for f in *; do
    w="$(echo $f|sed -r "s/^Episode_([^.]*).mp4/\1/")"
    n=$(echo $w|node -e 'wn=require("words-to-numbers"); const l=require("readline").createInterface({input:process.stdin, output:process.stdout, terminal:false}); rl.on("line", l=> {console.log(wn.wordsToNumbers(l));})')
    nf=$(printf %04d $n)
    mv "$f" "$nf. $f"
done

That'll rename to the format mentioned above in option 2:

Episode_One.mp4

will become

0001. Episode_One.mp4

And lexicographic sorting will work.

[–] hemko@lemmy.dbzer0.com 24 points 2 weeks ago

This is why I love Lemmy. There's a meme post and then someone writes a 7 page long technical solution like they were paid contributor at stack overflow.

Never change <3

[–] Buffalox@lemmy.world 6 points 2 weeks ago (1 children)

Wow is it really that simple? I wonder why not everybody does that! 🤣

[–] tal@lemmy.today 5 points 2 weeks ago* (last edited 2 weeks ago) (2 children)

The first case is the most-likely for most people, and the simplest to do. Most directory browsers do support numeric sorting.

In the second case, I provided a Perl program in another comment in the thread that provides a generic way to do this with one command for nearly all files of this sort.

The third case, where human-language stuff needs to be parsed, true enough, doesn't just have a button to push.

load more comments (2 replies)
[–] ICastFist@programming.dev 4 points 2 weeks ago

You know what, I'll just manually rename the files

[–] pageflight@lemmy.world 11 points 2 weeks ago

So true. I recently learned there's a Python package for natural sorting, similar in other languages.

[–] Skullgrid@lemmy.world 8 points 2 weeks ago (1 children)
[–] SculptusPoe@lemmy.world 5 points 2 weeks ago (1 children)

You are correct, she should have used leading '0's. Really, they could have fixed that sort. Excel does it without leading '0's. Excel also does a lot of other nonsense because it is trying too hard, maybe I prefer the dumb sort. I try to get my people to start dated files with YYYYMMDDHHMM just because the file sorting is sort of dumb, but they insist on MMDDYYYY so all the years get jumbled together by month...

[–] knightly@pawb.social 7 points 2 weeks ago* (last edited 2 weeks ago)

This is 100% my pet peeve. Obviously Y-M-D-H-M is the correct format for timestamps because an alphanumeric sort will put them in chronological order.

[–] zarkanian@sh.itjust.works 5 points 2 weeks ago

That's why you give them names starting with 001, 002, etc.

[–] ElectroLisa@lemmy.blahaj.zone 5 points 2 weeks ago

Someone's got wrong locale set

load more comments
view more: next ›