this post was submitted on 06 May 2025
317 points (98.5% liked)

Crappy Correlations


This is a community just for fun, based on the Spurious Correlations website made by a university student. I have no relation to him, but you can click on this link to see a random correlation. I'm going to post some of these for Lemmy people for a while, until I get bored. https://www.tylervigen.com/spurious/random If you do follow the link, you will see not only the graph but also an AI-generated explanation and an AI-generated scholarly paper that supports the correlation. Who knows what is going to happen when the AIs pick up these hundreds of scholarly papers and put them in their training data.

founded 2 weeks ago
[–] collapse_already@lemmy.ml 37 points 3 days ago (3 children)

I am curious how code quality is measured. Coverity metrics? Spelling errors? Bug reports? Sounds like bullshit.

[–] paris@lemmy.blahaj.zone 12 points 3 days ago (1 children)

I don't care enough to read through the whole thing, but some cursory searching brought up a Reddit thread where a commenter found the original thesis:

Strehmel, J. (2022). Is there a Correlation between the Use of Swearwords and Code Quality in Open Source Code? [Bachelor’s Thesis, Institute of Theoretical Informatics]. https://cme.h-its.org/exelixis/pubs/JanThesis.pdf

[–] ltxrtquq@lemmy.ml 6 points 3 days ago

SoftWipe [30] is an open source tool and benchmark to assess, rate, and review scientific software written in C or C++ with respect to coding standard adherence. The coding standard adherence is assessed using a set of static and dynamic code analysers such as Lizard (https://github.com/terryyin/lizard) or the Clang address sanitiser (https://clang.llvm.org/). It returns a score between 0 (low adherence) and 10 (good adherence). In order to simplify our experimental setup, we excluded the compilation warnings, which require a difficult to automate compilation of the assessed software, from the analysis using the --exclude-compilation option.

If that means anything to you.

[–] errer@lemmy.world 14 points 3 days ago

The distribution on the right looks all sorts of fucked up. They don’t even tell us the median value of this “quality” measure.
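
For anyone wondering why the median matters with a lopsided distribution, here is a minimal Python sketch. The scores are made up purely for illustration (they are not from the thesis), and `summarize` is a hypothetical helper name. A couple of high outliers pull the mean well above the median, so reporting only an average can make the typical project look better than it is.

```python
import statistics

# Hypothetical scores on a 0-10 scale, invented for illustration only.
# They are NOT data from the thesis.
scores = [2, 2, 3, 3, 3, 4, 9, 10]


def summarize(values):
    """Return the mean and median of a list of scores."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
    }


summary = summarize(scores)
print(f"mean={summary['mean']:.2f} median={summary['median']:.2f}")
```

With skewed data like this, the gap between the two numbers is the thing you would want to see reported.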

[–] Crumbgrabber@lemm.ee 5 points 3 days ago (1 children)

It was bullshit until I posted it. Once I posted it, it automatically became true.

[–] collapse_already@lemmy.ml 2 points 3 days ago (1 children)

I can't wait for AI to give it to people as truth. We'll know we have reached peak humanity when AI-generated code starts including swear words to improve code quality.

[–] Crumbgrabber@lemm.ee 2 points 3 days ago

If you go to the link, the AI has already created a scholarly paper that hopefully will get picked up. Hilarious.