this post was submitted on 09 Aug 2025
151 points (93.6% liked)

Technology

39864 readers
294 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 6 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] SocialMediaRefugee@lemmy.ml 10 points 1 month ago (3 children)

wtf, I checked out the claim about states with R in their name and it was pretty ridiculous how badly it answered it. It answers like someone running through their thoughts in front of you.

====================================================

list all of the US states with R in their name

ChatGPT said: Here’s the full list of U.S. states that have the letter R in their name:

Arizona

Arkansas

California

Colorado

Delaware

Florida

Georgia

Illinois

Maryland

Massachusetts

Missouri

Nebraska

North Carolina

North Dakota

Oregon

South Carolina

Texas – (wait… nope, no R here! ❌)

Vermont

Virginia

Washington – (nope, no R either ❌)

West Virginia

Wisconsin – (no R ❌)

Wyoming – (no R ❌)

Let me clean that up so only correct ones remain:

Final Correct List (22 states):

Arizona

Arkansas

California

Colorado

Delaware

Florida

Georgia

Maryland

Missouri

Nebraska

North Carolina

North Dakota

Oregon

Vermont

Virginia

West Virginia

New Hampshire

New Jersey

New York

Massachusetts

Connecticut – (wait, no R ❌)

Minnesota – (no R ❌)

Hold on — I think we should actually verify methodically, because it's easy to miss one or double-count. Do you want me to go alphabetically through all 50 states and pick only the ones with "R" so we’re 100% sure? That way we won’t miss any.

[–] SocialMediaRefugee@lemmy.ml 5 points 1 month ago (1 children)

I tried this with MS Copilot and got this:

===========================================

list all of the US states with R in their name

Sure! Here's a list of all U.S. states that contain the letter "R" in their name: 🗺️ U.S. States with "R" in the Name

  • Arizona
  • Arkansas
  • California
  • Colorado
  • Delaware
  • Florida
  • Georgia
  • Maryland
  • Massachusetts
  • Minnesota
  • Missouri
  • Nebraska
  • New Hampshire
  • New Jersey
  • New York
  • North Carolina
  • North Dakota
  • Oregon
  • South Carolina
  • Texas
  • Vermont
  • Virginia
  • Washington
  • West Virginia
  • Wisconsin
  • Wyoming Let me know if you want this grouped by region or visualized in a map!
[–] mindbleach@sh.itjust.works 1 points 1 month ago

Seems like loss of context. By the end it's seeing a list of US states, alphabetically, and it'd usually be weird to skip one.

The question is not kept at the forefront for each state named.

[–] eager_eagle@lemmy.world 4 points 1 month ago* (last edited 1 month ago) (3 children)

LLMs fundamentally can't answer this kind of question, so it's an unfair test. They don't see letters, they just see tokens.

It's a bit like asking a random person on the street the square root of 75. Most will just guess on the spot.

[–] Vendetta9076@sh.itjust.works 4 points 1 month ago

I think this kind of stance is really interesting because of course you're right but it doesn't matter. It doesnt actually matter what an LLM can do, it matters what corpos tell us they can do. And according to the corpos this is child's play.

[–] jj4211@lemmy.world 2 points 1 month ago

Most will just guess on the spot.

Well no, most would say "I don't know". Which an LLM is unlikely to do unless the training material shows that a consistent answer is "I don't know". It will give a fact shaped answer that may fail, but it's at least 'shaped' right.

Again, the big problem is not that LLMs are just useless because they can't do these little specific tricks, it's that it is important to continue calibrating expectations, particularly as, at this point, people have bet trillions on these things and that's a lot of money to have people lie and cheat their way to make everyone overestimate them. Without counter perspective, I think 90% of my coworkers would have been laid off already as the executives just soak in unchallenged marketing bullshit from the big players.

[–] SocialMediaRefugee@lemmy.ml 1 points 1 month ago (2 children)

But I've seen AI results that are basically extracts of sources. They'll even give a link to them.

[–] jj4211@lemmy.world 1 points 1 month ago

A bit off topic, but that's pretty much a result of "prompt stuffing". Your prompt is processed into a good old fashioned search query and then the search results are sort of added to the prompt. Basically from the LLM perspective, it seems a request to rework your source material in a manner consistent with your prompt. The LLM is fed the correct answer, so it doesn't have to answer, it just has to reword the input.

[–] Hotzilla@sopuli.xyz 0 points 1 month ago (1 children)

GPT-5 without "thinking" mode got the answer wrong.

GPT-5 with thinking answered:

Here are the 21 US states with the letter “R” in their name:

Arizona, Arkansas, California, Colorado, Delaware, Florida, Georgia, Maryland, Missouri, Nebraska, New Hampshire, New Jersey, New York, North Carolina, North Dakota, Oregon, Rhode Island, South Carolina, Vermont, Virginia, West Virginia.

It wrote a script that verified it while doing the "thinking" (feeding the hallusinations back to the LLM)