OK. I'm at wit's end attempting to convince Google's LLM to pronounce an English name correctly. (beehaw.org)

submitted 1 week ago by Powderhorn@beehaw.org to c/technology@beehaw.org

Seriously, 15 times is my limit on correcting an LLM.

The name in question? Rach. Google absolutely cannot pronounce it in any other way than assuming I was referring to Louise Fletcher in the diminutive.

Specifying "long a" did nothing, and now I'm past livid. If you can't handle a common English name, why would I trust you with anything else?

This is my breaking point with LLMs. They're fucking idiotic and can't learn how to pronounce English words auf Englisch.

I hope the VCs also die in a fire.

howrar@lemmy.ca 3 points 1 week ago* (last edited 1 week ago)

I'm pretty sure whatever voice system you're using is just transcribing things to text and feeding it into an LLM, so it wouldn't actually have that audio data. I'm not aware of any audio equivalent of LLMs existing.
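
To make that concrete, here is a minimal sketch of such a cascaded setup, assuming it really is speech-to-text followed by a text-only model. Every name in it (`Audio`, `transcribe`, `llm_reply`, `pipeline`) is hypothetical, the IPA strings are my own guess at the two pronunciations, and none of this is Google's actual code. It only shows why a spoken correction would be lost if the transcript is all the model ever sees.

```python
from dataclasses import dataclass


@dataclass
class Audio:
    """Stand-in for recorded speech: the spelling plus how it was said."""
    words: str
    phonemes: str  # IPA guess, purely illustrative


def transcribe(audio: Audio) -> str:
    # Hypothetical speech-to-text step: only the spelling survives.
    return audio.words


def llm_reply(prompt: str) -> str:
    # Hypothetical text-only model: it sees characters, never sound.
    return f"You said: {prompt}"


def pipeline(audio: Audio) -> str:
    # Cascade: audio -> text -> model. The phonemes are dropped on the way.
    return llm_reply(transcribe(audio))


long_a = Audio("Rach", "reɪtʃ")   # the intended pronunciation (assumed IPA)
short_a = Audio("Rach", "rætʃ")   # the mispronunciation (assumed IPA)

# Both recordings collapse to the same prompt, so the replies are identical.
same = pipeline(long_a) == pipeline(short_a)
```

If the real system works this way, saying the name correctly 15 times gives the model the same four letters 15 times.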

Powderhorn@beehaw.org 1 point 1 week ago* (last edited 1 week ago)

The equivalent is NLP (natural language processing), which was already a huge research area in the '90s. In fact, had I not been a fucking idiot and caught the journalism bug, with my studies in CS and linguistics, I'd likely be doing quite well.

That said, NLP back then was about voice input being converted to text -- e.g., Dragon NaturallySpeaking -- but apparently little progress has been made going in the other direction. NotebookLM had other weird glitches where standard English words got weird vowels some 5% of the time.
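
For the text-to-speech direction, the standard way to pin down a pronunciation is the W3C SSML `<phoneme>` element, which carries an explicit phonetic spelling alongside the written word. The sketch below just builds such a string in Python. Whether NotebookLM or any Google assistant accepts SSML from a user is an assumption on my part and not something established in this thread; the helper names `phoneme` and `speak` are hypothetical, and the IPA `reɪtʃ` is my guess at the "long a" pronunciation.

```python
from xml.sax.saxutils import escape, quoteattr


def phoneme(text: str, ipa: str) -> str:
    """Wrap `text` in an SSML <phoneme> element with an IPA pronunciation."""
    # quoteattr adds the surrounding quotes and escapes the attribute value.
    return f"<phoneme alphabet=\"ipa\" ph={quoteattr(ipa)}>{escape(text)}</phoneme>"


def speak(*parts: str) -> str:
    """Join SSML fragments under a <speak> root.

    Parts are assumed to be valid SSML already (plain text without &, < or >).
    """
    return "<speak>" + "".join(parts) + "</speak>"


# "reɪtʃ" is an assumed IPA rendering of Rach with a long a.
ssml = speak("Say hello to ", phoneme("Rach", "reɪtʃ"), ".")
print(ssml)
```

An engine that honors the tag reads the name from the `ph` attribute instead of guessing from the spelling, which is exactly the control a chat prompt like "long a" does not give you.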