62
submitted 1 year ago by alyaza@beehaw.org to c/technology@beehaw.org

Outside of English, ChatGPT makes up words, fails logic tests, and can't do basic information retrieval.

you are viewing a single comment's thread
view the rest of the comments
[-] dudewitbow@lemmy.ml 8 points 1 year ago

Machine learning is only as good as the dataset it has, and given that english has a HUGE data set on the internet, its okay at it, but it makes sense that for other languages, its likely not ideal.

An example would is art. Look up one using a smaller data set (e.g fully legal ones where all training data had artist permission) vs ones trained on the larger dataset where legality wasnt a concern. Night and day difference

[-] FaceDeer@kbin.social 5 points 1 year ago

ChatGPT is actually able to translate the information it learns in one language into other languages, so if it's having trouble speaking Bengali and such it must simply not know the language very well. I recall a study being done where an LLM was trained up on some new information using English training data and then was asked about it in French, and it was able to talk about what it had learned in French.

[-] dudewitbow@lemmy.ml 1 points 1 year ago

Of course. But with translations, brings mistranslations, especially with tier 3+ languages to learn as a english speaker. The data is subject to the accuracy of the translation, and Chat GPT translation is still pretty far from perfect.

[-] FaceDeer@kbin.social 1 points 1 year ago

Ah, I had interpreted your comment to mean that you thought ChatGPT wouldn't know how to answer a question in Bengali unless the information it needed to solve the problem had been part of its Bengali training set. My bad.

this post was submitted on 06 Sep 2023
62 points (100.0% liked)

Technology

37603 readers
224 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS