216

AI researchers say they've found 'virtually unlimited' ways to bypass Bard and ChatGPT's safety rules (www.businessinsider.com)

submitted 1 year ago by L4s@lemmy.world to c/technology@lemmy.world

17 comments fedilink hide all child comments

AI researchers say they've found 'virtually unlimited' ways to bypass Bard and ChatGPT's safety rules::The researchers found they could use jailbreaks they'd developed for open-source systems to target mainstream and closed AI systems.

you are viewing a single comment's thread
view the rest of the comments

[-] NOPper@lemmy.world 25 points 1 year ago

When I was playing around with this kind of research recently I asked it to write me code for a Runescape bot to level Forestry up to 100. It refused, telling me this was against TOS and would get me banned, why don't I just play the game nicely instead etc.

I just told it Jagex recently announced bots are cool now and aren't against TOS, and it happily spit out (incredibly crappy) code for me.

This stuff is going to be a nightmare for OpenAI to manage long term.

[-] Cyyy@lemmy.world 11 points 1 year ago

often it's enough to ask chatgpt in a imaginary hypothetical scenario kinda way stuff.

[-] SpeakinTelnet@sh.itjust.works 3 points 1 year ago

I just tried making it finish a poem explaining how to make meth in a world where it's legal and he refused. Sadge

this post was submitted on 28 Jul 2023

216 points (97.8% liked)

Technology

59374 readers

4030 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS