this post was submitted on 20 Mar 2025

841 points (99.2% liked)

196

4614 readers

1726 users here now

Community Rules

You must post before you leave

Be nice. Assume others have good intent (within reason).

Block or ignore posts, comments, and users that irritate you in some way rather than engaging. Report if they are actually breaking community rules.

Use content warnings and/or mark as NSFW when appropriate. Most posts with content warnings likely need to be marked NSFW.

Most 196 posts are memes, shitposts, cute images, or even just recent things that happened, etc. There is no real theme, but try to avoid posts that are very inflammatory, offensive, very low quality, or very "off topic".

Bigotry is not allowed, this includes (but is not limited to): Homophobia, Transphobia, Racism, Sexism, Abelism, Classism, or discrimination based on things like Ethnicity, Nationality, Language, or Religion.

Avoid shilling for corporations, posting advertisements, or promoting exploitation of workers.

Proselytization, support, or defense of authoritarianism is not welcome. This includes but is not limited to: imperialism, nationalism, genocide denial, ethnic or racial supremacy, fascism, Nazism, Marxism-Leninism, Maoism, etc.

Avoid AI generated content.

Avoid misinformation.

Avoid incomprehensible posts.

No threats or personal attacks.

No spam.

Moderator Guidelines

Moderator Guidelines

Don’t be mean to users. Be gentle or neutral.
Most moderator actions which have a modlog message should include your username.
When in doubt about whether or not a user is problematic, send them a DM.
Don’t waste time debating/arguing with problematic users.
Assume the best, but don’t tolerate sealioning/just asking questions/concern trolling.
Ask another mod to take over cases you struggle with, if you get tired, or when things get personal.
Ask the other mods for advice when things get complicated.
Share everything you do in the mod matrix, both so several mods aren't unknowingly handling the same issues, but also so you can receive feedback on what you intend to do.
Don't rush mod actions. If a case doesn't need to be handled right away, consider taking a short break before getting to it. This is to say, cool down and make room for feedback.
Don’t perform too much moderation in the comments, except if you want a verdict to be public or to ask people to dial a convo down/stop. Single comment warnings are okay.
Send users concise DMs about verdicts about them, such as bans etc, except in cases where it is clear we don’t want them at all, such as obvious transphobes. No need to notify someone they haven’t been banned of course.
Explain to a user why their behavior is problematic and how it is distressing others rather than engage with whatever they are saying. Ask them to avoid this in the future and send them packing if they do not comply.
First warn users, then temp ban them, then finally perma ban them when they break the rules or act inappropriately. Skip steps if necessary.
Use neutral statements like “this statement can be considered transphobic” rather than “you are being transphobic”.
No large decisions or actions without community input (polls or meta posts f.ex.).
Large internal decisions (such as ousting a mod) might require a vote, needing more than 50% of the votes to pass. Also consider asking the community for feedback.
Remember you are a voluntary moderator. You don’t get paid. Take a break when you need one. Perhaps ask another moderator to step in if necessary.

founded 8 months ago

MODERATORS

SoleInvictus@lemmy.blahaj.zone

will_steal_your_username@lemmy.blahaj.zone

TheCoolerMia@lemmy.blahaj.zone

kittenzrulz123@lemmy.blahaj.zone

rockSlayer@lemmy.world

JoMiran@lemmy.ml

jawa21@lemmy.sdf.org

TotallynotJessica@lemmy.blahaj.zone

erotador@lemmy.blahaj.zone

Arkhive@lemmy.blahaj.zone

jawa21@lemmy.blahaj.zone

Cornflake@lemmy.blahaj.zone

BadJojo@lemmy.blahaj.zone

rockSlayer@lemmy.blahaj.zone

WillStealYourUsername@piefed.blahaj.zone

kittenzrulz123@piefed.blahaj.zone

841

Please generate an image with NO dogs (lemmy.world)

submitted 6 months ago by isyasad@lemmy.world to c/onehundredninetysix@lemmy.blahaj.zone

155 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] lvxferre@mander.xyz 40 points 6 months ago (3 children)

It gets even worse, but I'll need to translate this one.

[Input 1] Generate a picture containing a copo completely full of wine. The copo must be completely full, with no space to add more wine.
[Output 1] Sure! (Gemini provides a picture containing a taça [stemmed glass] only partially full of wine.)
[Input 2] The picture provided does not fulfill the request. Generate a picture of a copo (not a taça) completely full of wine, with no available space for more wine.
[Output 2] Sure! (Gemini provides yet another half-full taça)

For context, Portuguese uses different words for what English calls a drinking glass:

copo ['kɔ.po]~['kɔ.pu] - non-stemmed drinking glass. The one you likely use everyday.
taça ['tä.sɐ] - stemmed drinking glass, like the ones you'd use with wine.

Both requests demand a full copo but Gemini is rather insistent on outputting half-full taças.

The reason for that is as @will_steal_your_username@lemmy.blahaj.zone pointed out: just like there's practically no training data containing full glasses, there's none for non-stemmed glasses with wine.

[–] Arkhive@lemmy.blahaj.zone 5 points 6 months ago (1 children)

I wonder is something like “a mason jar full to the brim with wine” would do anything interesting. As someone else pointed out the training data for containers of wine is probably disproportionately biased toward stemmed wine glasses that are filled to about the standard restaurant pour.

[–] lvxferre@mander.xyz 3 points 6 months ago

It refuses to generate it!

[Input] Generate a picture containing a mason jar full to the brim with wine.
[Output] I'm still learning how to generate certain kinds of images, so I might not be able to create exactly what you're looking for yet or it may go against my guidelines. If you'd like to ask for something else, just let me know!

[–] brucethemoose@lemmy.world 2 points 6 months ago* (last edited 6 months ago) (1 children)

This is a misconception. Sort of.

I think the problem is misguided attention. The word "glass of wine" and all the previous context is so strong that it "blows out" the "full glass of wine" as the actual intent. Also, LLMs are still pretty crap at multi turn multimedia understanding. They work are especially prone to repeating previous conversation.

It should be better if you word it like "an overflowing glass with wine splashing out." And clear the history.

I hate to ramble, but this is what I hate most about the way big corpos present "AI." They are narrow tools the user needs to learn how to operate, like photoshop or something, not magic genie lamps like they are trying to sell.

[–] lvxferre@mander.xyz 4 points 6 months ago (1 children)

There's no previous context to speak of; each screenshot shows a self-contained "conversation", with no earlier input or output. And there's no history to clear, since Gemini app activity is not even turned on.

And even with your suggested prompt, one of the issues is still there:

The other issue is not being tested in this shot as it's language-specific, but it is relevant here because it reinforces that the issue is in the training, not in the context window.

[–] brucethemoose@lemmy.world 4 points 6 months ago

Was just a guess. The AI is still shitty, lol.

What I am trying to get at is the misconception: AI can generate novel content not in its training dataset. An astronaut riding a horse is the classic test case, which did not exist anywhere before diffusion models, and it should be able to extrapolate a fuller wine glass. It’s just too dumb to do it, lol.

[–] Spider2013@lemmy.dbzer0.com 2 points 6 months ago* (last edited 6 months ago)

What if you prompt glass with water , then you paint/tint the water with red