Asklemmy

50410 readers

838 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy 🔍

If your post meets the following criteria, it's welcome here!

Open-ended question
Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
Not ad nauseam inducing: please make sure it is a question that would be new to most members
An actual topic of discussion

Looking for support?

Looking for a community?

Lemmyverse: community search
sub.rehab: maps old subreddits to fediverse options, marks official as such
!lemmy411@lemmy.ca: a community for finding communities

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 6 years ago

MODERATORS

260

What's something that you were surprised to find out a lot of people hate? (lemm.ee)

submitted 2 years ago* (last edited 2 years ago) by Rinna@lemm.ee to c/asklemmy@lemmy.ml

573 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] TheActualDevil@sffa.community 3 points 2 years ago (1 children)

If a human being takes people’s work and pieces it together in a way that resembles other works without using any LLM/AI or automation tool, is the final result content theft too?

Yes, obviously. Artists and writers can learn from others and can be inspired by other's works, but they can't use parts of those works. That is content theft. Imitating a style is fine, but you have to create something new. LLMs cannot create, only steal.

[–] XEAL@lemm.ee -1 points 2 years ago* (last edited 2 years ago) (1 children)

If, for example, I ask an LLM to produce a short story with a completely unique and random prompt that doesn't resemble any known existing story in its training data (or in the entire world, if you like), is the generated output of the LLM also stolen?

[–] TheActualDevil@sffa.community 1 points 2 years ago (1 children)

I think what you're proposing isn't something they can do. Are you saying "What if I asked it to create a short story who's pieces don't resemble any pieces of known stories?" or are you saying "What if I asked it to create a short story who's whole doesn't resemble any known stories?"

The first one can't happen. The second? Yes, it's stealing.

Where is it getting this story? LLMs don't have creativity. They don't understand story structure. It pulls sentences and paragraphs from work in it's training data. If the generated output contains work that others have made, that's called plagiarism. If it doesn't, then your hypothetical isn't realistic. LLMs can't create original works. That's the whole point. It pulls pieces of the training data and rearranges them. It would be like if I was writing a college paper and instead of writing anything myself I just pulled 100 different sources and copied a sentence or two from each source and structured them as my paper. That's 100% plagiarism.

[–] XEAL@lemm.ee 0 points 2 years ago

I was referring to producing a unique plot.

The process of generating a story involves recombining and rephrasing the LLM's training data in unique ways, it's not a copypaste job. They generate content by predicting and generating text based on patterns, an this implicates a degree of transformation and synthesis.

Where do you draw the line between plagiarism vs inspiration, whether it's a person or an LLM? How long and similar to something existing does a fragment of text have to be to cross the plagiarism line?