Midjourney bans all Stability AI employees over alleged data scraping (www.theverge.com)

submitted 8 months ago by Akisamb@programming.dev to c/machine_learning@programming.dev

2 comments fedilink hide all child comments

cross-posted from: https://lemmy.ml/post/13088944

top 2 comments

sorted by: hot top controversial new old

[-] Akisamb@programming.dev 4 points 8 months ago

It does seem odd that scraping activity from just two accounts allegedly managed to cause such an extended server outage. The irony of this situation also hasn’t been lost on online creatives, who have extensively criticized both companies (and generative AI systems in general) for training their models on masses of online data scraped from their works without consent. Stable Diffusion and Midjourney have both been targeted with several copyright lawsuits, with the latter being accused of creating an artist database for training purposes in December.

As far as I know they do not have copyright over the output of their models. Apart from banning the users they pretty much have no solutions to stop this. Even if they had copyright, it's still legally unknown if training LLMs constitutes a copyright violation.

In a similar fashion a lot of the recent chat llm's have been trained on output from chatgpt. After all why pay humans to produce training data when your competitor has already done it for you.

[-] autotldr 1 points 8 months ago

This is the best summary I could come up with:

In a summary of a business update call on March 6th, Midjourney claimed that “botnet-like activity from paid accounts” — which the company specifically links to Stability AI employees — was behind the outage.

In its summary of the business update call on March 6th (which Midjourney refers to as “office hours”), the company says it’s banning all Stability AI employees from using its service “indefinitely” in response to the outage.

Midjourney is also introducing a new policy that will similarly ban employees of any company that exercises “aggressive automation” or causes outages to the service.

Midjourney founder David Holz responded to Mostaque in the same thread, claiming to have sent him “some information” to help with his internal investigation.

It does seem odd that scraping activity from just two accounts allegedly managed to cause such an extended server outage.

Stable Diffusion and Midjourney have both been targeted with several copyright lawsuits, with the latter being accused of creating an artist database for training purposes in December.

The original article contains 461 words, the summary contains 167 words. Saved 64%. I'm a bot and I'm open source!

this post was submitted on 13 Mar 2024

14 points (100.0% liked)

Machine Learning

478 readers

1 users here now

A community for posting things related to machine learning

Icon base by Lorc under CC BY 3.0 with modifications to add a gradient

founded 1 year ago

MODERATORS

Ategon@programming.dev

Akisamb@programming.dev

ericjmorey@programming.dev