419
We Asked A.I. to Create the Joker. It Generated a Copyrighted Image.
(www.nytimes.com)
This is a most excellent place for technology news and articles.
I think it's much more likely whatever scraping they used to get the training data snatched a screenshot of the movie some random internet user posted somewhere. (To confirm, I typed "joaquin phoenix joker" into Google and this very image was very high up in the image results) And of course not only this one but many many more too.
Now I'm not saying scraping copyrighted material is morally right either, but I'd doubt they'd just feed an entire movie frame by frame (or randomly spaced screenshots from throughout a movie), especially because it would make generating good labels for each frame very difficult.
I just googled "what does joker look like" and it was the fourth hit on image search.
Well, it was actually an article (unrelated to AI) that used the image.
But then I went simpler -- googling "joker" gives you the image (from the IMDb page) as the second hit.