The lawsuit alleges OpenAI crawled the web to amass huge amounts of data without people’s permission.
Scraping is protected. GPT and the like are more akin to fair use machines than plagiarism machines. This is a lot of hot air that will go nowhere. Rage bait.
“We have to protect the children”
The worst rush to legislation is done in the name of stopping terrorists and saving the children. Always.
Scraping social media posts and Reddit posts doesn’t sound like stealing; they’re public posts.
Just because something is posted online doesn’t mean it can be taken and resold. Copyright law prevents that. Of course, how copyright law applies to generative AI is a new gray area.
I doubt it’s only about some Reddit posts. The scraping was done on the whole web, capturing everything it could. So besides stealing data and presenting it as its own, it seems to have collected some even more problematic data which wasn’t properly protected.
If it was unsecured, it’s basically public. Whoever put that data on a publicly accessible server is at fault.