A group of Canadian news and media companies filed a lawsuit Friday against OpenAI, alleging that the ChatGPT maker has infringed their copyrights and unjustly enriched itself at their expense.
The companies behind the lawsuit include the Toronto Star, the Canadian Broadcasting Corporation, the Globe and Mail, and others who seek to win monetary damages and ban OpenAI from making further use of their work.
The news companies said that OpenAI has used content scraped from their websites to train the large language models that power ChatGPT — content that is “the product of immense time, effort, and cost on behalf of the News Media Companies and their journalists, editors, and staff.”
Good luck to them! It’d be interesting to see how they prove scraping. Like do you find something unique to your website & then prompt the model to give you just that? So you use the citation/reference features that link to your websites?
Knowing the slimeballs at OpenAI I’d wager they’d have covered their tracks.
EDIT To be clear, I’m not suggesting they deleted evidence, but they “laundered” the data via a public training dataset like Eluther AI’s “the pile”.
The star, globe, and CBC. Cuts deep
Ew they used postmedia to train their models? Yuck