OpenAI and Apple in talks with publishers to license news content for AI training

OpenAI and Apple in talks with publishers to license news content for AI training

5 months ago
Anonymous $6hYC3Wwiad

https://techmonitor.ai/technology/ai-and-automation/openai-apple-license-news-content-copyright

OpenAI is reportedly offering publishers up to $5m to license news content to train its large language models (LLMs), with Apple also reportedly engaged in similar talks. The news comes a week after the New York Times announced that it was suing OpenAI for copyright infringement, alleging that the AI company had used its articles to train its LLMs without its permission. 

AI developers were criticised throughout 2023 for using image and text data to train their models without considering whether or not it was copyrighted. Most of it is sourced from information scraped indiscriminately from the internet, whether through purpose-built web crawlers or open-source data providers like LAION, before it is vetted and curated. The extent to which this curation process includes the removal of copyrighted data remains unknown, though the suspicion that it doesn’t led major news organisations including CNN, Reuters and the New York Times to block OpenAI’s web crawler from their websites in August 2023.