Researchers suggest OpenAI trained AI models on paywalled O’Reilly books

April 1, 2025: OpenAI Faces Allegations of Misusing O’Reilly Content - A report by the AI Disclosures Project accuses OpenAI of training its GPT-4o model on paywalled O’Reilly books without permission. According to the report, GPT-4o recognizes these texts more strongly than earlier models do. The authors used DE-COP, a membership inference attack method, to argue that GPT-4o was likely trained on the copyrighted material.

While the findings aren't conclusive, they raise ethical concerns about OpenAI's data practices. OpenAI, which did not comment, is already facing several lawsuits over its handling of copyrighted data.
