News article datasets used for training language models
Price
$1M–$5M annually
Date
2024
Buyer
OpenAI
Seller
Various regional news publishers
Type
DATASET_LICENSING
Region
Global
Market Context
Reports indicate OpenAI offered smaller publishers between $1M and $5M annually for licensing article datasets.
Term
Annual licensing agr
(Multi-year)
Confidence:
Medium-High
Citation:
DatFlash (2026).
"News article datasets used for training language models"
https://www.datflash.com/transaction/openai-various-regional-news-publishers-news-article-datasets-used-for-training-language-models-2024
https://www.datflash.com/transaction/openai-various-regional-news-publishers-news-article-datasets-used-for-training-language-models-2024