Reddit conversation dataset API used for training large language models
Price
$60,000,000 annually
Date
2024
Buyer
Google
Seller
Reddit
Type
DATASET_LICENSING
Region
Global
Market Context
Google agreed to pay roughly $60M annually to access Reddit data for training AI models.
Term
Data access licensin
(Multi-year)
Confidence:
High
Citation:
DatFlash (2026).
"Reddit conversation dataset API used for training large language models"
https://www.datflash.com/transaction/google-reddit-reddit-conversation-dataset-api-used-for-training-large-2024
https://www.datflash.com/transaction/google-reddit-reddit-conversation-dataset-api-used-for-training-large-2024