DatFlash Logo
DatFlashTM tracks real-world dataset transactions and supply signals, normalized for AI decision-making.

Reddit conversation dataset API used for training large language models

Price
$60,000,000 annually
Date
2024
Buyer
Google
Seller
Reddit
Type
DATASET_LICENSING
Region
Global
Market Context
Google agreed to pay roughly $60M annually to access Reddit data for training AI models.
Term
Data access licensin  (Multi-year)
Confidence: High
Citation: DatFlash (2026). "Reddit conversation dataset API used for training large language models"
https://www.datflash.com/transaction/google-reddit-reddit-conversation-dataset-api-used-for-training-large-2024
Download JSON