News Google strikes $60m deal with Reddit for AI training data — what you need to know

Feb 23, 2024
But scale isn’t everything, and in some ways Reddit is an imperfect sample for training artificial intelligence when compared to literature or magazines. Grammar is faster and looser, there are a lot of memes and inside jokes, it’s full of information that’s just plain wrong, and its user base is predominantly male.
Not to mention the ad hoc moderation decisions. In conversations between people, that moderation already makes what gets said artificial.

Will Google have access to deleted posts and comments as well? That would be the only saving grace, and even that has its limits.