It’s not fair game for for-profit businesses training LLMs. That’s part of why Reddit made the move: so that companies would need to pay Reddit for access to the data to train models legally.
They changed the terms and made the API paid for high-volume use. People using it to train models have already pillaged what they need, and you can get the data from before APIgeddon elsewhere.
You can scrape Lemmy instances for training data without even running an instance.
Yeah, sorry if I’m not great at communicating. That’s exactly what I’m trying to point out when I said:
That’s the thing, anything public is fair game. This is why Reddit is ruining their API.