YouTube creator sues Nvidia and OpenAI for ‘unjust enrichment’ for using their videos for AI training

vegeta@lemmy.world · 3 months ago

YouTube creator sues Nvidia and OpenAI for ‘unjust enrichment’ for using their videos for AI training

Buffalox@lemmy.world · edit-2 3 months ago

Simply refer to the sources you used

Source: The Internet.

Most things are duplicated thousands of times on the Internet. So stating sources would very quickly become a bigger text than almost any answer from an AI.

But even disregarding that, as an example: Stating that you scraped republican and democrat home sites on a general publicly available site documenting the AI, does not explain which if any was used for answering a political question.

Your proposal sounds simple, but is probably extremely hard to implement in a useful way.

Victoria@lemmy.blahaj.zone · 3 months ago

fundamentally, an llm doesn’t “use” individual sources for any answer. it is just a function approximator, and as such every datapoint influences the result, just more if it closely aligns with the input.