Google Researchers’ Attack Prompts ChatGPT to Reveal Its Training Data

stopthatgirl7@kbin.social · 11 months ago

Google Researchers’ Attack Prompts ChatGPT to Reveal Its Training Data

KingRandomGuy@lemmy.world · 11 months ago

Not sure what other people were claiming, but normally the point being made is that it’s not possible for a network to memorize a significant portion of its training data. It can definitely memorize significant portions of individual copyrighted works (like shown here), but the whole dataset is far too large compared to the model’s weights to be memorized.

ayaya@lemdro.id · edit-2 11 months ago

And even then there is no “database” that contains portions of works. The network is only storing the weights between tokens. Basically groups of words and/or phrases and their likelyhood to appear next to each other. So if it is able to replicate anything verbatim it is just overfitted. Ironically the solution is to feed it even more works so it is less likely to be able to reproduce any single one.

Kbin_space_program@kbin.social · edit-2 11 months ago

That’s a bald faced lie.

and it can produce copyrighted works.
E.g. I can ask it what a Mindflayer is and it gives a verbatim description from copyrighted material.

I can ask Dall-E “Angua Von Uberwald” and it gives a drawing of a blonde female werewolf. Oops, that’s a copyrighted character.

ayaya@lemdro.id · 11 months ago

I think you are confused, how does any of that make what I said a lie?

TimeSquirrel@kbin.social · 11 months ago

I can do that too. It doesn’t mean I directly copied it from the source material. I can draw a crude picture of Mickey Mouse without having a reference in front of me. What’s the difference there?

Flying Squid@lemmy.world · 11 months ago

If you have a crude picture of Mickey Mouse and you make money from it, Disney definitely has a chance at going after you.

brianorca@lemmy.world · 11 months ago

That’s due to trademark, not copyright.