Meta Admits Use of ‘Pirated’ Book Dataset to Train AI

@onlinepersona@programming.dev

I do wonder how it shakes out. If the case establishes that a license must be acquired to use copyrighted material, then maybe the license I’m setting on my comments might land commercial AI companies in hot water too, which I’d love. Open-source AI models FTW

CC BY-NC-SA 4.0

@jarfil@beehaw.org

That license would require the AI model to only output content under the same license. Not sure if you realize, but allowing commercial use is part of the Open Source Definition:

https://opensource.org/osd/

Your content would just get filtered out from any training dataset.

As for going after commercial companies… unless you are a lawyer, good luck paying the fees.

Meta Admits Use of 'Pirated' Book Dataset to Train AI * TorrentFreak

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.
