Meta Admits Use of 'Pirated' Book Dataset to Train AI * TorrentFreak
torrentfreak.com
Meta admits in court that it used portions of the Books3 dataset to train its Llama models. This dataset includes many pirated books.
@dumpsterlid@lemmy.world
English
-8
edited
10M

What a bunch of losers, thinking they are making the future…… by stealing from as many artists as they can? How do you convince yourself you are doing the right thing when what you are doing is scaling up the theft of art from small artists to a tech-company-sized operation?

And how much oxygen has been wasted over the years by music companies pushing the narrative that “stealing” from artists by torrenting is wrong? This is so much worse than stealing (and a million times worse than torrenting), though, because the point of the theft is to destroy the livelihood of the artist who was stolen from and to turn their art into a cheap commodity that can be sold as a service, with the artist seeing none of the monetary or cultural reward for their work.

FaceDeer
3
10M

What a bunch of losers, thinking they are making the future…… by stealing from as many artists as they can?

Are you aware of which community this is posted in?

@dumpsterlid@lemmy.world
English
1
10M

I didn’t realize at first, my bad. I realize that makes a lot of my post redundant but I think my point still stands.

So much hypocrisy that a massive corporation can actually steal like this and it is more socially acceptable than torrenting.

And that’s the issue I have in particular. It’s a double standard, and not only that, they’re using it to generate money for their own tools.

It’s not the same as some kid pirating Photoshop to play around with, or a couple who are curious about GoT and want to watch it without paying HBO.

This is a separate issue, and I hate that this place is so Reddit-like that trying to talk about it gets “hurrr dur I guess you’re mad because AI and meta are just the current hate train circle jerk hurrr i form my own opinions hurr”.

Like, no, I’m upset because this is a whole new topic of piracy use.

@j4k3@lemmy.world
English
-1
10M

I’m not upset, because I think it is totally irrelevant: training AI is not reproducing any works, and it is no different from a person who reads or sees said works and then talks about them or creates in their style.

At the core, this amounts to thought policing as the final distilled issue if this is given legal precedent. It would be a massive regression of fundamental human rights with terrible long-term implications. This is no different from how allowing companies to own your data and manipulate you has directly led to a massive regression of human rights over the last 25 years. Reacting like foolish luddites to a massive change that seems novel in the moment will have far-reaching consequences that most people lack the fundamental logic skills to put together in their minds.

In practice, offline AI is like having most of the knowledge of the internet readily available for your own private use in a way that is custom tailored to each individual. I’m actually running large models on my own computer daily. This is not hypothetical, or hyperbole; this is empirical.

deleted by creator

@j4k3@lemmy.world
English
-2
10M

deleted by creator

@Meatballs@mander.xyz
English
1
10M

deleted by creator

deleted by creator

Meta stealing intellectual property and utilizing it for corporate gain is not the same as normal users pirating content. They are so far apart that it warrants its own discussion and cannot be lumped in together.

@Kissaki@feddit.de
English
8
10M

Did you just make a contradictory argument for both sides?

Is your distinction that piracy by individuals gives cultural recognition while that of corporations doesn’t?

If you think piracy is warranted, at the cost of artists/creators, how is a generalized AI that makes it available and more accessible as a cultural abstracted good different?

nevernevermore
3
10M

I’m going to imagine it’s because that cultural abstracted good is then put behind a paywall, which OP will then also pirate, thus fulfilling the prophecy.

@dumpsterlid@lemmy.world
English
0
edited
10M

Because I don’t see a strong argument for piracy coming at a direct, immutable cost to artists. I also don’t see a strong argument that piracy reduces the chance fans will pay for art when the art is made decently easy to purchase and is being sold at a reasonable price. Of course there are complexities to this discussion, but ultimately, when you compare it to massive corporations stealing massive amounts of art wholesale, with the specific intention of undercutting and destroying the value of said art by commodifying it, I think the difference is pretty clear. One of these things is a morally arguable choice by one individual; the other is class warfare by the rich.

Joe Schmo torrents an album from a band they like; maybe they buy the album in the future or go to one of the band’s concerts and buy merch. Joe Schmo hasn’t mined some economic gain out of a band and then moved on; Joe Schmo has become more of a committed fan because they love the album. Meta steals from a band so that it can create an algorithm that produces knockoff versions of the band’s music, which Meta can sell to, say, a company making a commercial that wants music in that style but would prefer not to pay an actual human artist an actual fair price for the music. These are not the same.

(AI doesn’t necessarily create convincing fake songs yet, but you get my point as it applies to other art that AI can create convincing examples of, books and writing being a prime example.)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ
!piracy@lemmy.dbzer0.com
⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.
