• Tiger Jerusalem@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    6 months ago

    Reddit is a trove of user built content under the guise of community. What Spez did was to say “thanks for all the free work, suckers!”, put a price sticker on it, and laughed all the way to the bank.

    And this is why I’m not active on any Internet community anymore. Nevermind, I guess I just can’t help myself…

  • Voyajer@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    This is why I don’t blame anyone for editing/deleting their post history on reddit.

    • FaceDeer@kbin.social
      link
      fedilink
      arrow-up
      0
      arrow-down
      1
      ·
      6 months ago

      I do. It’s frankly selfish. Having an AI get training on my old comments costs me nothing and it results in the development of useful AI tools. Trying to sabotage that is petty and pointless. It’s not like you could somehow collect the fraction of a pittance that you think you’re owed retroactively. I never commented on Reddit thinking “awesome, I’m going to make bank on the content I’m generating here.”

      People complain about the capitalist mindset of the world and then they do this. Sigh.

      • Nurse_Robot@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        6 months ago

        Defending giant corporations profiting off of uncompensated individuals, while criticizing anyone who doesn’t want to provide free labor to said corporations, is a disgusting take. Are you a CEO?

      • Hackerman_uwu@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        6 months ago

        What about people who just think “A.I.” Is dog shit and chat bots are a dumb obsession steering the industry in the wrong direction due to hype and money?

        • FaceDeer@kbin.social
          link
          fedilink
          arrow-up
          0
          ·
          6 months ago

          What about them? I don’t see why they’d care what AI companies are doing in that case. They’d assume they were just wasting money on this stuff.

  • Verserk@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    Considering some of the very wrong and upvoted domain specific knowledge I’ve seen on Reddit over the years I’m not sure the training data is going to be useful for much beyond what every other model can do.

    • 【J】【u】【s】【t】【Z】@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      6 months ago

      The legal advice in /r/legaladvice was some of the worst garbage I’ve ever seen. I have zero doubt numerous had bad outcomes, at best wasting money and time, at worst spending years in jail because of things that sub told them to say and do. Zero doubt.

      • evatronic@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        ·
        6 months ago

        That sub was mostly cops just repeating their own bad interpretation of the law. Terrible.

    • peopleproblems@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      6 months ago

      I can only assume they are training some specific model for something appearing more human like.

      As useless as that will be considering how fucking wildly different we type

    • aStonedSanta@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      ·
      6 months ago

      lol subreddits with troll names like trees vs marijuana enthusiasts. Good fun. John cena has one also but can’t recall which subreddit is actually about John cena though.

  • garibaldi_biscuit@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    This is what the 3rd party access to API was really all about.

    When API access was allowed , all reddit content was effectively free: They needed to ban 3rd party apps so they could sell the accumulated content. I expect using content to train AI also factors into it.

    • bier@feddit.nl
      link
      fedilink
      English
      arrow-up
      0
      ·
      6 months ago

      Is it? Because when you build a bot and just scrape Reddit I don’t think you can just use the content to train AI, just like the New York Times. The API change was definitely to sell more ads and get a higher IPO, but I don’t think it was because of AI.

      • Empricorn@feddit.nl
        link
        fedilink
        English
        arrow-up
        1
        ·
        6 months ago

        Am I crazy or are you arguing the same point? Scraping is not the same as API access. They closed off the API to everyone for dubious reasons so they can sell that content (both for ads and AI training)… Right??

  • NutWrench@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    6 months ago

    Reddit is all bots, porn, ads and political shit posts. Good luck getting any useful training content out of that.

    • PoliticalAgitator@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      6 months ago

      They don’t care if the AI produced is useful, they just want to milk as much money from their content as they can.

      The API changes were almost certainly just the groundwork for this and I called it at the time. The ridiculous pricing model for API access is because it’s aimed at the hottest tech companies, not third party app developers.

      The enshittification continues because it’s what neoliberalism demands. They’ll sell your content and the data they have about you and still show you ads, because that’s the most profitable. Ethics and product quality don’t even enter into it.

      • Ilgaz@lemm.ee
        link
        fedilink
        English
        arrow-up
        0
        ·
        6 months ago

        Liberal market gives end users choice. If they don’t choose, they get the consequences.

        This is more like people choosing Trump like types and complaining. Alternative exists, choose it.

        • PoliticalAgitator@lemmy.world
          link
          fedilink
          English
          arrow-up
          0
          ·
          edit-2
          6 months ago

          “The free market can fix it” is just another neoliberal lie, pushed precisely because it doesn’t work. Rather than holding corporations accountable, it blames the population instead.

          The reality is that boycotting businesses isn’t always an option and when it is, it’s usually a luxury. Very few products are domestically and/or ethically produced and when they are, they’re extremely expensive, especially for people being fucked out of every cent by their bosses, landlords and utilities.

          It’s why the most hated companies in the world continue to bring in record profits.

          Regulations are the real answer, which is why neoliberals oppose them.

          • Ilgaz@lemm.ee
            link
            fedilink
            English
            arrow-up
            1
            ·
            edit-2
            5 months ago

            I really don’t care about people who behave like they are living in North Korea or who wants a North Korean World to live in.

            Even Digg people could say “No, F you” to Digg superstar owners. It is just a damn URL to type.

  • thawed_caveman@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    6 months ago

    I feel like AI companies have been scraping Reddit for their datasets already since the beginning and without permission. In fact, unless there’s been a regulation change that i’m not aware of, i’m not sure why they would have Reddit “sign away” the data when they can just scrape it.

    Also dubious if the current form of AI has a future. They seem like they should revolutionize every sector when you look at their capacities, but in practice their applications might be more limited than we thought?

    Anyway, if Reddit does go public i will be deleting my account within the hour. The only reason i haven’t yet is that i’ve been a moderator of the same subreddit for eight years and it’s the only thing that’s been consistent in my life in that time, i’m kind of attached. The reason i will is i didn’t sign up to create value for shareholders, i signed up to create value for a community.

    • ChunkMcHorkle@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      arrow-down
      1
      ·
      6 months ago

      I feel like AI companies have been scraping Reddit for their datasets already since the beginning and without permission.

      Well yeah, Sam Altman (Open AI) was even on the board of Reddit for a while. It’s a safe bet that they’ve been doing it for years.