• Die4Ever@retrolemmy.com
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    1 day ago

    Huggingface isn’t customer-facing, it’s developer-facing. Letting customers retrain your LLM sounds like a bad idea for a company like Meta or Microsoft, it’s too risky and could make them look bad. Retraining an LLM for Lovecraft is a totally different scale than retraining an LLM for hundreds of millions of individual customers.

    do you think Microsoft could force their AI on every single Windows computer if it was as challenging as you imply?

    It’s a cloned image, not unique per computer

    • Affidavit@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      1
      ·
      edit-2
      1 day ago

      Hugging Face being developer-facing is completely irrelevant considering the question you asked was whether I was aware of any companies doing anything like this.

      Your concern that companies like Meta and Microsoft are too scared to let users retrain their models is also irrelevant considering both of these companies have already released models so that anyone can retrain or checkpoint merge them i.e. Llama by Meta and Phi by Microsoft.

      It’s a cloned image, not unique per computer

      Microsoft’s Copilot works off a base model, yes, but just an example that LLMs aren’t as CPU intensive as made out to be. Further automated finetuning isn’t out of the realm of possibility either and I fully expect Microsoft to do this in the future.

      • Die4Ever@retrolemmy.com
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        1 day ago

        Your concern that companies like Meta and Microsoft are too scared to let users retrain their models is also irrelevant considering both of these companies have already released models so that anyone can retrain or checkpoint merge them i.e. Llama by Meta and Phi by Microsoft.

        they release them to developers, not automatically retrain them unsupervised in their actual products and put them in the faces of customers to share screenshots of the AI’s failures on social media and give it a bad name

          • Die4Ever@retrolemmy.com
            link
            fedilink
            English
            arrow-up
            2
            ·
            edit-2
            1 day ago

            yea someone could take the model and make their own product with their own PR and public perception

            that’s very different from directly spoonfeeding it as a product to the general public consumers inside of WhatsApp or something

            it’s like saying someone can mod Skyrim to put nude characters in it, that’s very different from Bethesda selling the game with nude characters