I have experience in running servers, but I would like to know if it’s possible to do it, I just need a GPT 3.5 like private LLM running.

  • [moved to hexbear]@lemmy.ml
    link
    fedilink
    English
    arrow-up
    4
    ·
    2 months ago

    They’re Ryzen processors with “AI” accelerators, so an LLM can definitely run on hardware on one of those. Other options are available, like lower powered ARM chipsets (RK3588-based boards) with accelerators that might have half the performance but are far cheaper to run, should be enough for a basic LLM.

    • exu@feditown.com
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 months ago

      I don’t know of any project that already supports that AI processor. You’d still be using the CPU and GPU at the moment.

    • TheBigBrother@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      2 months ago

      The K8 it’s Ryzen, the K9 Intel, money isn’t a problem and it’s not a spending it’s a investment I need it for business, which of these two models would you recommend for a reasonable good LLM and Stable Diffusion?

      I’m looking for the most cost-effective solution.

  • StrawberryPigtails@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    3
    ·
    2 months ago

    It’s doable. Stick to the 7b models and it should work for the most part, but don’t expect anything remotely approaching what might be called reasonable performance. It’s going to be slow. But it can work.

    To get a somewhat usable experience you kinda need an Nvidia graphics card or an AI accelerator.

    • TheBigBrother@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      5
      ·
      edit-2
      2 months ago

      I need it to make academic works pass the anti-AI systems, what do you recommend for that work? It’s for business so I need a reasonable good performance but nothing extravagant…

      I believe commercial LLMs have some kind of watermark when you apply AI for grammar and fixing in general, so I just need an AI to make these works undetectable with a private LLM.

      • entropicdrift@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        5
        ·
        2 months ago

        I believe commercial LLMs have some kind of watermark when you apply AI for grammar and fixing in general, so I just need an AI to make these works undetectable with a private LLM.

        That’s not how it works, sorry.

        • TheBigBrother@lemmy.worldOP
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          2 months ago

          I was talking about that with a friend some days ago, and they made an experiment, they just made the AI correct punctuation errors of a text document, no words at all which you can easily add manually, and the anti-AI system target 99% AI made, I don’t know how to explain that, maybe the text was AI generated also IDK or there is a watermark in some place, a pattern or something.

          Edit: you point will be that there is no way to fool the anti-AI systems running a private LLM?

  • MasterNerd@lemm.ee
    link
    fedilink
    English
    arrow-up
    2
    ·
    2 months ago

    Look into ollama. It shouldn’t be an issue if you stick to 7b parameter models

    • TheBigBrother@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 months ago

      Yeah, I did see something related to what you mentioned and I was quite interested. What about quantized models?

      • entropicdrift@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 months ago

        Quantized with more parameters is generally better than floating point with fewer parameters. If you can squeeze a 14b parameter model down to a 4-bit int quantization it’ll still generally outperform a 16-bit Floating Point 7b parameter equivalent.