FaceDeer

FaceDeer@fedia.io · 18 days ago

Even if you trained the AI yourself from scratch you still can’t be confident you know what the AI is going to say under any given circumstance. LLMs have an inherent unpredictability to them. That’s part of their purpose, they’re not databases or search engines.

if I were to download a pre-trained model from what I thought was a reputable source, but was man-in-the middled and provided with a maliciously trained model

This is a risk for anything you download off the Internet, even source code could be MITMed to give you something with malicious stuff embedded in it. And no, I don’t believe you’d read and comprehend every line of it before you compile and run it. You need to verify checksums

As I said above, the real security comes from the code that’s running the LLM model. If someone wanted to “listen in” on what you say to the AI, they’d need to compromise that code to have it send your inputs to them. The model itself can’t do that. If someone wanted to have the model delete data or mess with your machine, it would be the execution framework of the model that’s doing that, not the model itself. And so forth.

You can probably come up with edge cases that are more difficult to secure, such as a troubleshooting AI whose literal purpose is messing with your system’s settings and whatnot, but that’s why I said “99% of the way there” in my original comment. There’s always edge cases.

FaceDeer@fedia.io · 19 days ago

Ironically, as far as I’m aware it’s based off of research done by some AI decelerationists over on the alignment forum who wanted to show how “unsafe” open models were in the hopes that there’d be regulation imposed to prevent companies from distributing them. They demonstrated that the “refusals” trained into LLMs could be removed with this method, allowing it to answer questions they considered scary.

The open LLM community responded by going “coooool!” And adapting the technique as a general tool for “training” models in various other ways.

FaceDeer@fedia.io · 19 days ago

That would be part of what’s required for them to be “open-weight”.

A plain old binary LLM model is somewhat equivalent to compiled object code, so redistributability is the main thing you can “open” about it compared to a “closed” model.

An LLM model is more malleable than compiled object code, though, as I described above there’s various ways you can mutate an LLM model without needing its “source code.” So it’s not exactly equivalent to compiled object code.

FaceDeer@fedia.io · 19 days ago

Fortunately, LLMs don’t really need to be fully open source to get almost all of the benefits of open source. From a safety and security perspective it’s fine because the model weights don’t really do anything; all of the actual work is done by the framework code that’s running them, and if you can trust that due to it being open source you’re 99% of the way there. The LLM model just sits there transforming the input text into the output text.

From a customization standpoint it’s a little worse, but we’re coming up with a lot of neat tricks for retraining and fine-tuning model weights in powerful ways. The most recent bit development I’ve heard of is abliteration, a technique that lets you isolate a particular “feature” of an LLM and either enhance it or remove it. The first big use of it is to modify various “censored” LLMs to remove their ability to refuse to comply with instructions, so that all those “safe” and “responsible” AIs like Goody-2 can turned into something that’s actually useful. A more fun example is MopeyMule, a LLaMA3 model that has had all of his hope and joy abliterated.

So I’m willing to accept open-weight models as being “nearly as good” as a full-blown open source model. I’d like to see full-blown open source models develop more, sure, but I’m not terribly concerned about having to rely on an open-weight model to make an AI system work for the immediate term.

FaceDeer@fedia.io · edit-2 19 days ago

They’re not claiming it’s AGI, though. You’re missing a broad middle ground between dumb calculators and HAL 9000.

FaceDeer@fedia.io · 19 days ago

It’s similar to my own reaction to the people getting angry about Reddit data being used to train AIs. As someone who’s been commenting rather prolifically on Reddit for 13 years I’m actually quite pleased by the thought that my views and interests are being incorporated into the foundations of modern AI. The only downside is that all those people I argued with over that period are also getting in there. :)

FaceDeer@fedia.io · 19 days ago

The term “AI” has been in use since 1956 to describe a wide variety of computer algorithms and capabilities. Neural nets and large language models fall very firmly under the term’s umbrella.

What you’re talking about is a specific kind of AI, artificial general intelligence (AGI). Very few people believe that an LLM on its own can become AGI and even fewer believes that current LLMs are AGI, so unfortunately you’re jousting with a strawman here.

FaceDeer@fedia.io · 19 days ago

And thus future AIs will have a bias toward having American attitudes because that’s where the data they’re built on comes from. A win for Europe?

FaceDeer@fedia.io · 21 days ago

But when you die and an AI company contacts all your grieving friends and family to offer them access to an AI based on you (for a low, low fee!)

You can stop right there, you’re just imagining a scenario that suits your prejudices. Of all the applications for AI that I can imagine that would be better served by a model that is entirely under my control this would be the top of the list.

With that out of the way the rest of your rhetorical questions are moot.

FaceDeer@fedia.io · 21 days ago

Even with that, being absolutist about this sort of thing is wrong. People undergoing surgery have spent time on heart/lung machines that breathe for them. People sometimes fast for good reasons, or get IV fluids or nutrients provided to them. You don’t see protestors outside of hospitals decrying how humans aren’t meant to be kept alive with such things, though, at least not in most cases (as always there are exceptions, the Terri Schiavo case for example).

If I want to create an AI substitute for myself it is not anyone’s right to tell me I can’t because they don’t think I was meant to do that.

FaceDeer@fedia.io · 22 days ago

I don’t believe humans are “meant” to do anything. We are a result of evolution, not intentional design. So I believe humans should do whatever they personally want to do in a situation like this.

If you have a loved one who does this and you don’t feel comfortable interacting with their AI version, then don’t interact with their AI version. That’s on you. But don’t belittle them for having preferences different from your own. Different people want different things and deal with death in different ways.

FaceDeer@fedia.io · edit-2 22 days ago

If you don’t want to do it then don’t do it. Can we stop trying to tell everyone else they have to have the same values as you?

FaceDeer@fedia.io · 27 days ago

If their goal is to prevent AI trainers from scraping their art then an open federated platform is the opposite of what they want.

FaceDeer@fedia.io · edit-2 27 days ago

It also has an expensive back end and no plans for any kind of monetization, so it’s dead in the water from that side too. The moment they’re successful they’re broke.

FaceDeer@fedia.io · 29 days ago

If they feel less need to add proper alt-text because peoples’ browsers are doing a better job anyway, I don’t see why that’s a problem. The end result is better alt text.

FaceDeer@fedia.io · edit-2 29 days ago

I would expect it’d be not too hard to expand the context fed into the AI from just the pixels to including adjacent text as well. Multimodal AIs can accept both kinds of input. Might as well start with the basics though.

FaceDeer@fedia.io · edit-2 29 days ago

It is true AI, it’s just not AGI. Artificial General Intelligence is the sort of thing you see on Star Trek. AI is a much broader term and it encompasses large language models, as well as even simpler things like pathfinding algorithms or OCR. The term “AI” has been in use for this kind of thing since 1956, it’s not some sudden new marketing buzzword that’s being misapplied. Indeed, it’s the people who are insisting that LLMs are not AI that are attempting to redefine a word that’s already been in use for a very long time.

You can see this when chat bots keep giving the same 2 pieces incorrect information. They have no concept of they are wrong.

Reminds me of the classic quote from Charles Babbage:

“On two occasions I have been asked, – “Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?” … I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question”

How is the chatbot supposed to know that the information it’s been given is wrong?

If you were talking with a human and they thought something was true that wasn’t actually true, do you not count them as an intelligence any more?

FaceDeer@fedia.io · 29 days ago

You’re falling into a no true Scotsman fallacy. There are plenty of uses for recent AI developments, I use them quite frequently myself. Why are those uses not “true” uses?

FaceDeer@fedia.io · 29 days ago

You used an LLM for one of the things it is specifically not good at. Dismissing its overall value on that basis is like complaining that your snowmobile is bad at making its way up and down your basement stairs, and so it is therefore useless.

FaceDeer@fedia.io · 30 days ago

The case isn’t closed until sentencing is done.