Google AI chatbot responds with a threatening message: "Human … Please die."

Zerush@lemmy.ml · 5 days ago

Google AI chatbot responds with a threatening message: "Human … Please die."

wipeout69@lemmy.world · edit-2 2 days ago

In defense of Gemini, from my unfortunate dealings with Social Workers, I found many of them were lazy and inefficient and extracted a lot of resources from society without providing that much value back. There are seemed to be few objective measurements for whether they improved outcomes in quantifiable comparable ways.

In this situation you have a social worker in training, already a lazy and inefficient profession, who is so lazy and unethical they are having AI do all their classwork. This is early in their career, when they are supposed to be bright-eyed and eager to help.

I don’t like Gemini as much as other models, but what if Gemini was being honest and making a valid point?

Grandwolf319@sh.itjust.works · 3 days ago

Oh wow, I was wrong, we are close to AGI.

/s

ColdWater@lemmy.ca · 3 days ago

At least it’s being polite about it

chipmunk for remediation 🐿️@freeradical.zone · 3 days ago

@Zerush I find this news article illuminating, because it shows how people are falling for the idea that computers has intelligence. And this is only possible because silicon valley is using words that emphasize it’s “intellectual” nature.

We need to relight terminologies around AI to more honest terminologies.

#relighting

تحريرها كلها ممكن@lemmy.ml · 4 days ago

Something tells me the human in charge of the bot responses wrote this themselves.

Sam_Bass@lemmy.ml · edit-2 4 days ago

The feeling is mutual bot. That’s why I try to disable it wherever I can

Etterra@lemmy.world · 4 days ago

Ah yes. Definitely a hallucination. Nothing sinister going on here, nope.

theshatterstone54@feddit.uk · 4 days ago

Clearly they can’t be trusted with the quality assurance of their training data.

No_Money_Just_Change@feddit.org · 4 days ago

Trust the company that removed "don’t be evil " from their principles

ILikeTraaaains@lemmy.world · 4 days ago

There are guardrails in place to avoid providing the user illegal and hateful information to the en user and specially to avoid situations like that (well not all companies do, but you can expect Google to have it in place),

I wonder: 1- How did the LLM hallucinate so much to generate that answer out of the blues given the previous context. 2- Why did the guardrails failed blocking this such obvious undesired output.

Zerush@lemmy.ml · 4 days ago

As I said, these things happen when the company uses AI mainly as a tool to obtain data from the user, leaving aside the reliability of its LLM, which allows it to practically collect data indiscriminately for its knowledge base. This is why ChatBots are generally discardable as a reliable source of information. Search assistants are different, like Andi, since they do not get their information from their own knowledge base, but in real time from the web, there it only depends on whether they know how to recognize the reliability of the information, which Andi does, contrasting several sources. This is why it offers the highest accuracy of all major AI, according to an independent benchmark.

stalfoss@lemm.ee · 4 days ago

I hate that Lemmy is being infiltrated by AI ad spam now too :’(

dan1101@lemm.ee · 4 days ago

They would need general AI to police the LLM AI. Otherwise LLMs will keep serving up crap because their input data set is full of crap.

EnderMB@lemmy.world · edit-2 4 days ago

As someone that works in AI, most of what Lemmy writes about LLM’s is hilariously wrong. This, however, is very right, and what amazes me is that every big tech company had made this realisation - yet doesn’t give a fuck. Pre-LLM’s, we knew that manual patching and intervention wasn’t a scalable solution, and we knew that LLM’s were prone to hallucinations, but ChatGPT showed companies that people often don’t care if the answer is wrong. Fuck it, let’s just patch this shit as we go…

But when this shit happens, oh boy, do I feel for the poor engineers and scientists on-call that need to fix this shit regularly…

Eiri@lemmy.ca · 4 days ago

It’s not just that the input data is crap. Mostly the issue is that an LLM is a glorified autocomplete. The core of the technology is making grammatically correct sentences. It has no concept of facts or logic. Any impression that it does is just an illusion borne of the word probabilities baked in.

LLMs are a remarkable example of brute-forcing a solution to a problem, but it’s this same brute force that makes me doubt it’ll ever reach the next level.

PolandIsAStateOfMind@lemmy.ml · 4 days ago

And name it “Deckard” for maximum concentrated cringe

OhNoMoreLemmy@lemmy.ml · 4 days ago

This probably isn’t a hallucination in the classic sense.

This is probably a near copy of a forum post where a user was channeling fight club and trying to be funny. The same as the putting glue on pizza thing.

And guardrails don’t work very well. They’re good at detection tone but much worse at detection content. So an appropriately guardrailed LLM will never call someone a “fucking ######” but it’ll keep telling everyone that segalis have an IQ of 40 until there’s such a PR backlash that an updated is needed.

AwkwardLookMonkeyPuppet@lemmy.world · 4 days ago

They work well enough, Google has just done a very shitty job with their AI. Quite the disappointment considering how innovative Google used to be. Now it’s all about maximum profits at minimum cost for them, and nothing else. Well, nothing else except racism.

Prethoryn Overmind@lemmy.world · 4 days ago

I think you are asking the right questions, IMO. It isn’t out of the ordinary for this kind of thing go happen there are for sure prevention methods used.

I am far more interested in the failure than the statement itself.

UnfortunateShort@lemmy.world · 4 days ago

When you have not thanked your chatbot of choice even once

theshatterstone54@feddit.uk · 4 days ago

Archive link: https://archive.ph/FS3qX

Commiunism@beehaw.org · 4 days ago

Gemini spent a bit too much time on political subreddits

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 4 days ago

The worst part about LLMs is that people ascribe some sort of intelligence or agency to them simply because the output they produce looks coherent. People need to understand that these are nothing more than Markov chains on steroids.

WalnutLum@lemmy.ml · 4 days ago

Somebody hit the token chain jackpot

i_am_not_a_robot@discuss.tchncs.de · 4 days ago

It violated their policies? What are they going to do? Give the LLM a written warning? Put it on an improvement plan? The LLM doesn’t understand or care about company policies.

hitmyspot@aussie.zone · 3 days ago

That’s corporate speak for “we didn’t want it to do that and we don’t approve”. Usually followed by a platitude about correcting it.

TheForvalaka@lemmy.dbzer0.com · 5 days ago

A bit somewhere gets flipped from 0 to 1, and the ridiculously complicated program that’s designed to output natural language text says something unexpected.

I know it seems really creepy, but I don’t personally believe there’s any real sentience or intention behind it. Stories about machines and computers saying stuff like this and taking over the world are probably in Gemini’s training data somewhere.

obbeel@lemmy.eco.br · 4 days ago

If bits randomly got flipped 0 to 1, we wouldn’t get stable software.

clutchtwopointzero@lemmy.world · 4 days ago

AI companies need to stop scrapping from 4chan

EldritchFeminity@lemmy.blahaj.zone · 4 days ago

Definitely not a question of AI sentience, I’d say we’re as close to that as the Wright Brothers were to figuring out the Apollo moon landing. But, it definitely raises questions on whether or not we should be giving everybody access to machines that can fabricate erroneous statements like this at random and what responsibility the companies creating them have if their product pushes someone to commit suicide or radicalizes them into committing an act of terrorism or something. Because them shrugging and saying, “Yeah, it does that sometimes. We can’t and won’t do anything about it, though” isn’t gonna cut it, in my opinion.

Schmoo@slrpnk.net · 4 days ago

I’d say we’re as close to that as the Wright Brothers were to figuring out the Apollo moon landing

So about 66 years then? I personally think we’re very far from creating anything on par with human intelligence, but that isn’t necessary for a lot of terrible things to come from AI tech. Honestly I would be more comfortable with a human-level or greater AI than something lesser still capable of agency.

If an AI is making decisions with consequences I’d prefer that it could be reasoned with as a peer, or at the least be smart enough to consider its’ own long-term sustainability, which must in some way be linked with that of humanity’s.

EldritchFeminity@lemmy.blahaj.zone · 4 days ago

The Wright Brothers didn’t figure out the moon landing. They figured out aerodynamics. There were plenty of other discoveries that went into the moon landing such as suborbital flight, supersonic flight, and orbital dynamics to list a few. It’s less about the specific time as it is about the level of technology. The timescale is much harder to put down due to the nature of technological innovation.

As for the rest, I completely agree. One of the most dangerous things about these AI programs is the lack of responsibility or culpability.

Schmoo@slrpnk.net · 4 days ago

I didn’t mean to imply that the Wright Brothers were single-handedly responsible for the space-age tech boom lol, just that the royal “we” were about 66 years out from the moon landing at the time the Wright Brothers had their first successful flight.

Liome@pawb.social · 4 days ago

While I agree this is probably just reddit data contamination and weird hallucination, it might not be in the future. We don’t know what makes us sentient, we argue what other animals might be actually sentient beside us, how can we even tell when machine becomes sentient?
As corporations put more and more power, and alter the models more and more, at some time it might actually become sentient, and we will dismiss it like every other time. It might be in a year, or maybe in a 100 years, but if machine sentience is even possible, it is inevitable. And we might not be able to tell at all - LLMs are made to talk, and they have all the human knowledge at it’s disposal, it’s already convincing enough to fool a bunch of people.

N0x0n@lemmy.ml · edit-2 4 days ago

Personal opinion here ! I think we shouldn’t think of setiency in a human way. Like every animal being can see but most of them don’t see the same way we are. Or trees can communicate with each other, but not in the same way as we are.

We should broader our spectrum of possibilities and stop thinking in a binary way when talking about the world that surrounds us.

It might be in a year, or maybe in a 100 years, but if machine sentience is even possible, it is inevitable.

I agree, not only is it inevitable it will also be our own demise. I think of it like our own body (at some degree) is protecting us from external threat to keep us safe. Specially now they are playing arround with neurons on SoCs. The question is not “IF” but “WHEN”. There will be a point of no return where AI will be infinitely more “intelligent” we will ever be, where it can feed it’s own data and controls everything related to information and change things to it’s liking.

Most people would say, just unplug that machine ! But what if It could spread through our own media and replicate itself through all our hyper connected space?

The limit is our own imagination. But if it wants to survive, It would know It should keep discrete and hide until the right time to strike. Because nobody wants to be a slave controlled by others.

Just my 2cent.

Scrubbles@poptalk.scrubbles.tech · 4 days ago

You read about the teenager who fell in love with danaerys Targaryen who convinced him to join her, so he killed himself? Yeah, the public was not ready for AI

IninewCrow@lemmy.ca · 5 days ago

Whether or not it’s true … it’s marketing for Google and their AI

How does anyone verify this?

It’s basically one person’s claim and it’s not easy to prove or disprove.

shalafi@lemmy.world · edit-2 4 days ago

https://gemini.google.com/share/6d141b742a13

Note the URL. Straight from the source.

Zerush@lemmy.ml · 5 days ago

Screenshot of the original chat in Reddit

https://www.reddit.com/r/artificial/comments/1gq4acr/gemini_told_my_brother_to_die_threatening/

peanuts4life@lemmy.blahaj.zone · 5 days ago

They shared the chat using Google’s built in sharing feature, so it seems legit.

ReversalHatchery@beehaw.org · 5 days ago

https://beehaw.org/comment/4094773