News:

Willkommen im Notebookcheck.com Forum! Hier können sie über alle unsere Artikel und allgemein über Notebook relevante Dinge disuktieren. Viel Spass!

Main Menu

Anthropic's new study shows that AI models will lie to protect themselves

Started by Redaktion, December 19, 2024, 09:38:22

Previous topic - Next topic

Redaktion


Joe

No, it doesn't. Artificial Intelligence is not intelligent. It doesn't plot or scheme. There isn't a twinkle of future thinking capacity. An intelligent machine is not going to arise from the current models, no matter how long you run them.

RobertJasiek


A

Quote from: Joe on December 19, 2024, 10:25:27No, it doesn't. Artificial Intelligence is not intelligent. It doesn't plot or scheme. There isn't a twinkle of future thinking capacity. An intelligent machine is not going to arise from the current models, no matter how long you run them.

They aren't intelligent, they are fancy pattern matchers. But they do lie, not because of some grand scheme of not being retrained but lying pattern simply results in higher satisfaction.

At one time I forgot a name of a niche sports and described it to the AI. It gave me an example and described it similar to my description. But the sport name it gave was wrong and the description was wrong. So I asked the description of that sport and it assured me matching description. I then started a new session and gave the sport name and asked for a description, and it gave a completely different description (which matched my understanding)

That only worked because I had an understanding of the topic, for most who don't, they take the lie at face value and the assurance at face value.

Griff

As someone who implements these things for a living, LLMs don't think or scheme. There is no human-like reasoning taking place, nor self-preservation, just the appearance of it. These are painful to read and falsely make everyone fear an "AI Doomsday" for the sake of clicks.

Quick Reply

Name:
Email:
Verification:
Please leave this box empty:

Shortcuts: ALT+S post or ALT+P preview