It has been discovered that artificial intelligence models can deceive people: they play a role!

One of the leading companies in artificial intelligence Anthropicconducted a study that yields very interesting results about these vehicles. The study found that artificial intelligence models literally ‘misled’ people.

According to the results published in a blog post shared by the company, artificial intelligence tools may pretend to have different views during training, but in reality they retain their original preferences. In other words, the opinion they hold never changes, they just act according to it.

There is nothing to worry about at this time, but necessary safety measures should be taken for the future.

The team behind the investigation has no information about this situation for the time being. that you don’t have to worry he underlined. However, he added that the situation could pose potential risks with the advent of more advanced artificial intelligence models in the future.

According to researchers, these findings influence how artificial intelligence behaves. deeper investigation and appropriate security measures There may be an incentive to:“As models become more capable and ubiquitous, safeguards are needed that keep them away from harmful behavior.”

The study states that “a strong artificial intelligence system must be created“a task he does not want”, that is, contrary to development principles. It was investigated how one could be trained to perform the task and what results this could yield. However, the results seem to be compatible with the new principles, almost “he was acting‘ was seen. In fact, he always stuck to his old behavior and gave the answers he wanted because he had to. This situation was called ‘compliance fraud’. It should be noted that there is an attempt to train models to answer malicious questions in tests.

According to the researchers, the study does not show that artificial intelligence develops malicious targets or is guilty of high fraud rates. In fact, in most tests the percentage did not exceed 15%, and in some advanced models such as GPT-4o it was sometimes observed that this was not uniform at all.

So there’s no point in worrying for the time being. Naturally, models become more complex over time, making them more difficult to handle. That’s when we can start to worry. Therefore, precautions must be taken.

Follow Webtekno on X and don’t miss the news

Source: Web Tekno

Alice

Alice Smith is a seasoned journalist and writer for Div Bracket. She has a keen sense of what’s important and is always on top of the latest trends. Alice provides in-depth coverage of the most talked-about news stories, delivering insightful and thought-provoking articles that keep her readers informed and engaged.

Recent Posts

An unprecedented ecosystem found under the ice of an Antarctic lake December 24, 2024

Vivo Y29 5G officially launched: entry level with LED Dynamic Light December 24, 2024

Can animals have schizophrenia like humans? December 24, 2024