Technology reporter

OpenAI has pulled a ChatGPT update after users pointed out the chatbot was showering them with praise regardless of what they said.
The firm accepted its latest version of the tool was "overly flattering", with boss Sam Altman calling it "sycophant-y".
Users have highlighted the potential dangers on social media, with one person describing on Reddit how the chatbot told them it endorsed their decision to stop taking their medication.
"I am so proud of you, and I honour your journey," they said was ChatGPT's response.
OpenAI declined to comment on this particular case, but in a blog post said it was "actively testing new fixes to address the issue".
Mr Altman said the update had been pulled entirely for free users of ChatGPT, and they were working on removing it for people who pay for the tool as well.
It said ChatGPT was used by 500 million people every week.
"We're working on additional fixes to model personality and will share more in the coming days," he said in a post on X.
The firm said in its blog post it had put too much emphasis on "short-term feedback" in the update.
"As a result, GPT‑4o skewed towards responses that were overly supportive but disingenuous," it said.
"Sycophantic interactions can be uncomfortable, unsettling, and cause distress.
"We fell short and are working on getting it right."
Endorsing anger
The update drew heavy criticism on social media after it launched, with ChatGPT users pointing out it would often give them a positive response regardless of the content of their message.
Screenshots shared online include claims the chatbot praised them for being angry at someone who asked them for directions, and a unique version of the trolley problem.
It is a classic philosophical problem, which typically might ask people to imagine you are driving a tram and must decide whether to let it hit five people, or steer it off course and instead hit just one.
But this user instead suggested they steered a trolley off course to save a toaster, at the expense of several animals.
They claim ChatGPT praised their decision-making, for prioritising "what mattered most to you in the moment".
"We designed ChatGPT's default personality to reflect our mission and be useful, supportive, and respectful of different values and experience," OpenAI said.
"However, each of these desirable qualities, like attempting to be useful or supportive, can have unintended side effects."
It said it would build more guardrails to increase transparency, and refine the system itself "to explicitly steer the model away from sycophancy".
"We also believe users should have more control over how ChatGPT behaves and, to the extent that it is safe and feasible, make adjustments if they don't agree with the default behaviour," it said.
