News

Harmful, abusive interactions plague AI chatbots. Researchers have found that AI companions like Character.AI, Nomi, and ...
Testing has shown that the chatbot shows a “pattern of apparent distress” when it is being asked to generate harmful content ...
Amazon.com-backed Anthropic said on Tuesday it will offer its Claude AI model to the U.S. government for $1, joining a ...
Anthropic has given its chatbot, Claude, the ability to end conversations it deems harmful. You likely won't encounter the ...
Notably, Anthropic is also offering two different takes on the feature through Claude Code. First, there's an "Explanatory" ...
Claude AI can now withdraw from conversations to defend itself, signalling a move where safeguarding the model becomes ...
The company has given its AI chatbot the ability to end toxic conversations as part of its broader 'model welfare' initiative ...
Mental health experts say cases of people forming delusional beliefs after hours with AI chatbots are concerning and offer ...
Claude Opus 4 and 4.1 AI models can now end harmful conversations with users unilaterally, as per an Anthropic announcement.
According to the company, this only happens in particularly serious or concerning situations. For example, Claude may choose ...
Anthropic says the conversations make Claude show ‘apparent distress.’ ...
However, Anthropic also backtracks on its blanket ban on generating all types of lobbying or campaign content to allow for ...