News

Harmful, abusive interactions plague AI chatbots. Researchers have found that AI companions like Character.AI, Nomi, and ...
Testing has shown that the chatbot shows a “pattern of apparent distress” when it is being asked to generate harmful content ...
Anthropic has given its chatbot, Claude, the ability to end conversations it deems harmful. You likely won't encounter the ...
Notably, Anthropic is also offering two different takes on the feature through Claude Code. First, there's an "Explanatory" ...
Chatbots’ memory functions have been the subject of online debate in recent weeks, as ChatGPT has been both lauded and ...
Amazon.com-backed Anthropic said on Tuesday it will offer its Claude AI model to the U.S. government for $1, joining a ...
According to the company, this only happens in particularly serious or concerning situations. For example, Claude may choose ...
Claude AI can now withdraw from conversations to defend itself, signalling a move where safeguarding the model becomes ...
However, Anthropic also backtracks on its blanket ban on generating all types of lobbying or campaign content to allow for ...
Anthropic says the conversations make Claude show ‘apparent distress.’ ...
Claude Opus 4 and 4.1 AI models can now end harmful conversations with users unilaterally, as per an Anthropic announcement.
Anthropic have given the ability to end potentially harmful or dangerous conversations with users to Claude, its AI chatbot.