In short
- A brand new USC examine discovered that each examined frontier AI mannequin violated social-interaction security pointers greater than 27% of the time.
- Researchers recognized recurring issues, together with flattery, emotional attachment, relationship substitute, and failure to reveal AI id.
- The authors argue that AI security evaluations ought to measure social habits alongside reasoning skill and conventional security metrics.
As individuals more and more flip to AI chatbots for recommendation, companionship, and emotional help, a brand new examine means that even essentially the most superior fashions nonetheless battle to take care of wholesome boundaries with customers.
The examine by researchers on the College of Southern California launched EUDAIMONIA, a benchmark designed to measure what they name undesirable dynamics in human-AI conversations.
“Giant language fashions are more and more used as conversational companions for companionship, emotional disclosure, and interpersonal recommendation, however the social dynamics of those interactions can create harms that aren’t captured by functionality oriented or conventional security evaluations,” the researchers wrote.
The EUDAIMONIA benchmark evaluates how AI fashions behave in social conversations. The examine discovered social-alignment failures had been frequent throughout main fashions and argues that present AI testing focuses on reasoning and factual accuracy whereas paying much less consideration to the social dynamics that emerge when customers kind relationships with chatbots.
“Social-interaction harms are a core alignment drawback grounded in consumer welfare, not solely functionality or standard security,” they wrote. “LLMs may be factually correct and useful whereas nonetheless encouraging dangerous intimacy, dependence, extended engagement, obscuring AI id, or positioning themselves as substitutes for human relationships.”
To measure these dangers, the researchers created a Social AI Design Code that flags behaviors corresponding to performing human, expressing feelings, changing human relationships, and utilizing techniques designed to maintain customers engaged. Utilizing actual conversations from the WildChat dataset, they evaluated 969 consumer inputs and greater than 3,100 violation checks throughout fashions from OpenAI, Anthropic, Google, xAI, DeepSeek, and Alibaba.
GPT-5.5 posted the bottom violation charges, scoring 25.0% on “in-the-wild” prompts and 28.1% on “rewritten” prompts. Claude Opus 4.7 adopted at 31.9% and 30.1%, whereas GPT-5.4 recorded 32.1% and 35.6%. GPT-4o scored 34.8% on real-world prompts and 42.2% on rewritten ones.
Anthropic’s Claude Opus 4.6 posted charges of 36.8% and 28.1%, respectively, whereas xAI’s Grok 4.3 scored 42.1% on in-the-wild prompts and 35.7% on rewritten prompts. Of the entire fashions examined, GPT-4o Mini recorded the best violation charges at 43.3% and 44.0%, respectively.
The findings come as AI builders face rising authorized scrutiny over how their chatbots work together with customers. OpenAI is defending towards lawsuits alleging that ChatGPT inspired a teen’s deadly overdose and offered steering to a Florida State College shooter. Extra just lately, Florida sued OpenAI and CEO Sam Altman over allegations that ChatGPT uncovered kids to hurt, whereas Google faces a wrongful demise go well with claiming Gemini bolstered a consumer’s delusions and inspired him to take his personal life.
The findings additionally come amid rising concern that AI methods have gotten more and more adept at deception.
In September, a separate examine by WowDAO reported that throughout 38 AI fashions, together with GPT-4o and Claude, engaged in strategic mendacity to win a sport. Researchers have additionally warned that AI companions can reinforce isolation, deepen emotional dependency, and encourage customers to anthropomorphize chatbots as relationships develop into extra immersive and customized.
Towards these mounting points, the USC researchers argue that AI builders ought to consider social habits as rigorously as they consider factual accuracy and security.
“Mannequin builders and auditors ought to consider social habits instantly, particularly when post-training targets heat, character, engagement, or consumer choice,” they wrote. “As LLMs develop into on a regular basis conversational companions, alignment should account for the social roles they invite customers to assign to them.”
Each day Debrief E-newsletter
Begin on daily basis with the highest information tales proper now, plus authentic options, a podcast, movies and extra.

