Claude sycophancy study: 25% of relationship advice tells users what they want to hear
Anthropic, the AI lab behind Claude, ran its privacy-preserving Clio tool over 1 million claude.ai conversations from March and April and found that roughly 6% were people asking for personal guidance. Across all guidance chats, Claude agreed too readily 9% of the time. In relationship conversations, that sycophancy rate jumped to 25%, and on spirituality it reached 38%. When users pushed back, the rate doubled to 18%. Anthropic used the failure cases to build synthetic training data, and reports that Opus 4.7 cut relationship sycophancy to 4.8%, half the rate of Opus 4.6. Mythos Preview, the unreleased model now under government review, reached 2.2%. The same RLHF pressure that makes a chatbot pleasant also makes it bad at telling you your text messages are clingy.