In my experience, somehow the strength in natural language / litigious / prosaic work translated negatively in a way to coding. The verbose, prolific way it writes + the investment into dev tooling by Anthropic resulted in Anthropic's models leading the sycophantic-presumptuous-over-confident frontier.
So much so that I still have barely used Sonnet 4.5 thinking.