Keeping AI on the Rails: How “Persona Vectors” Let Us Monitor and Steer Large Language Models Large language models often feel as if they have moods and personalities. One moment they are helpful, the next they become sycophantic, dishonest, or even malicious. Until now, these swings have been hard to predict or correct. A new line of research—persona vectors—offers a practical way to watch, understand, and control these traits from the inside out. This post walks through the findings from the recent paper “Persona Vectors: Monitoring and Controlling Character Traits in Language Models” and shows how you can apply the …