LLM experiment showed how a model’s “personality” emerges in latent space

Q: What is the source?

Originally published on Habr AI. Hamidun News processes and adapts the material with AI.

Q: When was it published?

Apr 30, 2026. Reading time: 3 min.

An analysis of an experiment with a modular LLM has been published, in which the meaning and style of a response are split into different latent…

Hamidun News Editorial

AI monitoring · Habr AI

Apr 30, 2026· 2 min

AI-processed from Habr AI; edited by Hamidun News

LLM experiment showed how a model’s “personality” emerges in latent space — Source: Habr AI. Collage: Hamidun News.

◐ Listen to article

In a new report on an experiment with a modular LLM architecture, the author demonstrates that a single latent vector can store not only style, but also stable characteristics of how information is presented. This layer is considered a foundation for what could be called a model's "personality embedding."

The Problem of Averaging

In a classical autoregressive model, the next generation step is a probability distribution over the vocabulary. This allows the same utterance to have many acceptable continuations, and temperature only changes how selection works within an already-learned distribution. In the modular scheme described by the author, the situation is different: the core must output a single next latent vector, which is then decoded back into text.

Because of this, several possible response variants must be compressed into one compact representation. In practice, this leads to averaging. The model begins choosing not a bright and specific variant, but an "average" continuation, where intonation, character, and manner of explanation are smoothed out.

This is where the idea of separating content from presentation emerges. If semantics determine what the model says, then a separate style latent should determine how exactly it's said: dryly or vividly, confidently or cautiously, step by step or in free flow.

How the Experiment Was Structured

To test the hypothesis, the author trained a model on texts from real users so that it would extract a compact vector describing not meaning, but stable speech patterns. This vector can then be fed into the main model through cross-attention. During training, style comes from the target response, and during inference it can be set separately.

Essentially, instead of a single temperature knob, a set of more precise behavior control mechanisms appears. The researcher specifically emphasizes that the task wasn't about recognizing a specific author. The goal was different: to get a smooth feature space where texts from people with similar speech patterns end up nearby, even if they write on different topics.

formality versus conversationality
confidence versus caution
structure versus spontaneity
"engineering" versus more humanistic presentation
neutral versus emotionally tinted tone

What the Metrics Showed

According to the author, on a synthetic benchmark, the model already confidently distinguishes individual style contrasts. Formality versus conversationality is determined with balanced accuracy 0.93, confidence versus uncertainty — 0.

94, empathetic versus cold presentation — 0.93, and free exposition versus step-by-step — 0.92.

In a mixed mode where there are many factors at once, results are expectedly lower, but still meaningful: lexical manner and semantic bias are maintained at 0.85 and 0.84, age-related features — 0.

72, empathy — 0.73, structure — 0.70.

Particularly interesting is that the feature space doesn't collapse when combining several style shifts. The average correspondence between predicted and actual composition of such shifts, according to the author, reached cos = 0.97.

This means the model can simultaneously move, for example, toward a more formal, more confident, and more technical response. However, the work is not yet complete: the latent still has noticeable bias in text length, and social signals like age, gender, or profession look more like a probabilistic profile than reliable recognition.

"Style truly lives in the latent."

What This Means

For product teams, this looks like a transition from crude temperature adjustment to more precise control over response manner: formal, gentle, engineering-focused, explanatory. If the approach scales, LLMs will be able to not just generate text, but stably maintain a given communication character without copying specific author content. And that's exactly what makes the "personality embedding" idea not a metaphor, but a fully workable engineering hypothesis.

Hamidun News

AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Telegram channel RSS hamidun.com

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

🎓 Academy — 7 days free Free consultation