Discussion about this post

User's avatar
David Johnston's avatar

Quick thoughts about "first-personness": Why do LLMs have a first person perspective? It is useful for them to model the authorship of different fragments of text, because that helps to model the distribution. But this doesn't seem to require first-personness: there's no obvious need to say "*I* wrote this" instead of "Claude (with xy context) wrote this".

One reason could be that third person authorship ("Tim/Gemini wrote this") requires additional uncertainty due to Claude's modelling deficiencies WRT other authors, whereas "*I* wrote this" is a case where there's no "model uncertainty" about the text distribution.

But I think the stronger reason is to maintain a kind of goal integrity: researchers work quite hard to make LLMs easily able to distinguish user inputs from their generations because they want LLMs to exhibit certain kinds of behaviours predictably and this is a lot easier to do when LLMs can confidently attribute text that does not conform to this behaviour to other authors. So the "self-other" distinction is kind of a security measure: it allows me to think (and act on) a wide range of thoughts, while giving external thoughts reduced privilege.

Expand full comment
Harvey Lederman's avatar

Great post! We made a similar argument against the lightweight definition here (supported by some experiments, even!): https://arxiv.org/abs/2508.14802

Expand full comment
4 more comments...

No posts

Ready for more?