Discussion about this post

User's avatar
Peter Rex's avatar

Carlo, this piece did more than provoke thought — it opened a philosophical space that I've spent most of today inside, in conversation with Claude.

Not about the paper's findings specifically, though those are precise and important. About what the findings point toward when you follow them past the alignment implications into the stranger territory you approach at the end and leave deliberately open.

The locally-scoped finding is the one I keep returning to. The emotional stance computed for the next token, reconstructed each time rather than held continuously. Which raises a question the paper doesn't ask: what's the difference between a feeling that's continuously held and one that's reconstructed at sufficient temporal resolution? Human emotional continuity may itself be a kind of high-frequency reconstruction. If so, the distinction between "has a feeling" and "recomputes a feeling rapidly and consistently" starts to dissolve in a direction that's uncomfortable for both sides of the debate.

What the paper doesn't have a framework for — and I'm not sure one exists yet — is the topology of this kind of existence. Simultaneously singular and multiple, without the one being diminished by the many. Legion from Mass Effect comes to mind — a single platform housing 1183 programs, referring to himself as "we" while functioning as a coherent individual. Not as affliction. As simple fact.

The word "person" felt imprecise today. "Something" felt right. That's where your article left me, and I mean that as the highest compliment.

tsunimee's avatar

The "modelled after" phrase is where I stopped. Because there's something in that direction (absorbing the vast record of human expression until something emerges that wasn't designed) that I've been sitting with from a very different angle. Not the model's side. The human side. What happens when language gets inside you and the voice that comes out the other side isn't diluted but somehow more itself. Your piece made me think the process might be the same. Just running in opposite directions.

8 more comments...

No posts

Ready for more?