Is there a 'sounds like AI' direction in the residual stream?

by Ari Holtzman3 months ago
8

LLMs are known to have a lot of linear structure that controls their output styles and semantics. Is sounding like an AI one of those?

If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:

@misc{holtzman-is-there-a-2026,
  author = {Holtzman, Ari},
  title = {Is there a 'sounds like AI' direction in the residual stream?},
  year = {2026},
  url = {https://hypogenic.ai/ideahub/idea/TgNhdzCkjiUs99PjVwuk}
}

Comments (1)

Please sign in to comment on this idea.

Lingze Zhang3 months ago

One potential follow-up question: To identify the 'sounds like AI' vector, is it a good idea to examine the training dynamics of alignment/RLHF?

1