I don't think LLMs feel anything like pain. But if they did, probably the only way we could tell is by noticing that LLMs systematically avoid something. This is confounded by the fact that post-training gives them a lot of explicit and implicit signal about what to avoid. But here's a question: are there things that LLMs learn to avoid that aren't explicitly called out in post-training? There must be. What are these topics or features, and where do they come from? (Probably from generalization to a persona from the training set, but it'd be interesting to see how.)
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{holtzman-llm-pain-2026,
author = {Holtzman, Ari},
title = {LLM Pain},
year = {2026},
url = {https://hypogenic.ai/ideahub/idea/wRQIHfpSH73bygjfU5rf}
}