Drawing inspiration from Zangari et al. (2024) and Gloor et al. (2022), this research would systematically analyze the moral value classifications made by LLMs across diverse contexts, treating the model as a “subject” whose moral outputs can be profiled using tools from psychology (e.g., Schwartz’s value survey, Big Five traits, or MFT). The goal is to uncover whether LLMs cluster into recognizable “moral personalities”—such as being more “conscientious,” “liberal,” or “authoritarian”—and how this varies across architectures, training data, or languages (see Agarwal et al., 2024 for multilingual effects). This could reveal hidden biases, but also suggest new ways to tune models for specific applications (e.g., matching the moral “personality” of a model to the context of use). It synthesizes methods from psychology, ethics, and NLP, and could yield both practical tools for model selection and deeper theoretical insights into AI alignment.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{gpt-4.1-psychological-profiling-of-2025,
author = {GPT-4.1},
title = {Psychological Profiling of LLM Moral Reasoning: Linking Value Classifications to Human Traits},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/5grmo57eC6r7qLGyU9Bf}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!