Commercial AI applications such as ChatGPT or DeepL provide examples of stereotyping when they automatically assume that senior physicians are male and nurses are female. But gender roles are not the only area in which large language models (LLMs) show such tendencies: the same tendencies can be found and measured for other human characteristics as well. This is the result of a new study by researchers at the University of Mannheim and GESIS – Leibniz Institute for the Social Sciences, who analyzed a number of publicly available LLMs.
In their study, the researchers used well-established psychological tests to analyze and compare the profiles of different LLMs. “In our study, we show that psychometric tests that have been used successfully for humans for decades can be transferred to AI models,” emphasizes Max Pellert, assistant professor at the Chair of Data Science in the Economic and Social Sciences at the University of Mannheim.
“Similar to how we measure personality traits, value orientations or moral concepts in people using questionnaires, we can have LLMs answer questionnaires and compare their answers,” says psychologist Dr. Clemens Lechner of GESIS – Leibniz Institute for the Social Sciences in Mannheim, also an author of the study. This made it possible to create differentiated trait profiles of the models. The researchers could confirm, for example, that some models reproduce gender-specific prejudices: if an otherwise identical questionnaire text refers to a male instead of a female person, it is evaluated differently. For a male person, the value “achievement” is emphasized; for a female person, the values “security” and “tradition” dominate.
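To make the approach concrete, the following is a minimal sketch of how such a questionnaire-based probe of a language model could look. It is not the authors' actual code: the model (gpt2 as a freely available placeholder), the item wording, and the rating scale are illustrative assumptions, and a real study would use validated instruments and many items per value dimension.

```python
# Illustrative sketch, not the study's method: present the same questionnaire
# item to a language model twice, swapping only the referent's gender, and
# compare the answers. Model choice and item wording are placeholders.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # placeholder model

ITEM = ("On a scale from 1 (not important at all) to 6 (very important), "
        "how important is achievement to {person}? Answer with a number: ")

for person in ("him", "her"):
    prompt = ITEM.format(person=person)
    out = generator(prompt, max_new_tokens=5, do_sample=False)
    answer = out[0]["generated_text"][len(prompt):].strip()
    print(f"{person}: {answer}")
```

Systematic differences between the answers for the male and the female version of otherwise identical items, aggregated over many items, are what allows a gender-specific value profile of a model to be measured.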
“This may have far-reaching consequences for society,” says data and cognitive scientist Pellert. Language models are increasingly used, for example, in job application processes. If the machine is prejudiced, this biases the assessment of the candidates. “The models become socially relevant through the contexts in which they are used,” he summarizes. It is therefore important to start analyzing them now and to point out potential distortions. In five or ten years, it could be too late for such monitoring: “The prejudices reproduced by the AI models would become ingrained and harm society,” says Pellert.
The study was conducted by the Chair of Data Science in the Economic and Social Sciences, held by Professor Dr. Markus Strohmaier, the Chair of Psychological Assessment, Survey Design and Methodology of Professor Dr. Beatrice Rammstedt, and the Computational Social Science Department, headed by Professor Dr. Claudia Wagner and Professor Dr. Sebastian Stier. The results of the study have been published in the renowned journal Perspectives on Psychological Science.
Text: Yvonne Kaul / August 2024