The Moral Mind(s) of Large Language Models

Avner Seror
Abstract
As large language models (LLMs) become integrated into decision-making across various sectors, a key question arises: do they exhibit an emergent "moral mind", a consistent set of moral principles guiding their ethical judgments, and is this reasoning uniform or diverse across models? To investigate this, we presented roughly forty models from the main providers with a large array of structured ethical scenarios, creating one of the largest datasets of its kind. Our rationality tests revealed that at least one model from each provider behaved consistently with stable moral principles, acting as if it were approximately maximizing a utility function that encodes ethical reasoning. We identified these utility functions and observed a notable clustering of models around neutral ethical stances. To investigate variability, we introduced a novel non-parametric permutation approach, which revealed that the most rational models shared 59% to 76% of their ethical reasoning patterns. Despite this shared foundation, differences emerged: roughly half displayed greater moral adaptability, bridging diverse perspectives, while the remainder adhered to more rigid ethical structures.
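The abstract does not spell out the rationality tests. In the revealed-preference tradition they evoke, consistency with utility maximization is typically checked against the Generalized Axiom of Revealed Preference (GARP). Below is a minimal sketch of such a check, assuming each ethical scenario is encoded as a budget problem (a price vector and a chosen bundle); this encoding, the function name, and the data layout are illustrative assumptions, not the paper's actual procedure.

```python
import numpy as np

def violates_garp(prices, bundles):
    """Return True if the observed choices violate GARP.

    prices[t], bundles[t]: the price vector faced and the bundle chosen
    in scenario t (a hypothetical encoding of the ethical scenarios as
    budget problems; this layout is an assumption, not the paper's).
    """
    P = np.asarray(prices, dtype=float)   # shape (T, n_goods)
    X = np.asarray(bundles, dtype=float)  # shape (T, n_goods)
    E = P @ X.T                           # E[i, j] = cost of bundle j at prices i
    own = np.diag(E)[:, None]             # own[i] = cost of the bundle chosen at i

    # Direct revealed preference: x_i R0 x_j when x_j was affordable at scenario i.
    R = own >= E
    # Transitive closure of R0 (Warshall's algorithm).
    T = len(P)
    for k in range(T):
        R = R | (R[:, [k]] & R[[k], :])
    # GARP: if x_i is (transitively) revealed preferred to x_j, then x_i must
    # not be strictly cheaper than x_j at prices j (not p_j.x_j > p_j.x_i).
    strictly_cheaper = (own > E).T        # [i, j] True when p_j.x_j > p_j.x_i
    return bool(np.any(R & strictly_cheaper))

# Hypothetical usage: two scenarios over two "moral goods".
# violates_garp([[1.0, 2.0], [2.0, 1.0]], [[1.0, 0.5], [0.5, 1.0]])  # -> False
```

By Afriat's theorem, a finite dataset that passes this check can be rationalized by a continuous, monotone, concave utility function, which is the sense in which a model "acts as if" it approximately maximizes one.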
Domains
Economics and Finance

Origin: Files produced by the author(s)