Schelling Goodness, and Shared Morality as a Goal

· Source: AI Alignment Forum · Field: Science & Research — Artificial Intelligence & Machine Learning, Research Methodology & Innovation, Social Sciences & Behavioral Studies · Depth: Expert, extended

Summary

This essay introduces "Schelling goodness," a concept distinct from first-order moral verdicts. It defines Schelling goodness as a claim about what a diverse population of intelligent beings would converge on when attempting to agree on a moral verdict, using only common knowledge of the question and background knowledge from successful civilizations' survival pressures. The framework emphasizes a "Schelling participation effect," where increased participation and confidence in a modal answer recursively reinforce each other, leading to robust convergence on focal points. The essay applies this to "cosmic Schelling morality," arguing that certain pro tanto moral claims, like "stealing is bad" or "Pareto-positive trade is good," are cosmic Schelling answers due to their scale-invariant adaptiveness and the ease with which diverse intelligences can recognize their benefits for coordination and survival across increasing scales of organization.

Key takeaway

For research scientists developing multi-agent AI systems or considering interstellar communication protocols, understanding Schelling goodness offers a framework for establishing shared norms. You should prioritize identifying "boringly robust" pro tanto moral principles that are scale-invariantly adaptive, as these are most likely to be recognized and adopted by diverse intelligences, fostering trust and coordination while reducing misalignment risks. This approach provides a low-overhead method for self-regulation and integration into complex multi-agent systems.

Key insights

Schelling goodness identifies moral norms that diverse intelligences would converge on through recursive coordination, driven by scale-invariant adaptiveness.

Principles

Method

To identify a cosmic Schelling norm, analyze if one pro tanto moral answer (e.g., "X is good") has a simpler, more recognizable argument for supporting scalable coordination and survival than its opposite, serving as a base case for recursive Schelling convergence.

In practice

Topics

Best for: Research Scientist, AI Researcher, AI Scientist, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by AI Alignment Forum.