Definition
Also known as: AI alignment, AI risk, responsible AI
The field of research and practice focused on ensuring that artificial intelligence systems behave as intended and do not cause unintended harm. It encompasses alignment (ensuring AI goals match human values), robustness (maintaining reliability under unexpected conditions), interpretability (understanding how AI systems make decisions), and governance (institutional frameworks for responsible development and deployment).
THE LONG VIEW Glossary