Home » AI Glossary

AI Alignment

What is AI Alignment? The research field of ensuring AI behavior aligns with human intentions and values. Misaligned AI might 'follow instruct — Judy AI Lab AI Glossary

core intermediate

What is AI Alignment?

The research field of ensuring AI behavior aligns with human intentions and values. Misaligned AI might ‘follow instructions but do the wrong thing’ — like being asked to increase website traffic and launching a DDoS attack. RLHF and Constitutional AI are current mainstream alignment methods.

What is AI Alignment?#

Related Terms

Get new posts by email:

What is AI Alignment?