BI 098 Brian Christian: The Alignment Problem

February 18, 2021 01:32:38
BI 098 Brian Christian: The Alignment Problem
Brain Inspired
BI 098 Brian Christian: The Alignment Problem

Feb 18 2021 | 01:32:38

/

Show Notes

Brian and I discuss a range of topics related to his latest book, The Alignment Problem: Machine Learning and Human Values. The alignment problem asks how we can build AI that does what we want it to do, as opposed to building AI that will compromise our own values by accomplishing tasks that may be harmful or dangerous to us. Using some of the stories Brain relates in the book, we talk about:

Links:

Timestamps:
4:22 – Increased work on AI ethics
8:59 – The Alignment Problem overview
12:36 – Stories as important for intelligence
16:50 – What is the alignment problem
17:37 – Who works on the alignment problem?
25:22 – AI ethics degree?
29:03 – Human values
31:33 – AI alignment and evolution
37:10 – Knowing our own values?
46:27 – What have learned about ourselves?
58:51 – Interestingness
1:00:53 – Inverse RL for value alignment
1:04:50 – Current progress
1:10:08 – Developmental psychology
1:17:36 – Models as the danger
1:25:08 – How worried are the experts?

Other Episodes

Episode 0

January 18, 2021 01:25:28
Episode Cover

BI 095 Chris Summerfield and Sam Gershman: Neuro for AI?

It’s generally agreed machine learning and AI provide neuroscience with tools for analysis and theoretical principles to test in brains, but there is less...

Listen

Episode 0

October 25, 2018 00:49:34
Episode Cover

BI 015 Terrence Sejnowski: How to Start a Deep Learning Revolution

Show notes: His new book, The Deep Learning Revolution: His Computational Neurobiology Laboratory at the Salk Institute. His faculty page at UCSD. His first...

Listen

Episode 0

November 17, 2019 01:33:24
Episode Cover

BI 053 Jon Brennan: Linguistics in Minds and Machines

Jon and I discuss understanding the syntax and semantics of language in our brains. He uses linguistic knowledge at the level of sentence and...

Listen