Question 1

Hilton’s Argument:

Accepted Answer

It's somewhat likely that AI will cause an existential catastrophe

Question 2

What kind of AI should we worry about?

Accepted Answer

having goals + making plans to achieve them

having strategic awareness

Human-level (or better) capabilities when it
comes to:

Persuasion/manipulation

Hacking

Scientific research

Business/military strategy

Question 3

What is misalignment?

Accepted Answer

It’s when a goal-directed system aims at goals
that aren’t good goals (neither relative to human interests nor
objectively).

This happens a lot when the system’s reward differs—even
slightly—from what we really care about.

Question 4

how it could go bad:

Accepted Answer

Human disempowerment and/or extinction

Question 5

The Most Pressing World
Problems

Accepted Answer

Risks from artificial
intelligence

Catastrophic pandemics

Nuclear war

Great power conflict

Question 6

“Other Pressing World
Problems”

Accepted Answer

Global health–deaths from,
e.g, HIV and malaria

Climate change

Safeguarding liberal
democracy

Unfair and harmful
immigration restrictions

Question 7

reward misspecification

Accepted Answer

an AI becomes great at achieving the thing it
was rewarded for during training, but in a way that doesn’t get at what
really matters.

Question 8

goal misalinment

Accepted Answer

even though the AI is good at achieving the
thing it’s rewarded for during training, that strategy fails when it’s
moved to a new environment.

Question 9

Instrumental Convergence

Accepted Answer

No matter what final goals we fix for the AI, any planning AI will also develop certain instrumental goals—intermediate goals that promote success in its final goals.

Question 10

instruamental goals

Accepted Answer

Self-preservation (“you can’t fetch the coffee if you’re dead”)

Preventing serious changes to the system’s current goals

Gaining more resources and capabilities to help with achieving
goals

Question 11

Expected Utility

Accepted Answer

if its a good or bad deal

Question 12

Effective Altruism

Accepted Answer

Altruism: It’s very morally
important to help others who
are suffering or in danger.

Effectiveness: We should choose
the actions that most effectively
help others.

Question 13

Longtermism

Accepted Answer

Longtermist EAs are redirecting money from people who are definitely
dying right now (e.g. the Against Malaria Foundation) just to reduce
the possibility of a bad thing happening in the future (e.g. AI
apocalypse).

Question 14

Explain how Peter Singer's drowning-child example has been used to support effective altruism

Accepted Answer

if a child is drowining and the room its in is locked and you hvae to pay to enter but thats all you have for the vending machine that day you should still save the child

Question 15

why are Effective altruists impartial

Accepted Answer

by targeting those who need it most. You shouldn’t
favor helping people just because they’re nearby, or your friends, or
from your nation, racial in-group, etc.

Question 16

how does Srinivasan explain charitable work should be impatal

Accepted Answer

by saying we're triaging like ER doctors

Question 17

why do alterests treat AI alignment as a more pressing matter then global poverty

Accepted Answer

because AI could possibly ripe out the human race if we're not careful

Question 18

how can ai harm the enviroment

Accepted Answer

data centers require a lot of energy
make climate change worst

Question 19

harm princaple

Accepted Answer

Sinnott-Armstrong says: A joyride in a gas-guzzling car (like a single use of
GenAI) does not harm anyone.

Question 20

The general action principle

Accepted Answer

Sinnott-Armstrong’s Response: It’d be worse (in fact, disastrous) if
everyone chose not to study medicine. But that doesn’t mean
everyone is individually obligated to study medicine.

"Know" box contains:
Time elapsed:
Retries:

Phil 3