Question 1

_____ is used for general relationship analyses

Accepted Answer

OLS

Question 2

_____ and _____ are used for regressions with binary outcomes

Accepted Answer

Probit, logit

Question 3

_____ is used for regressions with censored data

Accepted Answer

Tobit

Question 4

_____ is used for regressions with count data

Accepted Answer

Poisson

Question 5

In probit and logit regressions, we are trying to find the _____ that the dependent variable equals _____

Accepted Answer

Probability, 1

Question 6

_____ _____ and _____ _____ are used for regressions with categorical, ordered dependent variables

Accepted Answer

Ordered probit, ordered logit

Question 7

We need _____ dummy variables in an ordered regression, since we always exclude the _____ category

Accepted Answer

K-1, baseline

Question 8

Probit, logit, and Poisson use _____ _____ to quantify the magnitude of the changes

Accepted Answer

Marginal effects

Question 9

A variable that affects the dependent variable only through an endogenous independent variable

Accepted Answer

Instrumental variable

Question 10

_____: an independent variable that is correlated with the error term
_____: an independent variable that is not correlated with the error term

Accepted Answer

Endogenous
Exogenous

Question 11

If count data are overdispersed (variance well above the mean), it is better to use a _____ _____ model instead of a Poisson model

Accepted Answer

Negative binomial
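
A quick way to see what "dispersed" means here: a Poisson model assumes the variance of the counts equals their mean, so a variance well above the mean points toward negative binomial. A minimal sketch in Python, assuming made-up counts; the data and variable names are illustrative only.

```python
from statistics import mean, variance

# Hypothetical count data (e.g., doctor visits per person); values are made up.
counts = [0, 0, 1, 0, 2, 0, 9, 1, 0, 12, 0, 3]

m = mean(counts)
v = variance(counts)  # sample variance

# Poisson assumes variance == mean; a ratio well above 1 signals overdispersion.
dispersion_ratio = v / m
overdispersed = dispersion_ratio > 1
print(f"mean={m:.2f}, variance={v:.2f}, ratio={dispersion_ratio:.2f}")
```

This is only a rough descriptive check on the raw counts, not a formal test.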

Question 12

_____ _____ are used to account for characteristics specific to each entity or time period and how they affect the dependent variable

Accepted Answer

Fixed effects

Question 13

When regressing with a binary or ordered dependent variable, we don't need a _____ _____ of _____ _____. We instead need to estimate the _____ of each outcome

Accepted Answer

Predicted line, continuous values, probability

Question 14

Marginal effects are computed after probit, logit, and Poisson regression to act as a measure of _____, and they are measured in _____

Accepted Answer

Magnitude, percentages

Question 15

In probit, logit, and Poisson regressions, the coefficients tell us _____ about the magnitude at which the independent variable affects the dependent variable. The _____ and _____ are what we care about

Accepted Answer

Nothing, sign, significance
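
Why the raw coefficient is not the magnitude: in a logit model the effect of X on the probability depends on where you evaluate it. A minimal sketch in Python, assuming a one-regressor logit with hypothetical coefficients `b0` and `b1` (not taken from any real estimation).

```python
import math

def logit_prob(b0, b1, x):
    """P(y = 1 | x) under a logit model with intercept b0 and slope b1."""
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))

def logit_marginal_effect(b0, b1, x):
    """dP/dx at x: the coefficient scaled by p * (1 - p)."""
    p = logit_prob(b0, b1, x)
    return b1 * p * (1.0 - p)

# Hypothetical coefficients, chosen only for illustration.
b0, b1 = -2.0, 0.8

me_at_0 = logit_marginal_effect(b0, b1, 0.0)    # p is far from 0.5, small effect
me_at_2_5 = logit_marginal_effect(b0, b1, 2.5)  # b0 + b1*x = 0, so p = 0.5
print(me_at_0, me_at_2_5)
```

The same coefficient gives different marginal effects at different X values, but the sign always matches the sign of `b1`. In Stata, `margins, dydx(*)` after the regression reports average marginal effects.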

Question 16

Greek letter denoting year fixed effects

Accepted Answer

γ

Question 17

Subscript for entity fixed effects in regression models

Accepted Answer

i

Question 18

Subscript for year fixed effects in regression models

Accepted Answer

t

Question 19

Greek letter denoting the error term in regression models

Accepted Answer

ε

Question 20

Greek letter denoting the second stage coefficients in regression models

Accepted Answer

α

Question 21

Greek letter denoting the first stage coefficients in regression models

Accepted Answer

β

Question 22

In difference in differences regression:
The _____ group is the group that receives the program or policy
The _____ group is the group that does not receive the program or policy

Accepted Answer

Treatment
Control

Question 23

In difference in differences regression, any difference in outcome between the two groups can be interpreted as a _____ _____

Accepted Answer

Causal effect

Question 24

In difference in differences regression, we need _____ _____, meaning that in the absence of treatment, we would expect no _____ in the trends of the _____

Accepted Answer

Parallel trends, difference, groups

Question 25

Main question of an interaction term:
Is the gap between the groups _____ _____ for both categories?

Accepted Answer

The same

Question 26

Stata will automatically drop a variable if it is _____ _____ with another variable

Accepted Answer

Perfectly collinear

Question 27

_____-_____: a one unit change in X leads to a 100*Beta percent change in Y
_____-_____: a one percent change in X leads to a Beta percent change in Y
_____-_____: a one percent change in X leads to a Beta/100 unit change in Y

Accepted Answer

Log-linear
Log-log
Linear-log
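
The three interpretations can be checked with simple arithmetic. A sketch in Python, assuming a hypothetical coefficient `beta = 0.05`; these are the usual approximations, which work best for small changes.

```python
# Hypothetical coefficient, for illustration only.
beta = 0.05

# Log-linear: ln(Y) = b0 + beta * X
# A 1-unit change in X is roughly a 100*beta percent change in Y.
log_linear_pct_change_y = 100 * beta

# Log-log: ln(Y) = b0 + beta * ln(X)
# A 1 percent change in X is roughly a beta percent change in Y (an elasticity).
log_log_pct_change_y = beta

# Linear-log: Y = b0 + beta * ln(X)
# A 1 percent change in X is roughly a beta/100 unit change in Y.
linear_log_unit_change_y = beta / 100

print(log_linear_pct_change_y, log_log_pct_change_y, linear_log_unit_change_y)
```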

Question 28

These are the two criteria for creating a valid instrumental variable

Accepted Answer

1. Instrument relevance (the instrumental variable needs to be correlated with the endogenous variable)
2. Instrument exogeneity (the instrumental variable needs to be uncorrelated with the error term, so it affects the dependent variable only through the endogenous variable)

Question 29

We add this to the endogenous variable to indicate that it has been taken from the first stage regression equation and is now placed into the second stage

Accepted Answer

Hat symbol ^

Question 30

Stata code to regress using two stage least squares:

_____ _____ _____ (_____ = _____) _____, _____

Accepted Answer

ivregress 2sls dependent_var (endogenous = instrumental) independent_vars, robust
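
To see what the two stages do, here is a minimal by-hand sketch in Python with one endogenous variable, one instrument, and no other controls. The data are made up so that the true effect of x on y is 2, and the helper `ols_slope_intercept` is a hypothetical name introduced for this example.

```python
def ols_slope_intercept(x, y):
    """Simple one-regressor OLS: returns (intercept, slope)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sxy / sxx
    return my - slope * mx, slope

# Made-up data: u is an unobserved confounder, z is the instrument.
z = [0, 0, 1, 1]
u = [-1, 1, -1, 1]                             # uncorrelated with z by construction
x = [zi + ui for zi, ui in zip(z, u)]          # endogenous: depends on u
y = [2 * xi + 3 * ui for xi, ui in zip(x, u)]  # true effect of x on y is 2

# Naive OLS is biased because x is correlated with the error (3 * u).
_, beta_ols = ols_slope_intercept(x, y)

# Stage 1: regress the endogenous variable on the instrument, keep fitted values.
a1, b1 = ols_slope_intercept(z, x)
x_hat = [a1 + b1 * zi for zi in z]

# Stage 2: regress the outcome on the fitted values (the "hat" variable).
_, beta_2sls = ols_slope_intercept(x_hat, y)

print(beta_ols, beta_2sls)
```

Running the stages by hand recovers the coefficient, but the standard errors from a manual second stage are not correct, which is one reason to use `ivregress` in practice.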

Question 31

When we have a categorical independent variable, we interpret coefficients relative to the _____ _____, and we begin interpretations with "_____ _____"

Accepted Answer

Base group, on average

Question 32

Testparm F-tests are used to test for _____ _____ (a _____ _____ impact on dependent for all groups at once)

Accepted Answer

Joint significance, jointly significant

Question 33

F-tests with var1 = var2 are used to test for the _____ _____ (both variables have _____ _____ impacts on the dependent variable)

Accepted Answer

Same impact, statistically similar

Question 34

In a first stage regression equation, the _____ variable is used as the dependent variable, and the _____ variable is used as another independent variable, with other independent variables also present

Accepted Answer

Endogenous, instrumental

Question 35

This condition is used to check whether an instrumental variable is strong enough (relevant)

Accepted Answer

Valid if the first-stage F > 10

Question 36

List 4 situations where we might log-transform a variable

Accepted Answer

1. Dollar denominated
2. Variables span a large range (population, income)
3. Data is right-skewed with outliers
4. Interpreting based on percentage changes makes more sense

Question 37

A term included in polynomial regressions that indicates a convex/concave U shape in the relationship

Accepted Answer

Squared term

Question 38

The purpose of the Ramsey RESET test

Accepted Answer

Check whether the model suffers from omitted polynomial terms

Question 39

The number of _____ variables can't be larger than the number of _____ in instrumental regression

Accepted Answer

Endogenous, instruments

Question 40

A variable is endogenous if there is _____ _____ between that variable and the dependent variable

Accepted Answer

Reverse causality

Question 41

_____ _____ is the Stata command to test for endogeneity
_____ _____, _____ is the Stata command to test for instrument validity

Accepted Answer

estat endogenous
estat firststage, all

Question 42

The classic difference in differences regression includes an _____ _____ between the treatment group and the time after policy implementation

Accepted Answer

Interaction term

Question 43

Difference in differences measures _____ _____ in the treatment group relative to the control group. By subtracting D-B from C-A, we remove the _____ _____ and isolate the _____ _____

Accepted Answer

Additional changes, common trend, treatment effect
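
The subtraction can be worked through with four group means. A sketch in Python with made-up numbers; the layout of the labels (A and C for the treatment group before and after, B and D for the control group before and after) is an assumption chosen to match "subtracting D-B from C-A".

```python
# Hypothetical group means; the A/B/C/D layout is an assumed labeling.
A = 10.0  # treatment group, before the policy
C = 18.0  # treatment group, after the policy
B = 8.0   # control group, before the policy
D = 11.0  # control group, after the policy

treatment_change = C - A  # common trend + treatment effect
control_change = D - B    # common trend only (under parallel trends)

# Difference in differences: remove the common trend, keep the treatment effect.
did_estimate = treatment_change - control_change
print(did_estimate)
```

In the regression version, this number corresponds to the coefficient on the interaction between the treatment-group dummy and the post-policy dummy.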

Question 44

Regression discontinuity designs exploit a _____ that assigns treatment based on whether a _____ variable crosses a _____. The key idea is that individuals near the cutoff are nearly _____ except for treatment assignment

Accepted Answer

Policy, forcing, threshold, identical

Question 45

The variable that determines treatment assignment in regression discontinuity

Accepted Answer

Forcing variable

Question 46

The threshold that determines treatment in regression discontinuity

Accepted Answer

Cutoff

Question 47

The window of observations close to the cutoff in regression discontinuity

Accepted Answer

Bandwidth

Question 48

Regression discontinuity relies on the assumption that individuals cannot precisely _____ the running variable at the cutoff

Accepted Answer

Manipulate

Question 49

In regression discontinuity, we aren't actually interested in the running variable. We are interested in how the _____ changes the observations right around the _____ inside the _____

Accepted Answer

Treatment, cutoff, bandwidth

Question 50

_____ regression discontinuity: Assignment to treatment only depends on X
_____ regression discontinuity: Having X doesn't guarantee assignment to treatment, but it does increase the probability

Accepted Answer

Sharp
Fuzzy

Question 51

The main difference between regression discontinuity and difference in differences is that in RD, assignment to treatment is _____

Accepted Answer

Random

Question 52

Simple regression equation in regression discontinuity:

Y = Beta0 + Beta1(_____ - _____) + Beta2(_____ dummy variable) + Beta3((_____ - _____) * _____ dummy variable)

Accepted Answer

Running - cutoff, treatment, running - cutoff, treatment
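
Centering the running variable at the cutoff makes Beta2 the jump in the outcome at the cutoff. A minimal sketch in Python, assuming a sharp design where units at or above the cutoff are treated; the function name `rd_predict`, the coefficients, and the cutoff are hypothetical.

```python
def rd_predict(running, cutoff, b0, b1, b2, b3):
    """Fitted value from the RD equation with a centered running variable."""
    centered = running - cutoff
    treated = 1 if running >= cutoff else 0  # assumed rule: treated at or above the cutoff
    return b0 + b1 * centered + b2 * treated + b3 * centered * treated

# Hypothetical coefficients and cutoff, for illustration only.
cutoff = 50.0
b0, b1, b2, b3 = 20.0, 0.5, 4.0, -0.2

just_below = rd_predict(cutoff - 1e-9, cutoff, b0, b1, b2, b3)
at_cutoff = rd_predict(cutoff, cutoff, b0, b1, b2, b3)

# The discontinuity at the cutoff is (approximately) Beta2.
jump = at_cutoff - just_below
print(jump)
```

Beta3 lets the slope differ on the two sides of the cutoff; it does not change the size of the jump at the cutoff itself.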

Question 53

Regression discontinuity credibility depends on:

1. _____ of potential outcomes around the cutoff
2. No _____ or _____ of the running variable
3. _____ should be smooth at the cutoff
4. Correct local _____ form

Accepted Answer

Continuity
Sorting, manipulating
X
Functional

Question 54

Choosing a bandwidth is essentially a tradeoff between _____ and _____: a wider bandwidth gives us more data points, but it also brings in observations that are less comparable to those right at the cutoff

Accepted Answer

Bias, precision

Question 55

To perform a regression discontinuity in Stata:

_____ _____ _____, _____

Accepted Answer

rdrobust outcome running_variable, c(cutoff)

Question 56

Multinomial logit and probit are used for _____ _____ dependent variables

Accepted Answer

Unordered categorical

Question 57

The two necessary characteristics of count data

Accepted Answer

1. Discrete values
2. Non-negative integers

Question 58

Effect of X on Y with a quadratic term:

_____ = _____ + _____ * _____(_____)

Accepted Answer

Dy/dx = Beta1 + 2 * Beta2(X)
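
With a squared term, the effect of X changes with the level of X and flips sign at the turning point X = -Beta1 / (2 * Beta2). A sketch in Python, assuming hypothetical coefficients; the function names are illustrative.

```python
def quadratic_marginal_effect(beta1, beta2, x):
    """dY/dX for Y = Beta0 + Beta1*X + Beta2*X^2."""
    return beta1 + 2 * beta2 * x

def turning_point(beta1, beta2):
    """X value where dY/dX = 0 (the bottom or top of the U)."""
    return -beta1 / (2 * beta2)

# Hypothetical coefficients; a negative Beta2 gives an inverted U (concave).
beta1, beta2 = 4.0, -0.5

effect_at_2 = quadratic_marginal_effect(beta1, beta2, 2.0)  # positive before the peak
effect_at_6 = quadratic_marginal_effect(beta1, beta2, 6.0)  # negative after the peak
peak = turning_point(beta1, beta2)
print(effect_at_2, effect_at_6, peak)
```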

Question 59

Quadratic term coefficients tell us if the effect of the X variable on Y is _____ (U-shaped) or _____ (inverted U-shaped)

Accepted Answer

Convex, concave

Question 60

We must omit one dummy variable when analyzing categorical variables to protect the model against _____ _____

Accepted Answer

Perfect multicollinearity

Question 61

We interpret time fixed and entity fixed effects in relation to the _____

Accepted Answer

Baseline

Question 62

Two-stage least squares corrects for endogeneity by _____ the endogenous variable with a clean version driven only by _____ factors

Accepted Answer

Replacing, exogenous

ECO 418 FINAL