Question 1

OLS Linear Regression Weight Vector Formula

Accepted Answer

w = (X^T * X)^-1 * X^T * t

Question 2

Ridge Regression (L2 Regularization) Analytical Weights Formula

Accepted Answer

w = (X^T * X + lambda * I)^-1 * X^T * t
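
A minimal sketch of the closed-form solution above in plain Python, assuming a small dense design matrix. The helper names `solve` and `ridge_weights` are illustrative, not from any library. Setting `lam=0` recovers the OLS formula from Question 1.

```python
def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def ridge_weights(X, t, lam=0.0):
    """w = (X^T X + lam I)^-1 X^T t, computed by solving the linear system."""
    d = len(X[0])
    XtX = [[sum(row[i] * row[j] for row in X) + (lam if i == j else 0.0)
            for j in range(d)] for i in range(d)]
    Xt_t = [sum(row[i] * tn for row, tn in zip(X, t)) for i in range(d)]
    return solve(XtX, Xt_t)


# Toy data generated by t = 1 + 2x, with a bias column of ones.
X = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]
t = [1.0, 3.0, 5.0, 7.0]
w_ols = ridge_weights(X, t)             # lam = 0: ordinary least squares
w_ridge = ridge_weights(X, t, lam=1.0)  # lam > 0 shrinks the weights
```

Solving the linear system avoids forming the explicit inverse, which is usually the numerically safer choice.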

Question 3

Binary Logistic Regression Hypothesis Function (Sigmoid)

Accepted Answer

y = sigma(w^T * phi) = 1 / (1 + exp(-w^T * phi))

Question 4

Binary Logistic Regression Cross-Entropy Loss Function

Accepted Answer

E(w) = - sum_{n=1}^N [ t_n * ln(y_n) + (1 - t_n) * ln(1 - y_n) ]
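
A short sketch of the sigmoid hypothesis and the cross-entropy loss from Questions 3 and 4. The clipping constant `eps` is an assumption added here to keep the logarithm finite; it is not part of the formula.

```python
import math


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def predict(w, phi):
    """y = sigma(w^T phi)."""
    return sigmoid(sum(wi * pi for wi, pi in zip(w, phi)))


def cross_entropy(w, Phi, t, eps=1e-12):
    """E(w) = -sum_n [t_n ln y_n + (1 - t_n) ln(1 - y_n)], targets in {0, 1}."""
    total = 0.0
    for phi_n, t_n in zip(Phi, t):
        y_n = min(max(predict(w, phi_n), eps), 1.0 - eps)  # clip away from 0 and 1
        total += t_n * math.log(y_n) + (1 - t_n) * math.log(1.0 - y_n)
    return -total


# With zero weights every prediction is 0.5, so the loss is N * ln(2).
loss_at_zero = cross_entropy([0.0, 0.0], [[1.0, 2.0], [1.0, -1.0]], [1, 0])
```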

Question 5

Multiclass Logistic Regression Hypothesis Function (Softmax)

Accepted Answer

p(C_k | phi) = y_k(phi) = exp(w_k^T * phi) / sum_j exp(w_j^T * phi)

Question 6

Multiclass Logistic Regression Negative Log-Likelihood Loss

Accepted Answer

L(w_1, ..., w_K) = - sum_{n=1}^N sum_{k=1}^K t_{nk} * ln(y_{nk})
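
A sketch of the softmax hypothesis and the negative log-likelihood from Questions 5 and 6, assuming one-hot targets t_{nk}. Subtracting the maximum score before exponentiating is a standard stability trick and leaves the result unchanged.

```python
import math


def softmax(scores):
    m = max(scores)  # shift for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def nll(W, Phi, T):
    """W: K weight vectors; Phi: N feature vectors; T: N one-hot target rows."""
    loss = 0.0
    for phi_n, t_n in zip(Phi, T):
        scores = [sum(wi * pi for wi, pi in zip(w_k, phi_n)) for w_k in W]
        y_n = softmax(scores)
        # Only the true class contributes, since t_nk is 0 elsewhere.
        loss -= sum(t_nk * math.log(y_nk) for t_nk, y_nk in zip(t_n, y_n) if t_nk)
    return loss


# With all-zero weights each class gets probability 1/K, so the loss is N * ln(K).
W0 = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
loss_uniform = nll(W0, [[1.0, 2.0], [1.0, -1.0]], [[1, 0, 0], [0, 0, 1]])
```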

Question 7

Perceptron Error Function (Loss over misclassified patterns)

Accepted Answer

E_P(w) = - sum_{n in M} w^T * (phi_n * t_n)
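
A small sketch of the perceptron criterion, assuming targets t_n in {-1, +1}. Points lying exactly on the boundary are treated as misclassified here, which is a convention chosen for this example; they contribute zero either way.

```python
def perceptron_error(w, Phi, t):
    """E_P(w) = -sum over misclassified n of w^T phi_n t_n."""
    err = 0.0
    for phi_n, t_n in zip(Phi, t):
        margin = t_n * sum(wi * pi for wi, pi in zip(w, phi_n))
        if margin <= 0:  # pattern n belongs to the misclassified set M
            err -= margin
    return err


# First pattern is correct; the other two are misclassified with margins -2 and -3.
e = perceptron_error([1.0, 0.0], [[1.0, 0.0], [-2.0, 0.0], [3.0, 1.0]], [1, 1, -1])
```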

Question 8

Mallows' C_p Statistic Formula

Accepted Answer

C_p = (1 / N) * (RSS + 2 * d * sigma_tilde^2)

Question 9

Akaike Information Criterion (AIC) Formula

Accepted Answer

AIC = -2 * ln(L) + 2 * d

Question 10

Bayesian Information Criterion (BIC) Formula

Accepted Answer

BIC = -2 * ln(L) + d * ln(N)
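
A minimal sketch of AIC and BIC from Questions 9 and 10, taking the maximized log-likelihood ln(L) as input. The argument names are illustrative.

```python
import math


def aic(log_likelihood, d):
    """AIC = -2 ln(L) + 2 d, with d the number of parameters."""
    return -2.0 * log_likelihood + 2.0 * d


def bic(log_likelihood, d, n):
    """BIC = -2 ln(L) + d ln(N), with N the number of samples."""
    return -2.0 * log_likelihood + d * math.log(n)


a = aic(-100.0, 3)
b = bic(-100.0, 3, 100)
```

Since ln(N) > 2 once N >= 8, BIC penalizes each extra parameter more heavily than AIC on all but very small datasets.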

Question 11

Generalization Error Bound for Finite Hypothesis Spaces (Agnostic)

Accepted Answer

L_true(h) <= L_train(h) + sqrt( (ln|H| + ln(2/delta)) / (2*N) )

Question 12

Generalization Error Bound for Infinite Hypothesis Spaces (VC Bound)

Accepted Answer

L_true(h) <= L_train(h) + sqrt( (VC(H) * (ln(2*N / VC(H)) + 1) + ln(4/delta)) / N )

Question 13

PAC Learning Sample Complexity Bound (Finite Space, Agnostic)

Accepted Answer

N >= (1 / (2 * epsilon^2)) * ( ln|H| + ln(2 / delta) )
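
A quick sketch that evaluates the sample-complexity bound above and checks it against the generalization gap from Question 11. Function names are illustrative.

```python
import math


def pac_sample_size(h_size, epsilon, delta):
    """Smallest integer N with N >= (ln|H| + ln(2/delta)) / (2 epsilon^2)."""
    return math.ceil((math.log(h_size) + math.log(2.0 / delta)) / (2.0 * epsilon ** 2))


def generalization_gap(h_size, n, delta):
    """sqrt((ln|H| + ln(2/delta)) / (2 N)), the slack term in the finite-H bound."""
    return math.sqrt((math.log(h_size) + math.log(2.0 / delta)) / (2.0 * n))


n = pac_sample_size(1000, epsilon=0.1, delta=0.05)
gap = generalization_gap(1000, n, delta=0.05)  # at most epsilon by construction
```

The two formulas are the same inequality solved for different unknowns: fixing N gives the gap, fixing the gap gives N.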

Question 14

VC Dimension Sample Complexity Bound

Accepted Answer

N >= (1 / epsilon) * ( 4 * log2(2/delta) + 8 * VC(H) * log2(13/epsilon) )

Question 15

Dual Representation of Linear Regression (Dual Weight Vector)

Accepted Answer

w = X^T * a  where a = (I * sigma^2 + X * X^T)^-1 * t

Question 16

Gram Matrix (Kernel Matrix) Element Definition

Accepted Answer

K_nm = k(x_n, x_m) = phi(x_n)^T * phi(x_m)
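
A small sketch of the Gram matrix definition for scalar inputs. The feature map `phi` and the polynomial kernel are hypothetical choices made for illustration; they are picked so that k(x, z) = (1 + x z)^2 equals phi(x)^T phi(z) exactly.

```python
import math


def phi(x):
    """Hypothetical feature map for a scalar input: (1, sqrt(2) x, x^2)."""
    return [1.0, math.sqrt(2.0) * x, x * x]


def poly_kernel(x, z):
    """k(x, z) = (1 + x z)^2, which equals phi(x)^T phi(z) for the map above."""
    return (1.0 + x * z) ** 2


def gram_matrix(xs, kernel):
    """K_nm = k(x_n, x_m)."""
    return [[kernel(xn, xm) for xm in xs] for xn in xs]


xs = [-1.0, 0.5, 2.0]
K = gram_matrix(xs, poly_kernel)
# Same matrix built from explicit feature vectors.
K_explicit = [[sum(a * b for a, b in zip(phi(xn), phi(xm))) for xm in xs] for xn in xs]
```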

Question 17

Soft-Margin Support Vector Machine (SVM) Primal Objective

Accepted Answer

min_{w, b, xi} (1/2)||w||^2 + C * sum_{i=1}^N xi_i

Question 18

Soft-Margin SVM Constraints

Accepted Answer

t_i * (w^T * x_i + b) >= 1 - xi_i  and  xi_i >= 0
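
A sketch that evaluates the primal objective from Question 17 for a given (w, b), assuming labels t_i in {-1, +1}. For fixed w and b, the smallest slack satisfying both constraints is xi_i = max(0, 1 - t_i (w^T x_i + b)), so that value is used here. This only evaluates the objective; it is not a solver.

```python
def soft_margin_objective(w, b, X, t, C):
    """Return ((1/2)||w||^2 + C * sum(xi), xi) using the smallest feasible slacks."""
    slacks = []
    for x_i, t_i in zip(X, t):
        margin = t_i * (sum(wj * xj for wj, xj in zip(w, x_i)) + b)
        slacks.append(max(0.0, 1.0 - margin))  # satisfies both constraints
    objective = 0.5 * sum(wj * wj for wj in w) + C * sum(slacks)
    return objective, slacks


# The second point sits inside the margin (t * score = 0.5), so it needs slack 0.5.
obj, xi = soft_margin_objective(
    [1.0, 0.0], 0.0, [[2.0, 0.0], [0.5, 0.0], [-1.0, 0.0]], [1, 1, -1], C=1.0
)
```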

Question 19

Soft-Margin SVM Dual Maximization Objective

Accepted Answer

max_alpha sum(alpha_n) - (1/2) * sum_n sum_m alpha_n * alpha_m * t_n * t_m * k(x_n, x_m)

Question 20

Soft-Margin SVM Dual Constraints

Accepted Answer

0 <= alpha_n <= C  and  sum(alpha_n * t_n) = 0

Question 21

Gaussian Process Predictive Mean Function

Accepted Answer

m(x_{N+1}) = k^T * C_N^-1 * t

Question 22

Gaussian Process Predictive Variance Function

Accepted Answer

sigma^2(x_{N+1}) = k(x_{N+1}, x_{N+1}) + sigma^2 - k^T * C_N^-1 * k   (the sigma^2 on the right-hand side is the noise variance)

Question 23

State-Value Function V^pi(s) Bellman Expectation Equation

Accepted Answer

V^pi(s) = sum_a pi(a|s) [ R(s,a) + gamma * sum_{s'} P(s'|s,a) * V^pi(s') ]

Question 24

Action-Value Function Q^pi(s,a) Bellman Expectation Equation

Accepted Answer

Q^pi(s,a) = R(s,a) + gamma * sum_{s'} P(s'|s,a) * sum_{a'} pi(a'|s') * Q^pi(s',a')

Question 25

Optimal State-Value Function V*(s) Bellman Optimality Equation

Accepted Answer

V*(s) = max_a [ R(s,a) + gamma * sum_{s'} P(s'|s,a) * V*(s') ]

Question 26

Optimal Action-Value Function Q*(s,a) Bellman Optimality Equation

Accepted Answer

Q*(s,a) = R(s,a) + gamma * sum_{s'} P(s'|s,a) * max_{a'} Q*(s', a')

Question 27

Value Iteration Value Update Rule

Accepted Answer

V_{k+1}(s) <- max_a [ R(s,a) + gamma * sum_{s'} P(s'|s,a) * V_k(s') ]
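
A minimal sketch of value iteration on a two-state toy MDP. The MDP, the dictionary layout of `P` and `R`, and the stopping tolerance are assumptions made for this example.

```python
def value_iteration(states, actions, P, R, gamma, tol=1e-10):
    """Repeat V_{k+1}(s) = max_a [R(s,a) + gamma * sum_s' P(s'|s,a) V_k(s')]."""
    V = {s: 0.0 for s in states}
    while True:
        V_new = {
            s: max(
                R[s][a] + gamma * sum(p * V[s2] for s2, p in P[s][a].items())
                for a in actions
            )
            for s in states
        }
        if max(abs(V_new[s] - V[s]) for s in states) < tol:
            return V_new
        V = V_new


states, actions = ["A", "B"], ["stay", "go"]
# P[s][a] maps each next state to its probability (deterministic here).
P = {
    "A": {"stay": {"A": 1.0}, "go": {"B": 1.0}},
    "B": {"stay": {"B": 1.0}, "go": {"A": 1.0}},
}
R = {"A": {"stay": 0.0, "go": 1.0}, "B": {"stay": 2.0, "go": 0.0}}
V = value_iteration(states, actions, P, R, gamma=0.9)
```

Here staying in B forever yields 2 / (1 - 0.9) = 20, and moving from A to B yields 1 + 0.9 * 20 = 19, which is what the iteration converges to.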

Question 28

Policy Iteration (Greedy Improvement Rule)

Accepted Answer

pi_{k+1}(s) <- argmax_a [ R(s,a) + gamma * sum_{s'} P(s'|s,a) * V^{pi_k}(s') ]

Question 29

Bellman Optimality Operator T* acting on V

Accepted Answer

(T*V)(s) = max_a [ R(s,a) + gamma * sum_{s'} P(s'|s,a) * V(s') ]

Question 30

Max-Norm Contraction Property of Bellman Operators

Accepted Answer

||T f_1 - T f_2||_infinity <= gamma * ||f_1 - f_2||_infinity

Question 31

Temporal Difference Error (TD Error) delta_t

Accepted Answer

delta_t = r_{t+1} + gamma * V(s_{t+1}) - V(s_t)

Question 32

Temporal Difference TD(0) State-Value Update Rule

Accepted Answer

V(s_t) <- V(s_t) + alpha * (r_{t+1} + gamma * V(s_{t+1}) - V(s_t))

Question 33

SARSA (On-Policy Control) Action-Value Update Rule

Accepted Answer

Q(s,a) <- Q(s,a) + alpha * (r + gamma * Q(s',a') - Q(s,a))

Question 34

Q-Learning (Off-Policy Control) Action-Value Update Rule

Accepted Answer

Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_{a'} Q(s',a') - Q(s,a))
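
A sketch contrasting the SARSA and Q-learning updates from Questions 33 and 34 on a tabular Q stored as a dictionary keyed by (state, action). The table layout and the numbers are illustrative.

```python
def sarsa_update(Q, s, a, r, s_next, a_next, alpha, gamma):
    """On-policy: bootstrap from the action a_next actually taken in s_next."""
    target = r + gamma * Q[(s_next, a_next)]
    Q[(s, a)] += alpha * (target - Q[(s, a)])


def q_learning_update(Q, s, a, r, s_next, actions, alpha, gamma):
    """Off-policy: bootstrap from the greedy action in s_next."""
    target = r + gamma * max(Q[(s_next, a2)] for a2 in actions)
    Q[(s, a)] += alpha * (target - Q[(s, a)])


actions = ["l", "r"]
Q = {(0, "l"): 0.0, (0, "r"): 0.0, (1, "l"): 1.0, (1, "r"): 3.0}

# Same transition (reward 1, next state 1), different bootstrap targets.
sarsa_update(Q, 0, "l", 1.0, 1, "l", alpha=0.5, gamma=0.9)            # target 1.9
q_learning_update(Q, 0, "r", 1.0, 1, actions, alpha=0.5, gamma=0.9)  # target 3.7
```

The only difference between the two rules is the bootstrap term: Q(s', a') for SARSA versus max over a' of Q(s', a') for Q-learning.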

Question 35

Thompson Sampling Parameter Update for Bernoulli Success

Accepted Answer

alpha_i <- alpha_i + 1

Question 36

Thompson Sampling Parameter Update for Bernoulli Failure

Accepted Answer

beta_i <- beta_i + 1
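
A sketch of one Thompson sampling step for Bernoulli arms, combining the two updates from Questions 35 and 36. The uniform Beta(1, 1) prior, the simulated arm probabilities, and the `pull` callback are assumptions for this example.

```python
import random


def thompson_step(alpha, beta, pull, rng):
    """Sample each arm's Beta posterior, play the best sample, update its counts."""
    samples = [rng.betavariate(a, b) for a, b in zip(alpha, beta)]
    i = max(range(len(samples)), key=samples.__getitem__)
    if pull(i):
        alpha[i] += 1  # success
    else:
        beta[i] += 1   # failure
    return i


rng = random.Random(0)
true_p = [0.2, 0.8]               # hypothetical success probabilities
alpha, beta = [1, 1], [1, 1]      # Beta(1, 1) prior on each arm
for _ in range(500):
    thompson_step(alpha, beta, lambda i: rng.random() < true_p[i], rng)
```

Each step increments exactly one counter, so alpha_i + beta_i minus the prior counts equals the number of times arm i was played.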

Question 37

Incremental Target/Reward Mean Formula

Accepted Answer

Q_{k}(a) <- Q_{k-1}(a) + (1 / k) * (r_k - Q_{k-1}(a))
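
A short sketch of the incremental mean, showing that the running update reproduces the ordinary average without storing past rewards.

```python
def incremental_mean(rewards):
    """Apply Q_k = Q_{k-1} + (1/k) * (r_k - Q_{k-1}) over a reward sequence."""
    q = 0.0
    for k, r in enumerate(rewards, start=1):
        q += (r - q) / k
    return q


rewards = [1.0, 0.0, 1.0, 1.0]
q = incremental_mean(rewards)  # matches sum(rewards) / len(rewards)
```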

ML