click below
click below
Normal Size Small Size show me how
1Quan Variable
Basic Stats
| Question | Answer |
|---|---|
| useful graph | boxplot or histogram |
| useful numbers | location: mean and median spread: standard deviation and IQR 5-number summary: min, max, IQR and median |
| formula for mean | x ̅=1/N ∑_((i=1))^N▒xi preferable for approximately normal data |
| formula for median | M=midn or midx1+midx2/2 less affected by outliers therefore used for outlier ridden data |
| formula for standard deviation | s=√(1/(N-1) ∑_(i=1)^N▒〖(xi-x)〗^2 ) preferable for approximately normal data |
| formula for IQR | Q3 - Q1= IQR less affected by outliers therefore used for outlier ridden data |
| numerically define an outlier | more than 1.5 x IQR lower than Q1 more than 1.5 x IQR higher than Q3 |
| define linear transformation | transformation of a variable from x to xnew |
| examples of linear transformation use | change of units use of normal assumption therefore to find 'z' scores |
| formula for linear transformation | xnew=a+bx |
| formula for new mean once linear transformation has occurred | xbar new=a+bxbar |
| formula for new median once linear transformation has occurred | Mnew=a+bM |
| formula for new standard deviation once linear transformation has occurred | snew=bs |
| formula for new IQR once linear transformation has occurred | 1QRnew=bIQR |
| density curves | area under the curve in any range of values is the proportion of all observations that fall within that range for a quantitative variable = like a smoothed out histogram describes probabilistic behaviour |
| total area under the density curve equals? | 1 |
| normality assumption | normal curve can be used if a histogram looks like a normal curve termed 'reasonable' must start at 0 and end at 0 |
| normal quantile plot | if in a straight line, or close to it, then normal and assumption is reasonable |
| 68-95-99.7% rule | 68% of results will be within 1 standard deviation of the mean 95% of results will be within 2 standard deviations of the mean 99.7% of data will be within 3 standard deviations of the mean |
| symbol for the mean of a density curve | μ |
| symbol for the standard deviation of a density curve | σ |
| normal distribution shorthand | X = random variable N = normal distribution first number in brackets = mean second number in brackets = standard deviation |
| standard normal variable | Z corresponds to the area under the curve of the corresponding region will always be to the left |
| standard normal distribution table | to find P: Z found along x and y axis to find Z: P found in table ordered from smallest to largest |
| reverse standard normal distribution table | P(Z<c) c = right of Z |
| X is | N(μ,σ) |
| standardising transformation | Z= (X-μ)/σ used when distribution is N(0,1)(is normal but needs proportions changed) |
| useful test | 1-sample test for μ when σ is unknown |
| useful inference | Cl for μ |