click below
click below
Normal Size Small Size show me how
Stats Chapter 4
vocab
| Question | Answer |
|---|---|
| distribution | slices up all the possible values of the variable into equal width bins and gives the number of values (or counts) falling into each bin |
| histogram (relative frequency histogram) | uses adjacent bars to show the distribution of a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in each bin |
| gap | a region of the distribution where there are no values |
| stem-and-leaf display | shows quantitative values in a way that sketches the distribution of the data |
| dotplot | graphs a dot for each case against a single axis |
| shape | described by: single vs. multiple modes, symmetry vs skewness, |
| center | the place in the distribution of a variable that you'd point to if you wanted to attempt the impossible by summarizing the entire distribution with a single number (mean and median) |
| spread | a numerical summary of how tightly the values are clustered around the center; IQR, STD DEV |
| mode | a hump or local high point in the shape of the distribution of a variable; apparent location can change as scale of a histogram changes |
| unimodal (bimodal) | having one (two) modes |
| uniform | a distribution that is roughly flat |
| symmetric | a distribution with two halves on either side of the center that look like mirror images of each other |
| tails | are the parts that typically trail off on either side; distributions can be characterized as having long tails or short tails |
| skewed | a distribution that is not symmetric and one tail is longer than the other (skewed LEFT if the tail is longer on the right vice versa) |
| outliers | extreme values that don't appear to belong with the rest of the data; can be unusual values that need more investigation or mistakes |
| median | middle value with half data above and half below; if N is even then it is the average of the two values; usually paired with the IQR |
| range | max - min |
| quartile | lower quartile (Q1): 1/4 data below it; upper quartile (Q3) 1/4 data above it; used with median it divides the data into 4 parts |
| Interquartile Range (IQR) | difference between Q1 and Q3; Q3 - Q1 = IQR; reported along with the median |
| percentile | the ith percentile is the number that falls above i% of the data |
| 5-number Summary | reports the minimum, Q1, median, Q3, and the maximum values |
| mean | average; paired with STDDEV |
| resistant | a calculated summary where outliers have a small effect |
| variance | sum of squared deviations from the mean divided by the count minus 1 |
| standard deviation | square root of the variance; s = sqrt((sigma(y-ybar)^2) / n-1); reported with the mean |