click below
click below
Normal Size Small Size show me how
AP Stats Final
| Question | Answer |
|---|---|
| percentiles | a value on a scale of one hundred that indicates the percent of a distribution that is equal to or below it |
| cumulative frequency | The sums of the frequencies of the data values from smallest to largest. |
| z-scores | indicates by how many standard deviations a score is above or below the mean --> (x-mean)/standard deviation |
| empirical rule | 1 standard deviation (68%), 2 standard deviations (95%) and 3 standard deviations (99.7%) of the mean |
| residuals | Differences between observed and predicted values. |
| correlation coefficient (r) | single number, ranging from -1.0 to 1.0, that indicates the strength and direction of an association between two variables |
| coefficient of determination (r^2) | The percent of changes in y that are explained by changes in x |
| frequency vs relative frequency | frequency refers to how many individuals are within a certain category, whereas relative frequency is the frequency divided by the total number of individuals |
| SOCS | shape, outlier, center (median), spread |
| box plots | shows the median, the Q1 (25%) and Q3 (75%) by finding the median again of the upper and lower sections |
| standard deviation | a computed measure of how much scores vary around the mean score |
| stemplots | Represents quantitative data by separating each value into two parts: the stem (such as the leftmost digit) and the leaf (such as the rightmost digit) |
| histograms | bar graphs taken from frequency distributions, quantitative data and drawn with no space between rectangles |
| bar graphs | use horizontal or vertical bars to contrast categorical data |
| Convenience sampling | use results that are easy to get |
| Voluntary response sampling | allows people to choose to be in the sample by responding to a general invitation |
| Random sampling | involves using a chance process to determine which members of population are included in sample |
| simple random sampling | every member of the population has an equal probability of being selected for the sample |
| Sampling frame | a numbered list of individuals from whom the sample is drawn |
| Nonresponse bias | bias introduced to a sample when a large fraction of those sampled fails to respond |
| Question wording bias | bias created by the wording of the survey questions |
| Self-reported response bias | when individuals inaccurately report their own traits |
| Cluster random sampling | dividing the total population into groups (or clusters), then using simple random sampling to select which clusters participate; all observations in a selected cluster are included in the sample |
| Stratified random sampling | Population divided into subgroups and one random sample taken from each group |
| Systematic random sampling | selects a sample from an ordered arrangement of the population by randomly selecting one of the first k individuals and choosing every kth individual thereafter |
| confounding variable | an "extra" variable that is not the focus of a study but can ruin the results by influencing both the independent and dependent variables |
| Matched pairs design | participants are grouped into pairs based on similar characteristics, like age or gender, and then one member of each pair is randomly assigned to a treatment group and the other to a control group |
| completely randomized design | experimental units are randomly assigned to different treatment groups |
| randomized block design | an experimental method where subjects are first grouped into "blocks" based on a shared characteristic, and then treatments are randomly assigned within each block |
| discrete variables | a variable that can only take on specific, distinct, and countable values, such as whole numbers |
| how to find outliers with IQR | calculate IQR=Q3-Q1 and the upper and lower "fences" by adding and subtracting 1.5 times the IQR from Q3 and Q1, respectively. Any data point outside of these fences is considered an outlier. |
| continuous variable | a type of quantitative variable that can take on any value within a given range and is measured rather than counted |