click below
click below
Normal Size Small Size show me how
Stats Ch. 2 Vocab
| Term | Definition |
|---|---|
| Area principle | In a statistical display, each data value should be represented by the same amount of area |
| Frequency table | A frequency table lists the categories of a categorical variable and gives the number of observations of each category |
| Relative frequency table | A relative frequency table lists the categories of a categorical variable and gives the fraction or percent of observations of each category |
| Distribution | The distribution of a variable gives the possible values of the variable and the relative frequency of each value |
| Bar chart | Bar charts show a bar whose area represents the count of observations for each category of a categorical variable |
| Pie chart | Pie charts show how a "whole" divides into categories by showing a wedge of a circle whose are corresponds to the proportion in each category |
| Categorical Data Condition | Basically just use these methods for CATEGORICAL data, not quantitative (Ch. 2) |
| Contingency table | Table that displays counts/percentages of individuals falling into named categories on two/more variables. It categorizes individuals on all variables at once, revealing possible patterns in 1 variable that may be contingent on the category of the other. |
| Marginal distribution | In a contingency table, the distribution of either variable alone is called the marginal distribution. Counts/percentages are the totals found in the margins (last row/column) of the table. |
| Conditional distribution | The distribution of a variable restricting the 'who' to consider only a smaller group of individuals is called a conditional distribution |
| Segmented bar chart | Bar chart w/ bars stacked on top of each other in vertical graph/lined up side-by-side in horizontal graph. Shows relative frequencies so that the distribution of categorical variable is more easily compared between groups. |
| Independence | Conditional distribution of one variable is same for each category of the other; no association between variables |
| Side-by-side bar chart | Bar chart which interweaves the bars of 2/more conditional distributions to facilitate their contrast. BE CAREFUL READING/USING |
| Mosaic plot | A mosaic plot is a special kind of segmented bar chart whose bars' widths display the marginal distribution of the variable represented y the bars |
| Simulation | A random reenactment of data collection under one or more assumptions. If real data look very different from simulated data, then assumptions are called into question |
| Simpson's paradox | When averages are taken across different groups, they can appear to contradict the overall averages. |