Question 1

Sample

Accepted Answer

Subgroup of the population

Question 2

Sampling

Accepted Answer

Process of selecting sample from population

Question 3

Random sampling

Accepted Answer

Independent selection

Question 4

Descriptive vs. Inferential Statistics

Accepted Answer

–   Descriptive: primary purpose is to describe some aspect of the data
   Inferential: primary purpose is to infer (to estimate or to make a decision, test a hypothesis)

Question 5

All inferential statistics have the following in common:

Accepted Answer

–   use of some descriptive statistic
–   use of probability
–   potential for estimation
–   sampling variability
–   sampling distributions
–   use of a theoretical distribution
–   two hypotheses, two decisions, two types of error

Question 6

Research defined

Accepted Answer

Structured Problem Solving

Question 7

Scientific methods: steps (cyclic)

Accepted Answer

–   1. encounter and identify problem
–   2. formulate hypotheses, define variables
–   3. think through consequences of hypotheses
–   4. design & run study, collect data, compute statistics, test hypotheses
–   5. draw conclusions

Question 8

Variable

Accepted Answer

entity that is free to take on different values

Question 9

ndependent variable (IV)

Accepted Answer

its values are manipulated by the researcher, comes first in time

Question 10

Dependent variable (DV)

Accepted Answer

measured by researcher, follows the IV in time

Question 11

Population

Accepted Answer

Target group for inference

Question 12

Extraneous variable (EV)

Accepted Answer

controlled by researcher
•    randomization of subjects to groups
•    keep all subjects constant on EV
•    include EV in the design of the experiment

Question 13

Predictor variable (PV)

Accepted Answer

comes first in time but there is no manipulation, analogous to IV.

Question 14

Criterion variable (CV):

Accepted Answer

follows PV in time, analogous to DV.

Question 15

Causal relationship:

Accepted Answer

IV causes the DV

Question 16

Predictive relationship:

Accepted Answer

PV predicts the CV

Question 17

2 Types of research

Accepted Answer

1. experimental 2. observational

Question 18

True experiment

Accepted Answer

•    manipulation of IV
•    randomization of subjects to groups
•    causal relationship between IV and DV

Question 19

Observational research

Accepted Answer

•    no manipulation
•    minimal control of EV
•    predictive relationship between PV and CV

Question 20

Stem and Leaf Display

Accepted Answer

•      The first digit(s) of a score form the stem, the last digit(s) form the leaf.
•      We want 10-20 total number   of stems.
•      Number of stems per digit depends on total number of stems: can do 1, 2, or 5 stems per digit.

Question 21

Description With Statistics
Aspects or characteristics of data that we can describe are:

Accepted Answer

–   Middle
–   Spread
–   Skewness
–   Kurtosis

Question 22

Other words that describe Middle

Accepted Answer

central tendency, location, center

Question 23

Statistics that Measure middle are:

Accepted Answer

mean, median, mode
•      “Middle” is the aspect of data
    we want to describe.
•      We describe/measure the middle of data in a population with the parameter m (‘mu’); we usually don’t know m, so we estimate it with X-bar.

Question 24

Other words that describe Spread

Accepted Answer

variability, dispersion, skatter

Question 25

Statistics that Measure spread are:

Accepted Answer

range, variance, standard deviation, midrange

•      “Spread” is the aspect of data we want to describe.
•      Any statistic that describes/measures spread should have these characteristics: it should
–   Equal zero when the spread is zero.
–   Inc

Question 26

Skewness

Accepted Answer

=departure from symmetry
–   Positive skewness = tail (extreme scores) in positive direction
–   Negative skewness = tail (extreme scores) in negative direction
(The Few name the Skew)

Question 27

Kurtosis

Accepted Answer

peakedness relative to normal curve

Question 28

Sample Mean

Accepted Answer

-The sample mean is the sum of the scores divided by the number of scores, and is symbolized by X-bar, X = SX/N
-For example, for X1=4, X2=1, X3=7, N=3, SX=12 and X = SX/N = 12/3 = 4
•      Characteristics:
–   X-bar is the balance point

Question 29

Sample Median

Accepted Answer

•      The median is the middle of the    ordered scores, and is symbolized as X50.
•      Median position (as distinct from the median itself) is (N+1)/2 and is used to find the median.
•      Example: X1=4, X2=1, X3=7, then N=3.
•      Characteristic

Question 30

Sample Mode

Accepted Answer

•      The mode is the most frequent score.
•      Examples:
–   1 1 4 7, the mode is 1.
–   1 1 4 7 7, there are two modes, 1 and 7.
–   1 4 7, there is no mode.
•      Characteristics:
–   Has problems: more than one, or none; maybe not in the mid

Question 31

Spred cont.

Accepted Answer

•      We describe/measure the spread of data in a sample with the statistics:
–   Range = high score-low score.
–   Midrange, MR.
–   Sample variance, s*².
–   Sample standard deviation, s*.
–   Unbiased variance estimate, s².
–   s.
•      We des

Question 32

Midrange (MR)

Accepted Answer

•    Formula is MR=UH-LH
–   UH=upper hinge
–   LH=lower hinge
–   Hinges cut off 25% of the data in each tail
•      Hinge position is ([median position]+1)/2.
–   [median position] is the whole number part of the median position (remember, median p

Question 33

Hinge position

Accepted Answer

([median position]+1)/2
–   [median position] is the whole number part of the median position (remember, median pos.=(N+1)/2)
•      Use hinge position to count in from the tails to find the hinges.

Question 34

Sample Standard Deviation, s*Sample Variance, s*²

Accepted Answer

•      Definitional formula: s*²=S(X-X)²/N, the average squared deviation from X-bar.
Sample Standard Deviation= s*
Unbiased Variance Estimate, s²

Question 35

Box-plots

Accepted Answer

•      A pictorial description that uses a box to show the middle of the data and lines called whiskers to show the tails of a distribution.

Question 36

3 Parts to Box Plot

Accepted Answer

1.) Box
 2.) Wiskers 
3.) Outliers

Question 37

Box

Accepted Answer

–   Upper end is at the UH, lower end is at the LH    - Line across the middle is X50

Question 38

Whiskers

Accepted Answer

–   Whiskers are lines drawn from the ends of the box (the hinges) to adjacent values, UAV & LAV. 
–   Adjacent values are the first real data values inside the inner fences. 
– Inner fences, upper and lower
•    Upper, UIF=UH+1.5MR
•    Lower, LIF= L

Question 39

Outliers

Accepted Answer

Outliers: outside whiskers, marked with

Question 40

Midrange (MR)

Accepted Answer

UH- LH

Question 41

z Scores

Accepted Answer

•      The aspect of the data we want to describe/measure is relative position. •      z  scores are statistics that describe the relative position of something in its distribution.

Question 42

Z score formula

Accepted Answer

z is something minus its mean divided by its standard deviation.

Question 43

z score characteristics

Accepted Answer

–   The mean of a distribution of z scores is zero.
–   The variance of a distribution of z scores is one.
–   The shape of a distribution of z scores is reflective, the shape is the same as the shape of the distribution of the Xs.

Question 44

Characteristics of Normal Distributions

Accepted Answer

–   Symmetric, continuous, unimodal.
–   Bell-shaped.
–   Scores range from -¥ to +¥ .
–   Mean, median, and mode are all the same value.
–   Each distribution has two parameters, m and s².

Question 45

Use of Z score

Accepted Answer

•      We use this distribution to get probabilities associated with a z score (probability, proportion, and area under the curve are synonymous).
- look up z in table to find probabilities.

Question 46

Correlation

Accepted Answer

–   Defined as the degree of linear relationship between X and Y. –  Is measured/described by the statistic r.

Question 47

Regression

Accepted Answer

–   Is concerned with the prediction of Y from X Forms a prediction equation to predict Y from X
     Uses the formula for a straight line, Y’=bX+a.
–   Y’ is the predicted Y score on the criterion variable.
–   b is the slope, b=DY/ D X=rise/run.
–

Question 48

r=

Accepted Answer

r=SzXzY/N, the average product of z scores for X and Y
–   Works with two variables, X and Y
–   -1<r<1, r measures positive or negative relationships
–   Measures only the degree of linear relationship
–   r2=proportion of variability in Y that is e

Question 49

r2=

Accepted Answer

proportion of variability in Y that is explained by X.

Question 50

Correlation: Undefined

Accepted Answer

If there is no spread in X or Y, then r is undefined. Note that any z is undefined if the standard deviation is zero, and r=SzXzY/N.

Question 51

Population correlation coefficient,

Accepted Answer

r (rho)

Question 52

regression cont.

Accepted Answer

•      Linear only.
•      Generalize only for X values in
    your sample.
•      Actual observed Y is different from Y’ by an amount called error, e, that is, Y=Y’+e.
•      Error in regression is e=Y-Y’.
•      Many different potential regression

Question 53

Line of Best Fit

Accepted Answer

The statistics b and a are computed so as to minimize the sum of squared errors, –   Se2=S(Y-Y’)2 is a minimum. –   This is called the Least Squares Criterion.

Question 54

Partition total spread

Accepted Answer

–   Total = Explained + Not Explained
–   This is true for proportion of spread and amount of spread.
•    Proportion: 1 = r2 + (1-r2)
•    Amount: s2y = s2y r2 + s2y(1-r2)

Question 55

Probability

Accepted Answer

Defined as relative frequency of occurence.

Question 56

Sample space

Accepted Answer

all possible outcomes of an experiment

Question 57

Elementary event

Accepted Answer

a single member of the sample space

Question 58

Event

Accepted Answer

any collection of elementary events

Question 59

p(elementary event

Accepted Answer

1/(total number)

Question 60

p(event)

Accepted Answer

(number in the event)/(total number)

Question 61

Conditional probability

Accepted Answer

•    p(A|B)=(number in [A and B])/(number in B)
•    The probability of A in the redefined (reduced) sample space of B.

Question 62

Big 3 Probability Rules

Accepted Answer

1. independence 2. mulitplication, mutually exclusive 3.) addition

Question 63

Independence (1)

Accepted Answer

events A and B are independent if
•    p(A|B)=p(A)
•    The A probability is not changed by
   reducing the sample space to B.

Question 64

Multiplication (And) Rule (2)

Accepted Answer

•    p(A and B)=p(A)p(B|A)=p(A|B)p(B)

stat Midterm front

stat flashcards

"Know" box contains:
Time elapsed:
Retries: