World's most versatile flashcards

or...
Reset Password Sign Up

Statistics

        Help  

Question
Answer
Flap 3
A population   is the set of all the possible items to be observed. example: Whilst investigating the height of males in Wales, the population would be the height of all the males in Wales.    
Random sampling:   this method gives every item of the population an equal chance of selection. This can be done in various ways for example by simply picking out of a hat or by using a random number generator on a calculator.    
Stratified sampling:   some populations are naturally split into a number of strata (kind of like sub groups). We can separate the strata and find what proportion of the population is in each stratum. We can then select a random sample from each stratum proportional to its size    
measure of central tendency   is just a mathematical and rather posh way of saying "averages".    
The Mode   It is the piece or pieces of data that occur most often.    
The Median   The median is the middle piece of data when the data is in numerical order.->With 50 pieces of data, even, we must find halfway and the next value. In this case, the 25th and 26th values. The median will be halfway between these values.    
The Mean   The mean of a set of data is the sum of all the values divided by the number of values. - Ex x=----- n    
Ex   just means the sum of all the x’s - for instance, add all the bits of data together.    
grouped frequency table->find MEAN   We can however, find an estimate of the mean by assuming each footballer is the height halfway within his interval    
(f)   means frequency    
"o- "definition   standart deviation->gives a measure of how the data is dispersed about the mean->the lower the standard deviation, the more compact our data is around the mean    
Formula standart deviation   square root of ((the sum of x2 - ((mean of x)squared)) divided by the number of units    
"o- 2" definition   The variance is the square of the standard deviation.    
The variance   is the square of the standard deviation.    
m-and-leaf diagram   presenting it in an easy and quick way to help spot patterns in the spread of data->They are best used when we have a relatively small set of data and want to find the median or quartiles    
Box-and-whisker plots (or boxplots)   These are very basic diagrams used to highlight the quartiles and median to give a quick and clear way of presenting the spread of the data.   1.The ‘box’ part is drawn from the lower quartile to the upper quartile. The median is then drawn within the box. 2.The ‘box’ shows the inter-quartile range, which houses the middle half of the data. 3.The ‘whiskers’ are then drawn to the lowest and hig  
Negatively skewed distribution:   There is a greater proportion of the data at the upper end.   (blank)  
Positively skewed distribution:   There is a greater proportion of the data at the lower end.   (blank)  
Outliers   Values of data are usually labelled as outliers if they are more than 1.5 times of the inter-quartile range from either quartile.   (blank)  
Histograms   Histograms are best used for large sets of data, especially when the data has been grouped into classes. They look a little similar to bar charts or frequency diagrams. ->In histograms, the frequency of the data is shown by the area of the bars and not ju   (blank)  
frequency density   The vertical axis of a histogram is labelled   frequency / class with  
Cumulative frequency   is kind of like a running total. We add each frequency to the ones before to get an ‘at least’ total.   (blank)  
cumulative frequency curve.   cumulative frequencies (‘at least’ totals) are plotted against the upper class boundaries to give us a cumulative frequency curve.   (blank)  
P(A)   The probability that an event, A, will happen is written as   (blank)  
complement of A   The probability that the event A, does not happen is called the complement of A and is written as A'   (blank)  
mutually exclusive   Two events are mutually exclusive if the event of one happening excludes the other from happening->they both cannot happen simultaneously->When a fair die is rolled find the probability of rolling a 4 or a 1. P(4 u 1) = P(4) + P(1)=>1/6 +1/6=>1/3   Exclusive events will involve the words ‘or’, ‘either’ or something which implies ‘or’.->Remember ‘OR’ means ‘add’. P(A or B) = P(A) + P(B) P(A u B) = P(A) + P(B) P(A u B u C) = P(A) + P(B) + P(C)  
Independent Events   Two events are independent if the occurrence of one happening does not affect the occurrence of the other.->P(A and B) = P(A) ' P(B) ->P(A n B) = P(A) ' P(B) Independent events will involve ‘and’, ‘both’,"either"->means multiply   A coin is flipped at the same time as a dice is rolled. Find the probability of obtaining a head and a 5.->P(H n 5)=P(H)'P(5)=> 1/2 x 1/6=> 1/12  
How do you write Find the probability that given he falls P(F) it was a rainy day P(R).   P(R I F)   P(R n F) / P(F)  
discrete random variable   A random variable is a variable which takes numerical values and whose value depends on the outcome of an experiment. It is discrete if it can only take certain values.   Capital letters are used to denote the random variables, whereas lower case letters are used to denote the values that can be obtained.  
random variable   is a variable which takes numerical values and whose value depends on the outcome of an experiment   (blank)  
exclusive events Rewrite -> Sum?   E P(X = x) = 1 -> always sum to 1   (blank)  
Probability density function   Sometimes we are given a formula to calculate probabilities. We call this the probability density function of X or the p.d.f. of X.   (blank)  
Cumulative distribution function   ‘Cumulative’ gives us a kind of running total so a cumulative distribution function gives us a running total of probabilities within our probability table. The cumulative distribution function, F(x) of X is defined as: F(x) = P(X < x)   (blank)  
Expectation   The expectation is the expected value of X, written as E(X) or sometimes as u->The expectation is what you would expect to get if you were to carry out the experiment a large number of times and calculate the ‘mean’..   E(X) = € xP(X = x) -> You multiply each value of x with its corresponding probability. If we then add all these up we obtain the expectation of X. This is best seen in an example.  
uniform distribution   This is a ‘special’ discrete random variable as all the probabilities are the same.->it is possible to calculate the expectation by using the symmetry of the table. The expectation, E(X) is calculated by finding the halfway point.   (blank)  
symmetry of the table   With uniform distributions it is possible to calculate the expectation by using the symmetry of the table. The expectation, E(X) is calculated by finding the halfway point.   (blank)  
Expectation of any function of x   E[f(x)] = € f(x)P(X = x)   (blank)  
E(aX + b) Equals   aE(X) + b   (blank)  
E(a) Equals   a   (blank)  
variance   is a measure of how spread out the values of X would be if the experiment leading to X were repeated a number of times.   (blank)  
E(X)   -> mean -> u -> Example of Calculation->(0 x 0.1) + (1 x 0.2) + (2 x 0.5) + (3 x 0.2)   (blank)  
Var(aX) Equals   a2Var(X)   Var(2X) = 22 x Var(X) = 4 x 2.5 = 10 Var(4X – 3) = 42 x Var(X) = 16 x 2.5 = 40  
Var(aX + b) Equals   a2Var(X) This means by knowing just the variance, Var(X), we can calculate other variances quickly. Example:   Var(2X) = 22 x Var(X) = 4 x 2.5 = 10 Var(4X – 3) = 42 x Var(X) = 16 x 2.5 = 40  
The Standard Deviation   The square root of the Variance is called the Standard Deviation of X. standard deviation is given the symbol o-   (blank)  
convert any normal distribution of X into the normal distribution of Z   (X - u) / o-   (blank)  
Normal Distribution Graph   much of the data is gathered around the mean. The distribution has a characteristic ‘bell shape’ symmetrical about the mean. ->The area of the bell shape = 1.   (blank)  
The standard deviation   is an important measure of the spread of our data. The greater the standard deviation, the greater our spread of data.   (blank)  
§   this Greek letter just describes the area under the bell from that point!   (blank)  
line of best fit’   Any line of best fit must go through the mean of x, and the mean of y.   (blank)  
linear correlation   If all (or nearly all) of these points seem to lie in a straight line   (blank)  
Equation of regression line   (blank)   (blank)  
Regression Line x on y->Formula for b:   Sxy / Syy   (blank)  
Regression Line y on x->Formula for b:   Sxy / Sxx   (blank)  
Independent/dependent variables   With the above data, x looks to be controlled, where y appears to be dependent on an experiment and x. In this case, we say that x is an independent variable and y a dependent variable. As x appears controlled and accurate we only need to calculate the re   (blank)  
product moment correlation coefficient   r -> is a measure of the degree of scatter.->will lie between -1 and 1.   (blank)  
"r"   The product moment correlation coefficient, r, is a measure of the degree of scatter.->will lie between -1 and 1.   (blank)  
Calculate E(X)   €x times P(X = x) / or € f(x)P(X = x)   (blank)  


   


 

 

 
Follow us on Twitter
Be a StudyStack fan on Facebook
www.eapps.com




Copyright ©2001-2010 John Weidner All rights reserved.
About -  Terms of Service -  Privacy Statement